Massive MIMO Systems, Paper 2020
Massive MIMO Systems, Paper 2020
Systems
Edited by
www.mdpi.com/journal/electronics
Massive MIMO Systems
Massive MIMO Systems
MDPI • Basel • Beijing • Wuhan • Barcelona • Belgrade • Manchester • Tokyo • Cluj • Tianjin
Special Issue Editors
Kazuki Maruta Francisco Falcone
Chiba University Public University of Navarre
Japan Spain
Editorial Office
MDPI
St. Alban-Anlage 66
4052 Basel, Switzerland
This is a reprint of articles from the Special Issue published online in the open access journal Electronics
(ISSN 2079-9292) (available at: https://www.mdpi.com/journal/electronics/special issues/mimo).
For citation purposes, cite each article independently as indicated on the article page online and as
indicated below:
LastName, A.A.; LastName, B.B.; LastName, C.C. Article Title. Journal Name Year, Article Number,
Page Range.
c 2020 by the authors. Articles in this book are Open Access and distributed under the Creative
Commons Attribution (CC BY) license, which allows users to download, copy and build upon
published articles, as long as the author and publisher are properly credited, which ensures maximum
dissemination and a wider impact of our publications.
The book as a whole is distributed by MDPI under the terms and conditions of the Creative Commons
license CC BY-NC-ND.
Contents
Qian Lv, Jiamin Li, Pengcheng Zhu, Dongming Wang and Xiaohu You
Downlink Spectral Efficiency Analysis in Distributed Massive MIMO with Phase Noise
Reprinted from: Electronics 2018, 7, 317, doi:10.3390/electronics7110317 . . . . . . . . . . . . . . . 5
Jiamin Li, Qian Lv, Jing Yang, Pengcheng Zhu and Xiaohu You
Spectral and Energy Efficiency of Distributed Massive MIMO with Low-Resolution ADC
Reprinted from: Electronics 2018, 7, 391, doi:10.3390/electronics7120391 . . . . . . . . . . . . . . . 21
Omar A. Saraereh, Imran Khan, Byung Moo Lee and Ashraf Tahat
Efficient Pilot Decontamination Schemes in 5G Massive MIMO Systems
Reprinted from: Electronics 2019, 8, 55, doi:10.3390/electronics8010055 . . . . . . . . . . . . . . . 51
Wei Lu, Yongliang Wang, Xiaoqiao Wen, Shixin Peng and Liang Zhong
Downlink Channel Estimation in Massive Multiple-Input Multiple-Output with Correlated
Sparsity by Overcomplete Dictionary and Bayesian Inference
Reprinted from: Electronics 2019, 8, 473, doi:10.3390/electronics8050473 . . . . . . . . . . . . . . . 97
Mostafa Hefnawi
Hybrid Beamforming for Millimeter-Wave Heterogeneous Networks
Reprinted from: Electronics 2019, 8, 133, doi:10.3390/electronics8020133 . . . . . . . . . . . . . . . 153
v
Hedi Khammari, Irfan Ahmed, Ghulam Bhatti and Masoud Alajmi
Spatio-Radio Resource Management and Hybrid Beamforming for Limited Feedback
Massive MIMO Systems
Reprinted from: Electronics 2019, 8, 1061, doi:10.3390/electronics8101061 . . . . . . . . . . . . . . 163
Fumiya Muramatsu, Kentaro Nishimori, Ryotaro Taniguchi, Takefumi Hiraguri and Tsutomu
Mitsui
Evaluation of Multi-Beam Massive MIMO Considering MAC Layer Using IEEE802.11ac
and FDD-LTE
Reprinted from: Electronics 2019, 8, 225, doi:10.3390/electronics8020225 . . . . . . . . . . . . . . . 189
Imran Khan, Mohammad Haseeb Zafar, Majid Ashraf and Sunghwan Kim
Computationally Efficient Channel Estimation in 5G Massive Multiple-Input
Multiple-output Systems
Reprinted from: Electronics 2018, 7, 382, doi:10.3390/electronics7120382 . . . . . . . . . . . . . . . 201
Wenjin Wang, Yufei Huang, Li You, Jiayuan Xiong, Jiamin Li and Xiqi Gao
Energy Efficiency Optimization for Massive MIMO Non-Orthogonal Unicast and Multicast
Transmission with Statistical CSI
Reprinted from: Electronics 2019, 8, 857, doi:10.3390/electronics8080857 . . . . . . . . . . . . . . . 227
Mudasar Latif Memon, Navrati Saxena, Abhishek Roy and Dong Ryeol Shin
Artificial Intelligence-Based Discontinuous Reception for Energy Saving in 5G Networks
Reprinted from: Electronics 2019, 8, 778, doi:10.3390/electronics8070778 . . . . . . . . . . . . . . . 243
Oluwole J. Famoriji, Zhongxiang Zhang, Akinwale Fadamiro, Rabiu Zakariyya and Fujiang
Lin
Planar Array Diagnostic Tool for Millimeter-Wave Wireless Communication Systems
Reprinted from: Electronics 2018, 7, 383, doi:10.3390/electronics7120383 . . . . . . . . . . . . . . . 263
vi
About the Special Issue Editors
Kazuki Maruta completed his Bachelor of Engineering and Master of Engineering degrees, as
well as his PhD, at Kyushu University, Japan, in 2006, 2008 and 2016, respectively. From 2008 to 2017,
he worked on the research and development of interference compensation techniques for future
wireless communication systems at NTT Access Network Service Systems Laboratories. From 2017
to 2020, he was an Assistant Professor at the Graduate School of Engineering, Chiba University. He is
currently a Specially Appointed Associate Professor at the Academy for Super Smart Society, Tokyo
Institute of Technology. His research interests include MIMO, adaptive array signal processing,
channel estimation, medium access control protocols and moving networks. He is a member of IEEE
and IEICE. He won the IEICE Young Researcher’s Award in 2012, the IEICE Radio Communication
Systems (RCS) Active Researcher Award in 2014, the Asia-Pacific Microwave Conference (APMC)
Prize in 2014, and the IEICE RCS Outstanding Researcher Award in 2018. He was also co-recipient of
the IEICE Best Paper Award in 2018.
Francisco Falcone completed his degree in telecommunication engineering and his PhD in
communication engineering at the Public University of Navarra (UPNA), Spain, in 1999 and 2005,
respectively. From 1999 to 2000, he was a Microwave Commissioning Engineer with Siemens-Italtel,
where he deployed microwave access systems. From 2000 to 2008, he was a Radio Access Engineer
at Telefónica Móviles, performing radio network planning and optimization tasks in mobile network
deployment. In 2009, he was a co-founding member and the Director of Tafco Metawireless, a spin-off
company from UPNA. He was an Assistant Lecturer at the Electrical and Electronic Engineering
Department, UPNA, from 2003 to 2009. In 2009, he became an Associate Professor with the EE
Department, and was the Head of Department from 2012 to 2018. From January 2018 to May 2018,
he was a Visiting Professor at the Kuwait College of Science and Technology, Kuwait. He is also
affiliated with the Institute for Smart Cities (ISC), UPNA, which hosts around 140 researchers, and is
the Head of the ICT section. He has more than 500 contributions in indexed international journals,
book chapters and conference contributions. His research interests are related to computational
electromagnetics applied to the analysis of complex electromagnetic scenarios, with a focus on the
analysis, design and implementation of heterogeneous wireless networks to enable context aware
environments. Prof. Falcone was a recipient of the CST 2003 and CST 2005 Best Paper Award, the
PhD Award from the Colegio Oficial de Ingenieros de Telecomunicación (COIT) in 2006, the Doctoral
Award UPNA, 2010, the 1st Juan Gomez Peñalver Research Award from the Royal Academy of
Engineering of Spain in 2010, the XII Talgo Innovation Award 2012, the IEEE 2014 Best Paper Award,
2014, the ECSA-3 Best Paper Award, 2016, and the ECSA-4 Best Paper Award, 2017. He is a Senior
Member of the IEEE.
vii
electronics
Editorial
Massive MIMO Systems: Present and Future
Kazuki Maruta 1, * and Francisco Falcone 2
1 Graduate School of Engineering, Chiba University, 1–33 Yayoi-cho, Inage-ku, Chiba-shi 263-8522, Japan
2 Department of Electrical, Electronic and Communication Engineering & Institute for Smart Cities,
Public University of Navarre, 31006 Pamplona, Spain; [email protected]
* Correspondence: [email protected]
1. Introduction
We are going to see the first decade since the fundamental concept of massive multiple-input
multiple-output (MIMO) (also called large-scale MIMO) has emerged [1]. Massive MIMO is expected
to be one of the most promising technologies towards the fifth generation mobile communications (5G)
and beyond. Implementation [2,3] and trials [4,5] are actively proceeded. Especially, massive array
beamforming has a good match to millimeter wave communication [6] which suffers from link
budget shortfall due to its high frequency. Further, thanks to its excessive degree of freedom
(DoF), massive MIMO has unlimited potentiality to further enhance system capabilities [7] and still
expands various research topics with depth. It should be further discussed and believed to break
limitations in wireless communications such as spectral and energy efficiencies for better support of
continuously increasing mobile data traffic, as well as terminals driven by Internet of things (IoT).
The key contribution of this special issue is to provide readers with new insights and facilitate plentiful
discussions in this field.
3. Future
Now discussions towards 6G has started. Massive MIMO is still expected as a promising
contributor for 6G [27–29], e.g., referred as ‘ultra massive MIMO’. Its potentiality will be truly realized
through relentless effort on R&D including the advance of hardware performance. Variety of massive
MIMO technologies, which were widely addressed in this special issue, could be one of the most
important solutions to bring a breakthrough towards beyond 5G or 6G.
Author Contributions: K.M. and F.F. worked together in the whole editorial process of the special issue, ‘Massive
MIMO Systems’ published by journal Electronics. K.M. drafted this editorial summary. K.M. and F.F. reviewed,
edited and finalized the manuscript. All authors have read and agree to the published version of the manuscript.
Acknowledgments: First of all we would like to thank all researchers who submitted articles to this special issue
for their excellent contributions. We are also grateful to all reviewers who contributed evaluations of scientific
merits and quality of the manuscripts and provided countless valuable suggestions to improve their quality and
the overall value for the scientific community. We would like to acknowledge the editorial board of Electronics
journal, who invited us to guest edit this special issue. We are also grateful to the Electronics Editorial Office staff
who worked thoroughly to maintain the rigorous peer-review schedule and timely publication.
Conflicts of Interest: The authors declare no conflicts of interest.
2
Electronics 2020, 9, 385
Abbreviations
The following abbreviations are used in this manuscript:
References
1. Marzetta, T.L. Noncooperative Cellular Wireless with Unlimited Numbers of Base Station Antennas.
IEEE Trans. Wirel. Commun. 2010, 9, 3590–3600. [CrossRef]
2. Malkowsky, S.; Vieira, J.; Liu, L.; Harris, P.; Nieman, K.; Kundargi, N.; Wong, I.C.; Tufvesson, F.; Öwall, V.;
Edfors, O. The World’s First Real-Time Testbed for Massive MIMO: Design, Implementation, and Validation.
IEEE Access 2017, 5, 9073–9088. [CrossRef]
3. Wirth, T.; Mehlhose, M.; Thiele, L.; Haustein, T. Proof-of-concept of flexible massive MIMO beamforming
at 2.4 GHz. In Proceedings of the 2017 51st Asilomar Conference on Signals, Systems, and Computers,
Pacific Grove, CA, USA, 29 October–1 November 2017; pp. 1061–1065. [CrossRef]
4. Obara, T.; Okuyama, T.; Inoue, Y.; Aoki, Y.; Suyama, S.; Lee, J.; Okumura, Y. Experimental Trial of 5G Super
Wideband Wireless Systems Using Massive MIMO Beamforming and Beam Tracking Control in 28GHz
Band. IEICE Trans. Commun. 2017, 100, 1256–1268. [CrossRef]
5. Sakai, M.; Kamohara, K.; Iura, H.; Nishimoto, H.; Ishioka, K.; Murata, Y.; Yamamoto, M.; Okazaki, A.;
Nonaka, N.; Suyama, S.; et al. Experimental Field Trials on MU-MIMO Transmission for High SHF
Wide-band Massive MIMO in 5G. IEEE Trans. Wirel. Commun. 2020. [CrossRef]
6. Wu, W.; Liu, D. Hybrid BD-GMD Precoding for Multiuser Millimeter-Wave Massive MIMO Systems.
IEICE Trans. Commun. 2018. [CrossRef]
7. Maruta, K.; Ahn, C.J. Uplink Interference Suppression by Semi-Blind Adaptive Array With Decision Feedback
Channel Estimation on Multicell Massive MIMO Systems. IEEE Trans. Commun. 2018, 66, 6123–6134.
[CrossRef]
8. Lv, Q.; Li, J.; Zhu, P.; Wang, D.; You, X. Downlink Spectral Efficiency Analysis in Distributed Massive MIMO
with Phase Noise. Electronics 2018, 7, 317. [CrossRef]
9. Li, J.; Lv, Q.; Yang, J.; Zhu, P.; You, X. Spectral and Energy Efficiency of Distributed Massive MIMO with
Low-Resolution ADC. Electronics 2018, 7, 391. [CrossRef]
10. Yu, S.; Lee, J.W. Channel Sounding for Multi-User Massive MIMO in Distributed Antenna System
Environment. Electronics 2019, 8, 36. [CrossRef]
3
Electronics 2020, 9, 385
11. Saraereh, O.A.; Khan, I.; Lee, B.M.; Tahat, A. Efficient Pilot Decontamination Schemes in 5G Massive MIMO
Systems. Electronics 2019, 8, 55. [CrossRef]
12. Al-hubaishi, A.S.; Noordin, N.; Sali, A.; Subramaniam, S.; Mohammed Mansoor, A. An Efficient Pilot
Assignment Scheme for Addressing Pilot Contamination in Multicell Massive MIMO Systems. Electronics
2019, 8, 372. [CrossRef]
13. Lu, W.; Wang, Y.; Wen, X.; Peng, S.; Zhong, L. Downlink Channel Estimation in Massive Multiple-Input
Multiple-Output with Correlated Sparsity by Overcomplete Dictionary and Bayesian Inference. Electronics
2019, 8, 473. [CrossRef]
14. Dao, H.T.; Kim, S. Multiple-Symbol Non-Coherent Detection for Differential QAM Modulation in Uplink
Massive MIMO Systems. Electronics 2019, 8, 693. [CrossRef]
15. Ha, J.G.; Ro, J.H.; Song, H.K. Throughput Enhancement in Downlink MU-MIMO Using Multiple Dimensions.
Electronics 2019, 8, 758. [CrossRef]
16. Yan, Q.; Sun, Y.; Yue, D.W. LOS-Based Equal Gain Transmission and Combining in General
Frequency-Selective Ricean Massive MIMO Channels. Electronics 2019, 8, 79. [CrossRef]
17. Hefnawi, M. Hybrid Beamforming for Millimeter-Wave Heterogeneous Networks. Electronics 2019, 8, 133.
[CrossRef]
18. Khammari, H.; Ahmed, I.; Bhatti, G.; Alajmi, M. Spatio-Radio Resource Management and Hybrid
Beamforming for Limited Feedback Massive MIMO Systems. Electronics 2019, 8, 1061. [CrossRef]
19. Muramatsu, F.; Nishimori, K.; Taniguchi, R.; Hiraguri, T.; Mitsui, T. Evaluation of Multi-Beam Massive
MIMO Considering MAC Layer Using IEEE802.11ac and FDD-LTE. Electronics 2019, 8, 225. [CrossRef]
20. Khan, I.; Zafar, M.; Ashraf, M.; Kim, S. Computationally Efficient Channel Estimation in 5G Massive
Multiple-Input Multiple-Output Systems. Electronics 2018, 7, 382. [CrossRef]
21. Feng, H.; Zhao, X.; Li, Z.; Xing, S. A Novel Iterative Discrete Estimation Algorithm for Low-Complexity
Signal Detection in Uplink Massive MIMO Systems. Electronics 2019, 8, 980. [CrossRef]
22. Wang, W.; Huang, Y.; You, L.; Xiong, J.; Li, J.; Gao, X. Energy Efficiency Optimization for Massive MIMO
Non-Orthogonal Unicast and Multicast Transmission with Statistical CSI. Electronics 2019, 8, 857. [CrossRef]
23. Memon, M.L.; Maheshwari, M.K.; Saxena, N.; Roy, A.; Shin, D.R. Artificial Intelligence-Based Discontinuous
Reception for Energy Saving in 5G Networks. Electronics 2019, 8, 778. [CrossRef]
24. Famoriji, O.; Zhang, Z.; Fadamiro, A.; Zakariyya, R.; Lin, F. Planar Array Diagnostic Tool for Millimeter-Wave
Wireless Communication Systems. Electronics 2018, 7, 383. [CrossRef]
25. Ojaroudi Parchin, N.; Jahanbakhsh Basherlou, H.; Alibakhshikenari, M.; Ojaroudi Parchin, Y.; Al-Yasir, Y.I.A.;
Abd-Alhameed, R.A.; Limiti, E. Mobile-Phone Antenna Array with Diamond-Ring Slot Elements for 5G
Massive MIMO Systems. Electronics 2019, 8, 521. [CrossRef]
26. Alibakhshikenari, M.; Virdee, B.S.; See, C.H.; Abd-Alhameed, R.A.; Falcone, F.; Limiti, E. High-Isolation
Leaky-Wave Array Antenna Based on CRLH-Metamaterial Implemented on SIW with ±30◦ Frequency
Beam-Scanning Capability at Millimetre-Waves. Electronics 2019, 8, 642. [CrossRef]
27. Yang, P.; Xiao, Y.; Xiao, M.; Li, S. 6G Wireless Communications: Vision and Potential Techniques. IEEE Netw.
2019, 33, 70–75. [CrossRef]
28. Bi, Q. Ten Trends in the Cellular Industry and an Outlook on 6G. IEEE Commun. Mag. 2019, 57, 31–36.
29. Zhang, Z.; Xiao, Y.; Ma, Z.; Xiao, M.; Ding, Z.; Lei, X.; Karagiannidis, G.K.; Fan, P. 6G Wireless Networks:
Vision, Requirements, Architecture, and Key Technologies. IEEE Veh. Technol. Mag. 2019, 14, 28–41.
c 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access
article distributed under the terms and conditions of the Creative Commons Attribution
(CC BY) license (http://creativecommons.org/licenses/by/4.0/).
4
electronics
Article
Downlink Spectral Efficiency Analysis in Distributed
Massive MIMO with Phase Noise
Qian Lv, Jiamin Li *, Pengcheng Zhu, Dongming Wang and Xiaohu You
National Mobile Communications Research Laboratory, Southeast University, Nanjing 210096, China;
[email protected] (Q.L.); [email protected] (P.Z.); [email protected] (D.W.); [email protected] (X.Y.)
* Correspondence: [email protected]; Tel.: +86-025-5209-1635
Keywords: distributed massive MIMO; phase noise; amplified thermal noise; spectral efficiency
1. Introduction
Massive multiple-input multiple-output (MIMO) is becoming a promising technology to provide
significant gains [1–6]. Since it was first proposed, massive MIMO has been studied extensively.
The main feature of massive MIMO is that hundreds (or even thousands) of antennas are employed
at each base station, simultaneously serving tens of users in the same time-frequency resource,
which offers big advantages compared to conventional MIMO. Firstly, it can bring unprecedented
spatial degrees-of-freedom, which enables the improvement of spectral efficiency and energy efficiency
even with simple linear receivers or precoders [7]. In addition, user channels in massive MIMO systems
are nearly orthogonal and fast fading, intra-cell interference can be averaged out. Massive MIMO can
be divided into two categories: one is co-located massive MIMO and the other is distributed massive
MIMO [8]. The latter has promising advantages of increasing energy efficiency, system coverage
and spectral efficiency, which results from the increase in macro-diversity gain and the reduction in
access distance [9–15]. Considering these advantages, we analyze the spectral efficiency of distributed
massive MIMO in this paper. Notably, due to the different access distance between each user and
all remote antenna units (RAUs), the channel vectors are non-isotropic, which makes the analysis of
performance in distributed massive MIMO more difficult and more complex.
In practical communication systems, inevitable hardware impairments occur and cannot be
eliminated even after applying calibration and compensation techniques [16,17]. These impairments
can be divided into two categories: multiplicative distortion and additive distortion. Phase noise
introduced by the local oscillators of transceivers is the multiplicative distortion. It will cause random
rotations of the transmitted data symbols, which degrades the system performance. Furthermore,
phase noise makes the estimated channel state information (CSI) more inaccurate and it introduces
a phenomenon called channel aging which means the estimated CSI obtained during pilot phase
is different from that used for downlink transmission. It is pointed out in [8] that the deployment
cost and circuit power consumption of massive MIMO scale linearly with the number of antennas.
Therefore inexpensive but hardware-constraint hardware may be deployed for the reason of cost,
which makes the hardware impairments more severe in massive MIMO.
Analyzing the spectral efficiency is a fundamental method to evaluate the impacts of phase
noise. The impacts of phase noise for uplink transmission have been studied in [18–21] and for
downlink transmission were investigated in [22–24]. The impacts of phase noise on physical layer
security for downlink massive MIMO were investigate in [22]. The achievable rate was derived in [23]
considering the frequency-selective multipath fading channel. The capacity of downlink transmission
with linear precoders was analyzed in [24] but it assumed that the number of antennas and users was
asymptotically large and it only considered a co-located MIMO system.
Herein, considering a distributed massive MIMO with phase noise and amplified thermal noise,
we analyzed the downlink spectral efficiency for any number of antennas and users. Followings are
the key contributions of this paper:
1. In distributed massive MIMO, the channel vectors are non-isotropic and the correlation between
channel vectors and intended beams for each user are destroyed by phase noise. Hence, we first
obtain the distributions of the desired signal and interference powers, which is challenging
and complex.
2. Considering both zero-forcing (ZF) and maximum ratio transmission (MRT) precoders, we obtain
the closed-form expressions of the downlink ergodic achievable rates with imperfect CSI and
hardware impairments in distributed massive MIMO. These closed-form expressions are accurate
for any number of antennas and users in both distributed massive MIMO and co-locate massive
MIMO. Furthermore, they are derived under imperfect CSI which is more realistic, and they
enable the analysis of performance degradation caused by phase noise.
3. The ultimate achievable rate per user is obtained when the number of antennas per remote
antenna unit (RAU) goes infinity. It can be used to investigate the asymptotic performance of
distributed massive MIMO with hardware impairments.
4. The theoretical results are verified by Monte Carlo simulations, and we have a deep insight into
the impacts of phase noise.
The rest of this paper is organized as follows. System model including system configuration,
a model describing phase noise and an effective channel model is introduced in Section 2. We obtain
the estimated CSI during the uplink pilot training phase and analyze the spectral efficiency with linear
precoders in Section 3. Numerical results are given in Section 4. A conclusion is provided in Section 5.
Notation: Column vectors x and matrices X are denoted by bold letters in lower case and in
upper case, respectively. I N is a N × N identity matrix. (·)H and (·)T are the conjugate transpose and
transpose operator, respectively. Scalars x are denoted by italic letters. | x | represents the absolute
value of x and X denotes the spectral norm of X. E[·] and var(·) represent the expectation operator
and variance operator, respectively. CN (0, σ2 ) represents circularly symmetric complex Gaussian
distribution with mean zero and variance σ2 . Γ(k, θ ) means Gamma distribution with shape parameter
k and scale parameter θ. Similarly, Nakagami(m, Ω) means Nakagami distribution with shape
parameter m and controlling spread parameter Ω.
2. System Model
Considering a distributed massive MIMO system, we first describe the system configuration and
give the conventional channel model. Next, we present a model describing phase noise and give an
effective channel model incorporating the impacts of phase noise.
6
Electronics 2018, 7, 317
$QWHQQD
$QWHQQD
8VHUN
5$8P
$QWHQQD1
H MMN W
8VHUN
H MIP W
5$8P
Frequency-flat fading channels are assumed and the system runs in time-division duplex (TDD)
protocol. The channel vector between all RAUs and the k-th user is given by
Δ
g̃k = g̃k1 · · · g̃kMN = Λ1/2
k hk , (1)
Δ −α
where Λk = E g̃k g̃Hk = diag(λ1,k · · · λ M,k ) ⊗ I N is the covariance matrix, λm,k = cdm,k denotes
the path loss between the m-th RAU and the k-th user, dm,k is the corresponding distance, α is the
path loss exponent, c is the median of the mean path gain at a reference distance dm,k = 1 km,
and hk ∼ CN (0, I MN ) is the small-scale fast fading vector.
In this paper, we consider a more realistic scenario where the antennas deployed at each
RAU are inexpensive and hardware-constrained. Specifically, each antenna experiences phase noise
which distorts communication. The phase noise means the multiplicative phase drift in the signal,
which comes from the local oscillators (LOs) of the RAUs and users. We assume that the LOs are
free-running without a phase-locked loop (PLL), and then the phase noise is commonly modeled as a
discrete-time independent Wiener process [8,25]. Mathematically, the phase noises at the LOs of the
n-th antenna and the k-th user are denoted as
φn (t) ∼ N φn (t − 1) , σφ,n 2
, (2)
ϕk (t) ∼ N ϕk (t − 1) , σϕ,k 2
, (3)
which equal the previous realization φn (t − 1) and ϕk (t − 1) plus an independent zero-mean Gaussian
random increment with variances σφ,n 2 and σ2 . The variances are dependent on the carrier frequency
ϕ,k
and symbol time [25].
The phase noise can be independent or correlated between antennas of each RAU. In our analysis,
we have assumed that the phase noise correlated between antennas of one RAU and independent
among RAUs. Then by expressing the total phase noise as a multiplicative factor, we can rewrite the
channel vector as
gk (t) =Θk (t) g̃k , (4)
1
Δ
where Θk (t) = diag e jθk (t) , · · · , e jθk (t) = e jϕk (t) Φ (t) ∈ C MN × MN is the total phase noise, wherein
MN
Δ
Φ (t) = diag(e jφ1 (t) , · · · , e jφM (t) ) ⊗ I N is the phase noise induced by all RAUs, and similarly, e jϕk (t)
corresponds to the phase drift pruduced by the k-th user. Notably, because of the presence of phase
noise, the effective channel becomes time-dependent.
7
Electronics 2018, 7, 317
2 = σ2 = 0, ∀ n, k.
Remark 1. The conventional channel model without phase noise is obtained when σφ,n ϕ,k
where
σ2 + σ2 2 + σ2
Δ φ,1 ϕ,j
σφ,M
ϕ,j
Bj u,v
= Λ j x j (u) x ∗j (v)diag e− 2 |u−v|
, · · · , e− 2 |u−v|
⊗ IN .
The pilot sequences can be designed in different ways. Without loss of generation, in this paper
we assume that the number of pilot symbols is equal to that of users, i.e., τ = K. More specifically,
Δ
we assume that the set of orthogonal pilot sequences XP = [x1 , · · · , xK ] is a diagonal matrix and each
√
element of it is ρp , wherein ρp is the average transmit power of pilot symbols. This is equivalent to
the assumption made in [18,20].
8
Electronics 2018, 7, 317
σ2 +σφ,m
2
|t−k| √ ρ
ϕ,k
e− 2 p λm,k
β m,k (t) = , (7)
ρp λm,k +ξ BS
where β m,k (t) is the equivalent large-scale fading part from user k to RAU m and
ĥk = [ĥT1,k , · · · , ĥTM,k ]T = Σ−1/2 Ψ ∼ CN (0, I MN ) represents the equivalent small-scale fast fading part.
Because of the orthogonality principle of LMMSE estimation theory, the channel vector gk (t) can
be decomposed as
gk (t) = ĝk (t) + ek (t), (9)
where ek (t) is the uncorrelated and statistically independent of ĝk (t) estimation error.
During the pilot transmission phase, we obtain the estimated channel showing in (8). In our
analysis, it is assumed that the beamforming vector is designed by using the estimated CSI once
during the pilot transmission phase and then is applied for the entire duration of the downlink
transmission phase.
K
x= ∑ l =1 w l s l , (11)
where sl ∼ CN (0, 1) is the transmitted data symbol assigned for user l, and wl is the beamforming
vector designed at time slot τ. MRT and ZF precoders are considered in our analysis. Mathematically,
these two linear precoders can be defined as
⎧
⎨ ĝl (τ )
ĝl (τ )
MRT,
wl = al (τ ) (12)
⎩ ZF,
al (τ )
−1
where al (τ ) is the l-th column of Ĝ (τ ) ĜH (τ ) Ĝ (τ ) , wherein Ĝ (τ ) = [ĝ1 (τ ), · · · , ĝK (τ )].
Considering (4), we can rewrite (10) as
√
rk (t) = k ( τ ) Θ̃k ( t ) x + n
ρdl gH UE
(t) , (13)
where
Θ̃k (t) = diag e− j(θk (t)−θk (τ )) , · · · , e− j(θk (t)−θk (τ )) .
Δ 1 1 MN MN
(14)
9
Electronics 2018, 7, 317
It is assumed that users have the statistical properties of the channel and they don’t carry out
channel estimation. So only statical CSI can be used by downlink users to detect the signal. Motivated
by [27], we rewrite the received data as
√
rk (t) = ρdl E gH
k ( τ ) Θ̃k ( t ) wk sk + n , (15)
where
√ √ K
n = ρdl gH
k ( τ ) Θ̃k ( t ) wk − E gk ( τ ) Θ̃k ( t ) wk
H
sk + ρdl ∑ gHk (τ ) Θ̃k (t) wi si + nUE (t) .
i =k
Suppose E gH k ( τ ) Θ̃k ( t ) wk sk is the only signal needed at user k, and treating n as unrelated
Gaussian distributed additive noise [28,29], the achievable downlink rate of user k is denoted as
2
ρdl E gH
k ( τ ) Θ̃k ( t ) wk
Rk = log2 1+ , (16)
A(t) + B(t) + ξ UE
where
A(t) = ρdl var gHk ( τ ) Θ̃k ( t ) wk ,
2
K
B(t) = ∑i =k ρdl E gH k ( τ ) Θ̃k ( t ) wi .
Lemma 1 ([28]). For an isotropic random vector x ∈ C N ×1 whose elements are independent and all distributed
as CN (0, σ2 ), then the distribution of xH x is Γ( N, σ2 ).
M
k ( t ) ĝk ( t ) =
ĝH ∑m=1 β2m,k (t)ĥHm,k ĥm,k , (17)
non-identically distributed but independent items. To obtain its distribution, Lemma 2 stated bellow
can be used.
Lemma 2 ([15]). If { xi } are a set of random variables and independent of each other, each term is distributed as
Γ(χi , θi ). Then the distribution of the sum ∑i xi can be approximated as ∑i Xi ∼ Γ(χ, θ ) wherein
( ∑ i χ i θ i )2 ∑ χθ2
χ= , θ= i i i . (18)
χ
∑i i i θ 2 ∑i χi θi
10
Electronics 2018, 7, 317
=1 β m,k ( t )
M 4
∑m
θk (t) = . (20)
=1 β m,k ( t )
M 2
∑m
k ( t ) ek ( t ) as Γ ( χe ( t ), θe ( t )), wherein
Similarly, we can also give the distribution of eH
2
=1 ηm,k ( t )
M 2
∑m
χe ( t ) = N , (21)
=1 ηm,k ( t )
M 4
∑m
=1 ηm,k ( t )
M 4
∑m
θe ( t ) = , (22)
=1 ηm,k ( t )
M 2
∑m
2 (t) = λ
where ηm,k m,k − β m,k ( t ).
2
Based on the lemmas and analysis above, we give the following lemma about the projection
principle of non-isotropic vectors.
Lemma 3 ([28]). When we project an MN-dimensional non-isotropic estimated channel vector ĝk ∈ C MN ×1
onto a p-dimensional subspace, we can give the approximated distribution of the projection power as
Γ( pχk /( MN ), θk ).
Remark 3. The dimension p can be given by p = MN with MRT precoder and p = MN − K + 1 with ZF
precoder, respectively, and for any independent beam, we can have p = 1 [30,31].
When MRT and ZF precoders are employed, based on the analysis above, we can give the
distribution of the signal power at user k and the distribution of the interference power at user k
2
H
ĝ (τ ) ĝk (τ ) ∼ Γ (χk (τ ), θk (τ )) , (23)
k ĝk (τ )
2
H
ĝ (τ ) ak (τ ) ∼ Γ MN −K +1 χk (τ ), θk (τ ) , (24)
k ak (τ ) MN
2
H 1
ek (τ )wi ∼ Γ χe ( τ ), θe ( τ ) . (25)
MN
Notably, wi in (25) can be either MRT precoder or ZF precoder and the equation still holds when
i = k, due to the independence of ek (t) and wi .
2
Based on the analysis above, we can give the approximated distribution of ĝH (τ ) Θ̃k (t) wk as
k
2
H
ĝ (τ ) Θ̃k (t) ĝk (τ ) ∼ Γ (χk (τ ), θk (τ )) , (26)
k ĝk (τ )
2
H
ĝ (τ ) Θ̃k (t) ak (t) ∼ Γ MN −K +1 χk (t), θk (t) . (27)
k ak (t) MN
Figure 2 verifies the accuracy of the approximation in (26) and (27). It illustrates the cumulative
2
distribution function (CDF) curves of ĝH
k ( τ ) Θ̃k ( t ) wk with MRT precoder. The phase noise variance
is set as σφ2 = σϕ2 = 10−2 . It can be seen that although the random variable Θ̃k (t) will destroy
the correlation of ĝk and wk , the approximation is exactly accurate when the variance of the phase
noise is 10−2 , and it will be more accurate when the variance of the phase noise is lower 10−2 .
11
Electronics 2018, 7, 317
For ZF precoder, we can get the same conclusion. Meanwhile, the phase noise variance is generally
σφ2 = σϕ2 = 1.58 × 10−4 [8,20,26]. Hence, it’s reasonable to use (26) and (27) to analyze the downlink
spectral efficiency.
Empirical CDF
1
0.9
0.8
Cumulative distribution function
0.7
0.6
0.5
0.4
0.3
0.2 simulation,M=5,N=1
Gamma approxiamtion,M=5,N=1
simulation,M=1,N=10
0.1
Gamma approxiamtion,M=1,N=10
0
0 1 2 3 4 5 6 7 8
10-10
Figure 2. Cumulative distribution function of signal power with MRT precoder under different M
and N.
Based on the lemmas above, the spectral efficiency for downlink transmission with both MRT and
ZF precoders under hardware impairments is analyzed. The theorems and corollary stated below give
the closed-form expressions of the downlink achievable rates and system asymptotic performance.
Theorem 1. When MRT precoder is used, the closed-form expression of the downlink achievable rate under
hardware impairments is given by
Dmrt (t)
Rmrt (t) = , (28)
Amrt (t) + Bmrt (t) + ξ UE /ρdl
where
⎛ ⎞2
Γ χk (t) + 12
Dmrt (t) = ⎝ ⎠ θ ( t ),
k
Γ χk (t)
2
− σϕ,k2 + σ2
φ,m | t − τ | 2
M
∑m =1 e β m,k ( t )
χk (t) = N ,
− 2 σ 2 + σ2 | t − τ |
M
∑m =1 e
ϕ,k φ,m
β4m,k (t)
−2 σϕ,k 2 + σ2
φ,m | t − τ | 4
M
∑m =1 e β m,k (t)
θk (t) = ,
− σ 2 + σ2
φ,m | t − τ | 2
M
∑m =1 e
ϕ,k
β m,k (t)
M 1 M
Amrt (t) = N ∑m=1 β2m,k (τ ) +
M ∑m=1 m,k
η 2 (τ ) − Dmrt (t),
=1 β m,i ( τ ) λm,k
M 2
K N ∑m
Bmrt (t) = ∑i =k θi (τ )(χi (τ ) − 1)
.
12
Electronics 2018, 7, 317
Theorem 2. When ZF precoder is used, the closed-form expression of the downlink achievable rate under
hardware impairments is given by
Dzf (t)
Rzf (t) = , (29)
Azf (t) + Bzf (t) + ξ UE /ρdl
where
⎛ ⎞2
Γ κ (t) + 12
D (t) = ⎝
zf ⎠ θ ( t ),
Γ (κ (t)) k
MN − K +1 M
Azf (t) =
M ∑m=1 β2m,k (τ ) − Dzf (t),
K M
M ∑m=1 m,k
Bzf (t) = η 2 ( τ ),
MN − K + 1
κ (t) = χ k ( t ).
MN
Then, in order to study the effects of phase noise further, we investigate a case where the
number of antennas employed at each RAU goes infinity and the number of RAUs and users is
fixed. The asymptotic performance provided in Corollary 1 is obtained based on (28) and (29).
Corollary 1. Let N → ∞, the ultimate rate of user k with both MRT and ZF precoders is given by
− σϕ,k
2 + σ2
φ,m | t − τ | 2
M
∑m =1 e β m,k (τ )
R∞
k (t) =
, (30)
− σ 2 + σ2
φ,m | t − τ | 2
=1 β m,k ( τ ) − ∑m=1 e β m,k (τ )
M 2 M
∑m ϕ,k
Proof. Since the proof is similar for both precoders, we only provide the proof for MRT precoder.
It can be seen that χk (t) → ∞ when the number of antennas N → ∞. Therefore we can have
2
Γ(χk (t)+ 12 )
lim N →∞ − χk (t) = 0 [29]. Then the limiting rate of user k can be obtained directly
Γ(χk (t))
by dividing the denominator and numerator of (28) by N. From Corollary 1 we can see that the
ultimate rate without phase noise will be unlimited when N tends to infinity, which means that phase
noise limits the downlink spectral efficiency.
4. Numerical Results
In this section, a series of Monte Carlo simulations is used to verify the theoretical results obtained
in Section 3. A circular single-cell massive MIMO system is considered. All of the RAUs and users are
randomly distributed in the cell and the minimum access distance between RAUs and users is set as
r0 = 30 m. The channels are generated by (4), and other simulation parameters are presented in Table 1.
Number of RAUs M 5
Number of users K 2
Cell radius R 1 km
Path loss exponent α 3.7
Power of uplink pilot symbol ρp 1 Watt
downlink transmit power ρdl 10 Watts
variance of thermal noise is assumed as σ2 −174 dBm
13
Electronics 2018, 7, 317
Figure 3 illustrates the theoretical and simulated spectral efficiency with MRT and ZF precoders
versus the number of antennas per RAU. The spectral efficiency is the average rate between users.
We assume that the variances of phase noise and amplified thermal noise are σφ,m 2 = σϕ,k
2 =
1.58 × 10−4 ∀m, k and ξ UE = ξ BS = 1.58σ2 . t is set as τ + 1. It can be seen that the closed-form
expressions in (28), (29) and the simulation results in (16) match well with each other. For both
precoders, the spectral efficiency increases and gets more and more close to the limiting average
rate with the increasing of N. When N = 100, the system achieves 80% of the ultimate rate with ZF
precoder and 76% with MRT precoder. Furthermore, it can be seen that ZF precoder achieves better
performance than MRT.
12
11
10
Spectral efficiency (bit/s/Hz)
Simulation-MRT
6 Theoretical-MRT
Simulation-ZF
Theoretical-ZF
5 Ultimate rate-MRT&ZF
4
10 20 30 40 50 60 70 80 90 100
M
Next, we investigate the effects of phase noise. Figure 4 illustrates the theoretical spectral efficiency
with MRT and ZF precoders against the variance of phase noise. Notably, the variance of phase noise
reflects the strength of phase noise. It is assumed that the number of antennas N = 50 and other system
parameters have the same value as Figure 3. Figure 4 reveals that the spectral efficiency decreases
monotonically with the variance of phase noise increasing. In addition, phase noise have a greater
impact on ZF precoder. This results form the fact that ZF precoder is more sensitive to CSI. It can be
noted that as the variance increases, the performance gap between MRT and ZF precoders becomes
smaller. This is because when the phase noise is severe, the loss caused by unknown CSI at user side
dominates rather than the interference between users.
7.5
MRT ZF
7
Spectral efficiency (bit/s/Hz)
6.5
5.5
5
0 0.001 0.002 0.003 0.004 0.005 0.006 0.007 0.008 0.009 0.01
Figure 4. Spectral efficiency against σφ2 = σϕ2 using MRT and ZF precoders with σφ,m
2 = σϕ,k
2 = σ2 =
φ
σϕ2 , ∀m, k.
14
Electronics 2018, 7, 317
Figure 5 illustrates the instantaneous spectral efficiency during the downlink transmission phase.
The number of antennas employed at each RAU is assumed as N = 40 and the variance of phase noise
is set as σφ,m
2 2 = 1.58 × 10−4 ∀ m, k. In addition, the coherence time of channel is set as T = 200.
= σϕ,k
As shown in Figure 5, the spectral efficiency degrades as t increases. This is because the uncertainty
of the phase drift between downlink transmission phase and pilot training phase increases with the
growing of t. Figure 5 reveals that it’s improper to use the estimated CSI obtained during the pilot
phase for the whole data transmission phase.
7.5
7 ZF MRT
6.5
Spectral Efficiency (bit/s/Hz)
5.5
4.5
3.5
0 20 40 60 80 100 120 140 160 180 200
5. Conclusions
In this paper, we analyzed downlink spectral efficiency with hardware impairments in distributed
massive MIMO. Initially, employing pilot symbol assisted transmission, we obtained the estimated
CSI in a more realistic scenario where transmission is distorted by phase noise and amplified thermal
noise. Next, we used the imperfect CSI to derive the closed-form expressions for downlink achievable
rates with MRT and ZF precoders. In addition, we obtained the ultimate rate when N → ∞. It can be
seen that the rate performance was limited by phase noise. Then, numerical results proved that the
theoretical analysis was accurate. Furthermore, they also revealed that ZF can achieve larger spectral
efficiency than MRT precoder, and hardware impairments had a greater impact on ZF precoder.
Finally, spectral efficiency degraded with the increasing of the variance of phase noise and downlink
transmission time.
In the future work, we intend to extend our research considering a more effective phase noise
model which could lead finer precoding strategies to improve the theoretical rates.
Author Contributions: Formal analysis, Q.L.; Supervision, J.L., P.Z., D.W. and X.Y.; Validation, Q.L.;
Writing—original draft, Q.L.; Writing—review & editing, Q.L. and J.L.
Funding: This work was supported in part by National Natural Science Foundation of China (NSFC)
(Grant NO. 61501113, 61571120, 61871122), Jiangsu Provincial Natural Science Foundation (Grant
No.BK20150630, BK20180011), six talent peaks project in Jiangsu province, and National Key Special Program
No.2018ZX03001008-002.
Conflicts of Interest: The authors declare no conflict of interest.
15
Electronics 2018, 7, 317
2
For the term E gH
k ( τ ) Θ̃k ( t ) wk
, we can obtain
H
E g (τ ) Θ̃k (t) wk 2
k
( a) 2
= E gH k ( τ ) E Θ̃k ( t ) wk
(b) H 2 (A1)
= E |ĝk (τ ) Dk,τ (t)wk |
2
(c) Γ(χk (τ )+ 12 )
= θ k ( τ ),
Γ(χk (τ ))
where ( a) is obtained because Θ̃k (t) is independent of gk (τ ) and wk . By exploiting the fact
σ2 +σφ,m
2
16
Electronics 2018, 7, 317
where ( a) results from Remark 2 and we omit (t) in (A5). Based on (A5), we have
2
E gH k ( τ ) Θ̃k ( t ) wi
2
( a) 1
E gH ( τ ) Θ̃ k ( t ) ĝ i ( τ ) E
k
ĝi (τ )2
(A6)
E ∑m M
=1 β m,i ( τ ) λm,k ĥm,k ĥm,k
2 H
=
θi (τ )(χi (τ ) − 1)
=1 β m,i ( τ ) λm,k
M 2
N ∑m
= ,
θi (τ )(χi (τ ) − 1)
where ( a) is obtained because Θ̃k (t) is independent of g(τ ) and wk . (b) is obtained due to the
independence of ek (τ ) and wk . (c) results from Lemma 2, Lemma 3 and the relationship between
Gamma distribution and Nakagami distribution.
Similar to the proof of Theorem 1, to get the closed-form expression of A(t), we need to calculate
the following two terms
H ak (τ ) 2
E ĝk (τ ) Θ̃k (t) , (A8)
ak (τ )
ak (τ ) 2
E eH
k ( τ ) Θ̃k ( t ) . (A9)
a (τ )
k
17
Electronics 2018, 7, 317
1 M
=
M ∑m=1 ηm,k
2
( τ ),
References
1. Wang, J.; Zhu, H.; Gomes, N.J.; Wang, J. Frequency Reuse of Beam Allocation for Multiuser Massive MIMO
Systems. IEEE Trans. Wirel. Commun. 2018, 17, 2346–2359. [CrossRef]
2. Wang, C.X.; Haider, F.; Gao, X.; You, X.H.; Yang, Y.; Yuan, D.; Aggoune, H.; Haas, H.; Fletcher, S.; Hepsaydir, E.
Cellular architecture and key technologies for 5G wireless communication networks. IEEE Commun. Mag.
2014, 52, 122–130. [CrossRef]
3. Jiang, F.; Chen, J.; Swindlehurst, A.L.; López-Salcedo, J.A. Massive MIMO for Wireless Sensing with a Coherent
Multiple Access Channel. IEEE Trans. Signal Process. 2015, 63, 3005–3017. [CrossRef]
4. Ciuonzo, D.; Rossi, P.S.; Dey, S. Massive MIMO Channel-Aware Decision Fusion. IEEE Trans. Signal Process.
2015, 63, 604–619. [CrossRef]
5. Ciuonzo, D.; Rossi, P.S.; Dey, S. Massive MIMO meets decision fusion: Decode-and-fuse vs. decode-then-fuse.
In Proceedings of the 2014 IEEE 8th Sensor Array and Multichannel Signal Processing Workshop (SAM),
A Coruna, Spain, 22–25 June 2014; pp. 265–268. [CrossRef]
6. Shirazinia, A.; Dey, S.; Ciuonzo, D.; Rossi, P.S. Massive MIMO for Decentralized Estimation of a Correlated
Source. IEEE Trans. Signal Process. 2016, 64, 2499–2512. [CrossRef]
7. Cao, J.; Wang, D.; Li, J.; Sun, Q.; Hu, Y. Uplink spectral efficiency analysis of multi-cell multi-user massive
MIMO over correlated Ricean channel. Sci. China Inf. Sci. 2018, 61, 082305. [CrossRef]
8. Björnson, E.; Matthaiou, M.; Debbah, M. Massive MIMO with non-ideal arbitrary arrays: Hardware scaling
laws and circuit-aware design. IEEE Trans. Wirel. Commun. 2015, 14, 4353–4368. [CrossRef]
9. Dai, L. An Uplink Capacity Analysis of the Distributed Antenna System (DAS): From Cellular DAS to DAS
with Virtual Cells. IEEE Trans. Wirel. Commun. 2014, 13, 2717–2731.
10. Liu, Z.; Dai, L. On the Scaling Behavior of Average Ergodic Capacity of Distributed MIMO Systems.
In Proceedings of the 2015 IEEE Global Communications Conference (GLOBECOM), San Diego, CA, USA,
6–10 December 2015; pp. 1–6. [CrossRef]
11. Wang, J.; Dai, L. Asymptotic Rate Analysis of Downlink Multi-User Systems with Co-Located and Distributed
Antennas. IEEE Trans. Wirel. Commun. 2015, 14, 3046–3058. [CrossRef]
12. Wang, J.; Dai, L. Downlink Rate Analysis for Virtual-Cell Based Large-Scale Distributed Antenna Systems.
IEEE Trans. Wirel. Commun. 2016, 15, 1998–2011. [CrossRef]
13. Khadka, A.; Adachi, K.; Sun, S.; Wang, J.; Zhu, H.; Wang, J. Cooperative Transmission Strategy over Users’
Mobility for Downlink Distributed Antenna Systems. In Proceedings of the GLOBECOM 2017–2017 IEEE
Global Communications Conference, Singapore, 4–8 December 2017; pp. 1–6. [CrossRef]
14. Zhu, H.; Wang, J. Radio Resource Allocation in Multiuser Distributed Antenna Systems. IEEE J. Sel.
Areas Commun. 2013, 31, 2058–2066. [CrossRef]
18
Electronics 2018, 7, 317
15. Lv, Q.; Li, J.; Zhu, P.; You, X. Spectral Efficiency Analysis for Bidirectional Dynamic Network with Massive
MIMO Under Imperfect CSI. IEEE Access 2018, 6, 43660–43671. [CrossRef]
16. Studer, C.; Wenk, M.; Burg, A. MIMO transmission with residual transmit-RF impairments. In Proceedings
of the 2010 International ITG Workshop on Smart Antennas (WSA), Bremen, Germany, 23–24 February 2010;
pp. 189–196. [CrossRef]
17. Papazafeiropoulos, A.; Clerckx, B.; Ratnarajah, T. Mitigation of phase noise in massive MIMO systems:
A rate-splitting approach. In Proceedings of the 2017 IEEE International Conference on Communications
(ICC), Paris, France, 21–25 May 2017; pp. 1–7. [CrossRef]
18. Pitarokoilis, A.; Mohammed, S.K.; Larsson, E.G. Uplink Performance of Time-Reversal MRC in Massive
MIMO Systems Subject to Phase Noise. IEEE Trans. Wirel. Commun. 2015, 14, 711–723. [CrossRef]
19. Pitarokoilis, A.; Mohammed, S.K.; Larsson, E.G. Achievable rates of ZF receivers in massive MIMO
with phase noise impairments. In Proceedings of the 2013 Asilomar Conference on Signals, Systems
and Computers, Pacific Grove, CA, USA, 3–6 November 2013; pp. 1004–1008. [CrossRef]
20. Li, J.; Wang, D.; Zhu, P.; You, X. Uplink Spectral Efficiency Analysis of Distributed Massive MIMO with
Channel Impairments. IEEE Access 2017, 5, 5020–5030. [CrossRef]
21. Fang, Y.; Li, X.; Qiu, L. Asymptotic Equivalent Performance of Uplink Massive MIMO Systems with Phase
Noise. In Proceedings of the 2018 IEEE International Conference on Communications (ICC), Kansas City,
MO, USA, 20–24 May 2018; pp. 1–6. [CrossRef]
22. Zhu, J.; Schober, R.; Bhargava, V.K. Physical layer security for massive MIMO systems impaired by phase
noise. In Proceedings of the 2016 IEEE 17th International Workshop on Signal Processing Advances in
Wireless Communications (SPAWC), Edinburgh, UK, 3–6 July 2016; pp. 1–5. [CrossRef]
23. Corvaja, R.; Armada, A.G. Phase Noise Degradation in Massive MIMO Downlink with Zero-Forcing and
Maximum Ratio Transmission Precoding. IEEE Trans. Veh. Technol. 2016, 65, 8052–8059. [CrossRef]
24. Krishnan, R.; Khanzadi, M.R.; Krishnan, N.; Wu, Y.; i Amat, A.G.; Eriksson, T.; Schober, R. Linear Massive
MIMO Precoders in the Presence of Phase Noise - A Large-Scale Analysis. IEEE Trans. Veh. Technol. 2016,
65, 3057–3071. [CrossRef]
25. Petrovic, D.; Rave, W.; Fettweis, G. Effects of Phase Noise on OFDM Systems with and without PLL:
Characterization and Compensation. IEEE Trans. Commun. 2007, 55, 1607–1616. [CrossRef]
26. Papazafeiropoulos, A.; Clerckx, B.; Ratnarajah, T. Rate-splitting to mitigate residual transceiver hardware
impairments in massive MIMO systems. IEEE Trans. Veh. Technol. 2017, 66, 8196–8211. [CrossRef]
27. Li, J.; Wang, D.; Zhu, P.; You, X. Spectral efficiency analysis of single-cell multi-user large-scale distributed
antenna system. IET Commun. 2014, 8, 2213–2221. [CrossRef]
28. Li, J.; Wang, D.; Zhu, P.; Wang, J.; You, X. Downlink Spectral Efficiency of Distributed Massive MIMO
Systems with Linear Beamforming Under Pilot Contamination. IEEE Trans. Veh. Technol. 2018, 67, 1130–1145.
[CrossRef]
29. Jose, J.; Ashikhmin, A.; Marzetta, T.L.; Vishwanath, S. Pilot contamination and precoding in multi-cell TDD
systems. IEEE Trans. Wirel. Commun. 2011, 10, 2640–2651. [CrossRef]
30. Hosseini, K.; Yu, W.; Adve, R.S. Large-scale MIMO versus network MIMO for multicell interference
mitigation. In Proceedings of the 2014 IEEE 15th International Workshop on Signal Processing Advances in
Wireless Communications (SPAWC), Toronto, ON, Canada, 22–25 June 2014; pp. 70–74. [CrossRef]
31. Li, J.; Wang, D.; Zhu, P.; You, X. Benefits of Beamforming Training Scheme in Distributed Large-Scale MIMO
Systems. IEEE Access 2018, 6, 7432–7444. [CrossRef]
32. Hoydis, J.; ten Brink, S.; Debbah, M. Massive MIMO in the UL/DL of Cellular Networks: How Many
Antennas Do We Need? IEEE J. Sel. Areas Commun. 2013, 31, 160–171. [CrossRef]
c 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access
article distributed under the terms and conditions of the Creative Commons Attribution
(CC BY) license (http://creativecommons.org/licenses/by/4.0/).
19
electronics
Article
Spectral and Energy Efficiency of Distributed Massive
MIMO with Low-Resolution ADC
Jiamin Li, Qian Lv *, Jing Yang, Pengcheng Zhu and Xiaohu You
National Mobile Communications Research Laboratory, Southeast University, Nanjing 210096, China;
[email protected] (J.L.); [email protected] (J.Y.); [email protected] (P.Z.); [email protected] (X.Y.)
* Correspondence: [email protected]; Tel.: +25-5209-1635
Abstract: In this paper, considering a more realistic case where the low-resolution analog-to-digital
convertors (ADCs) are employed at receiver antennas, we investigate the spectral and energy
efficiency in multi-cell multi-user distributed massive multi-input multi-output (MIMO) systems with
two linear receivers. An additive quantization noise model is provided first to study the effects of
quantization noise. Using the model provided, the closed-form expressions for the uplink achievable
rates with a zero-forcing (ZF) receiver and a maximum ratio combination (MRC) receiver under
quantization noise and pilot contamination are derived. Furthermore, the asymptotic achievable rates
are also given when the number of quantization bits, the per user transmit power, and the number of
antennas per remote antenna unit (RAU) go to infinity, respectively. Numerical results prove that
the theoretical analysis is accurate and show that quantization noise degrades the performance in
spectral efficiency, but the growth in the number of antennas can compensate for the degradation.
Furthermore, low-resolution ADCs with 3 or 4 bits outperform perfect ADCs in energy efficiency.
Numerical results imply that it is preferable to use low-resolution ADCs in distributed massive
MIMO systems.
Keywords: distributed massive MIMO; energy efficiency; spectral efficiency; pilot contamination;
quantization noise
1. Introduction
Massive multi-input multi-output (MIMO) systems are an essential technology for the fifth
generation (5G) mobile networks because they can significantly improve spectral efficiency and energy
efficiency [1–6]. In massive MIMO systems, a relatively small number of users are served by hundreds
or thousands of antennas employed at base stations in the same time-frequency resource. The huge
number of antennas provides a high number of degrees-of-freedom, which favors low-complexity
receivers, such as maximum ratio combination (MRC) and zero-forcing (ZF), and beamforming, such
as ZF and maximum ratio transmission (MRT) [2,7,8]. Therefore, we consider MRC and ZF receivers
for uplink transmission in this paper. There are two categories for massive MIMO: co-located massive
MIMO and distributed massive MIMO [9]. Compared to co-located massive MIMO, distributed
massive MIMO has advantages of increasing spectral efficiency, energy efficiency, and system coverage
due to the reduced access distance [8,10–12]. Hence, a distributed massive MIMO system is considered
in this paper.
Although massive MIMO systems have significant performance gains, they also face new
challenges: high total power consumption, expensive hardware, and mass data processing [13].
Specifically, each antenna is equipped with a radio frequency (RF) chain, including an analog-to-digital
converter (ADC) unit in massive MIMO systems. However, with the increase in the antenna number,
the hardware complexity and the power consumption of ADCs increase exponentially with the number
of quantization bits [14]. Therefore, one promising solution is to employ low-resolution ADCs in
massive MIMO systems. The study of low-resolution ADC in MIMO or massive MIMO systems has
caused widespread concern.
Spectral and energy efficiency are two fundamental metrics to analyze the impacts of
low-resolution ADCs. Spectral efficiency was investigated in [15–17]. The performance of 1-bit
resolution ADC in MIMO systems was studied in [15] considering the nonlinear characteristics of a
quantizer. In massive MIMO systems with low-resolution ADCs, the uplink achievable rate using
the common MRC receivers has been investigated in [16], and the uplink achievable rate using the
common ZF receivers was studied in [17]. However, these two papers made an assumption that
the base station had perfect channel state information (CSI), and they only considered a single-cell
massive MIMO system. In fact, the CSI is not available at the base station. On the other hand, energy
efficiency was studied in [13,18–20]. The optimal number of quantized bits and antenna selection were
considered to maximize the energy efficiency of general MIMO with low-resolution ADCs in [13].
It was pointed out in [18] that very low bit resolution is not preferable from the perspective of energy
efficiency. A function about energy efficiency and the number of quantized bits was obtained in [19].
The previous papers mainly studied a single-cell system, made an assumption that the base
station had perfect CSI, and did not analyze spectral efficiency and energy efficiency simultaneously.
Hence, in this paper, a multi-cell multi-user massive MIMO system with low-resolution ADCs is
considered, and we assume that the base stations estimate CSI during the uplink pilot transmission
phase. Furthermore, the uplink spectral and energy efficiency are both analyzed. Here are the key
contributions of this paper:
1. A joint uplink signal model is provided and it enables us to study the effects of pilot contamination
and quantization noise simultaneously.
2. Under imperfect CSI and considering MRC and ZF receivers, we derive the closed-form
expressions for the uplink achievable rates. The asymptotic performance with quantization bits,
the number of antennas per RAU, and per user transmit power are also obtained.
3. The theoretical results are verified by performing Monte Carlo simulations, and we obtain
deep insight into the impacts of quantization noise on the uplink spectral efficiency and energy
efficiency in distributed massive MIMO systems.
2. System Model
We consider a distributed massive MIMO system. There are L adjacent cells, and each cell consists
of M remote antennas units (RAUs) and K single-antenna users. Each RAU is equipped with an
array of N antennas. Each antenna is equipped with a low-resolution ADC, which means system
performance will be degraded by quantization noise. RAUs in the same cell transmit or receive signals
simultaneously while the beamforming design and signal processing are performed in a baseband
processing unit. An example is given in Figure 1. There are L = 7 adjacent cells, and in cell-1, there are
K = 6 users and M = 6 RAUs.
22
Electronics 2018, 7, 391
&HOO
&HOO &HOO
&HOO
&HOO &HOO
&HOO
Baseband LLow-resolution
processing unit User ADCs RAU
√ L
yl = pu ∑i=1 Gl,i xi + nl , (1)
where xi is the K × 1 signal vector transmitted by the K users in cell i, pu is the uplink transmitted
power, and nl ∼ CN (0, I MN ) is the additive white Gaussian noise, Gl,i = [gl,i,1 , ..., gl,i,K ] is the MN × K
channel matrix from M RAUs in cell l to K users in cell i, wherein
T
gl,i,k = λl,1,i,k hTl,1,i,k , · · · , λl,M,i,k hTl,M,i,k (2)
where λl,m,i,k is the path loss between the k-th user in the i-th cell and the m-th RAU in the l-th
cell, which is dependent on the corresponding distance, and hl,m,i,k ∼ CN (0, I N ) denotes the small
scale fading.
This paper assumes that the CSI is unknown to the base station, and pilot training is performed.
Motivated by [21], based on the minimum mean square error (MMSE) channel estimation, the
equivalent estimated channel can be given by
T
ĝi,l,k = β i,1,l,k ĥTi,k,1 , · · · , β i,M,l,k ĥTi,k,M (3)
where
λ2i,m,l,k
β i,m,l,k = . (4)
∑ Lj=1 λi,m,j,k + 1/(τ pu )
τ denotes the length of pilot sequences, β i,m,l,k denotes the equivalent path loss between the k-th user in
Δ
the l-th cell and the m-th RAU in the i-th cell, and ĥi,k = [ĥTi,k,1 , · · · , ĥTi,k,M ]T ∼ CN (0, I MN ) represents
the equivalent small scale fading part of the estimated channel. Because of the orthogonality principle
of MMSE estimation theory, gi,l,k can be decomposed as
23
Electronics 2018, 7, 391
where Q(.) represents the quantization function. Assuming that the gain of automatic gain control is
appropriately set, the additive quantization noise model (AQNM) can be employed to reformulate the
quantized signal vector as
√ L
yl,q = αyl + nl,q = α pu ∑i=1 Gl,i xi + αnl + nl,q (7)
where α = 1 − ρ, ρ is the inverse of the signal-to-quantization-noise ratio, and nl,q denotes the
additive uncorrelated quantization noise vector, which is Gaussian-distributed. The parameter ρ is a
constant dependent on the number of quantization bits b. According to [16], the covariance matrix of
quantization noise nl,q for a fixed channel realization can be denoted as
L
Rnl,q = α(1 − α)diag pu ∑i=1 Gl,i GH
l,i + I . (8)
rl,k = aH
l,k yl,q
√ L K √ L K
= α pu ∑ i =1 ∑ j =1 aH
l,k ĝl,i,j xi,j + α pu ∑i =1 ∑ j=1 al,k g̃l,i,j xi,j + αal,k nl + αal,k nl,q
H H H
(9)
where al,k is the linear receiver vector in cell l for user k, and xi,j ∼ CN (0, 1) is the j-th column of
xi . In this paper, we focus on two linear receivers, namely MRC and ZF. Mathematically, al,k can be
given by
ĝl,l,k , for MRC
al,k = (10)
fl,l,k , for ZF
−1
where fl,l,k is the k-th column of Ĝl,l ĜH
l,l Ĝl,l , and Ĝl,l = [ĝl,l,1 , · · · , ĝl,l,K ].
Motivated by [22,23], treating the interference as worst-case unrelated additive noise, the lower
bound of the achievable uplink rate of the k-th user in the l-th cell can be given by
pu α2 | aH
l,k ĝl,l,k |
2
Rl,k (p) = E log2 1+ (11)
E aH
l,k pu α ∑(i,j) =(l,k) ĝl,i,j ĝl,i,j + pu α ∑(i,j) g̃l,i,j g̃l,i,j + α + Rnl,q al,k | Ĝl,l
2 H 2 H 2
( a) pu α2 | aH
l,k ĝl,l,k |
2
= E log2 1 + (12)
E Il,k + pu α2 ∑i =l |aHl,k ĝl,i,k | + α al,k
2 2 H 2
where p is the transmitted power vector of K users. Since the denominator of Equation (11) is a
conditional expectation operator and the estimated error vector and estimated channel vector are
independent, Il,k is given as
L
Il,k = pu α2 ∑i=1 ∑ j =k aH
l,k E ĝl,i,j ĝl,i,j al,k + pu α ∑(i,j) al,k E g̃l,i,j g̃l,i,j al,k + al,k E Rnl,q al,k
H 2 H H H
(13)
24
Electronics 2018, 7, 391
wherein
E ĝl,i,j ĝH
l,i,j = diag β l,1,i,j I N , · · · , β l,M,i,j I N
E g̃l,i,j g̃H
l,i,j = diag ηl,1,i,j I N , · · · , ηl,M,i,j I N
E Rnl,q = diag pu Ĝl,l ĜH l,l + DR (14)
K K K K
DR = diag pu ∑ ηl,1,l,j + pu ∑ ∑ λl,1,i,j + 1 IN , · · · , pu ∑ ηl,M,l,j + pu ∑ ∑ λl,M,i,j + 1 IN .
j =1 i = l j =1 j =1 i = l j =1
(2) Power consumption model: According to [24–26], for cell l, the total power consumption
model can be given by
The first term PTC is the power consumption of transceiver chains, which can be given by
where PBS and PUE are the power consumption of running the circuit components employed at the
base station and users, PSYN are the power consumed by the local oscillator, and PADC = a0 N2b + a1
are the power consumed by ADC, wherein a0 and a1 are constant parameters, ρ = 1 for the distributed
antenna system (DAS), and ρ = 0 for the co-located antenna system (CAS). This results from the
assumption that antennas at the same RAU are connected to a common oscillator, while oscillators
at different RAUs are different in the DAS, and all antennas are connected to a single oscillator in
the CAS.
The second term PLP is the power consumption of the MRC/ZF receiver at the base station, which
can be given by
B 3MNK 3
MNK (3K +1)
PLP = B T −
T
τ 2MNK
L + LBS (1 − d ) + d 3LBS +
K
LBS (17)
BS T
where d = 0 for MRC while d = 1 for ZF, B is the bandwidth, T denotes the symbols for uplink
transmission, and LBS is the computational efficiency of arithmetic complex-valued operations for
a Joule.
The third term PT is transmit power, which can be represented as
T −τ K
PT = T ξ pu (18)
where P0 and PBT are the fixed and traffic-dependent power consumption at each backhaul, respectively.
(3) Global energy efficiency model: Based on the above analysis, the total power consumption for
all cells can be given by
T −τ L K
PTotal (p) = LPIND + ξT LKpu + PBT MB ∑l =1 ∑k=1 Rl,k (p) (20)
25
Electronics 2018, 7, 391
According to [24], the global energy efficiency is defined as the ratio of the achievable sum rate to
the total power consumption in Watts. Mathematically, it can be defined as
Theorem 1. Using MRC receiver with low-resolution ADCs and pilot contamination, the closed-form
expression for the uplink achievable rate of the k-th user in the l-th cell is given by
2
pu α ( N ∑mM=1 β l,m,l,k ) + N ∑m
M 2
=1 β l,m,l,k
l,k = log2
Rmrc 1+
pu αΩl,k + pu αΞl,k +(1−α)Φl,k +α ∑m
M (23)
=1 β l,m,l,k
2
where
M L K
Ωl,k = N ∑m=1 ∑i=1 ∑ j=1 β l,m,l,k ηl,m,i,j
M L M 2
Ξl,k = N ∑ β l,m,l,k ∑ ∑ β l,m,i,j + ∑ β l,m,i,k +∑ N ∑ β1/2 1/2
l,m,l,k β l,m,i,k
m =1 i =1 j = k i =l i =l m =1
⎛ ⎞
M 2 M M
Φl,k = pu ⎝ N ∑ β l,m,l,k +N ∑ β2l,m,l,k + N ∑∑ β l,m,l,k β l,m,l,j ⎠ + Υl,k
m =1 m =1 j = k m =1
M K K
Υl,k = N ∑m=1 β l,m,l,k 1 + pu ∑ j=1 ηl,m,l,j + pu ∑i =l ∑ j=1 λl,m,i,j . (24)
Theorem 2. Using ZF receiver with low-resolution ADCs and pilot contamination, the closed-form expression
for the uplink achievable rate of user k in the l-th cell is given by
⎛ ⎞
⎜ M
pu αζ ∑m ⎟
=1 β l,m,l,k
Rl,k = log2 ⎜ ⎟
zf
⎝1 + M L K
M
⎠ (25)
αpu ∑ ∑ ∑ β l,m,i,j + ∑ ∑ ηl,m,i,j +ζ ∑ β l,m,i,k +(1−α) ζ pu ∑ β l,m,l,k +Δl,k +αM
m =1 i =l j =k i =1 j =1 i =l m =1
where ζ = MN − K + 1, and
M K K
Δl,k = ∑ m =1 1 + pu ∑ j=1 ηl,m,l,j + pu ∑i =l ∑ j=1 λl,m,i,j . (26)
From Equations (23) and (25), it can be concluded that the quantization noise influences both the
numerator and the denominator of Equation (A4). This means that the quantization noise is unlike the
additive noise, which only affects the denominator.
26
Electronics 2018, 7, 391
Based on the theorems above, we analyze the asymptotic performance with quantization
bits, per user transmit power, and the number of antennas per RAU, respectively. The results are
given below.
Case 1: With a fixed transmitted power per user pu and a total number of antennas per cell MN,
when the number of quantization bits b → ∞, the inverse of the signal-to-quantization-noise ratio
ρ tends toward zero, which means that α in Equations (23) and (25) tends toward 1. The following
results can then be obtained in this case by replacing the α in Equations (23) and (25) with 1
2
pu ( N ∑mM=1 β l,m,l,k ) + N ∑m
M 2
=1 β l,m,l,k
R̃mrc
l,k = log2 1 + (27)
pu Ωl,k + pu Ξl,k +∑m
M
=1 β l,m,l,k
2
⎛ ⎞
⎜ M
pu ζ ∑ m ⎟
⎜ =1 β l,m,l,k ⎟.
l,k = log2 ⎝1 +
R̃zf M L K ⎠ (28)
pu ∑ ∑ ∑ β l,m,i,j + ∑ ∑ ηl,m,i,j +ζ ∑ β l,m,i,k + M
m =1 i =l j =k i =1 j =1 i =l
Case 1 shows the achievable uplink rates without considering the quantization noise caused by
ADC. It can be seen that the spectral efficiency is only limited by pilot contamination and channel
estimation error. Moreover, since the power consumption of an ADC PADC = a0 N2b + a1 is an
exponential function of b, PADC tends toward infinity when b → ∞. As shown in Equation (20), the
total power consumption also goes to infinity. Hence, the limited achievable rates and unlimited power
consumption lead to the fact that the global energy efficiency tends toward zero, that is φ(p) → 0
when b → ∞, while pu and MN are fixed.
Case 2: With a fixed number of quantization bits b and antennas per cell MN, when pu → ∞,
the ultimate rates of user k in cell l with both receivers can be directly obtained by dividing the
dominators and numerators of Equations (23) and (25) by pu , which are given by
2
α ( N ∑mM=1 β l,m,l,k ) + N ∑m
M 2
=1 β l,m,l,k
l,k = log2
R̃mrc 1+ αΩl,k +αΞl,k +(1−α)Φl,k
(29)
⎛ ⎞
⎜ M
αζ ∑m ⎟
⎜ =1 β l,m,l,k
l,k = log2 ⎝1 +
R̃zf ⎟
⎠ (30)
M L K M
α ∑ ∑ ∑ β l,m,i,j + ∑ ∑ ηl,m,i,j +ζ ∑ β l,m,i,k +(1−α) ζ ∑ β l,m,l,k +Δl,k
m =1 i =l j =k i =1 j =1 i =l m =1
where
⎛ ⎞
M 2 M M
Φl,k = ⎝ N ∑ β l,m,l,k +N ∑ β2l,m,l,k + N ∑∑ β l,m,l,k β l,m,l,j ⎠ + Υl,k
m =1 m =1 j = k m =1
M K K
Υl,k = N ∑m=1 β l,m,l,k ∑ j=1 ηl,m,l,j + ∑i =l ∑ j=1 λl,m,i,j
M K K
Δl,k = ∑m=1 ∑ j=1 ηl,m,l,j + ∑i =l ∑ j=1 λl,m,i,j .
Case 2 indicates that, as pu grows indefinitely, the achievable uplink rates approach certain values
dependent on the resolution of ADC. This observation shows that the performance degradation due to
low-resolution ADCs cannot be compensated by increasing the transmit power. Furthermore, it can
be seen that, as the transmit power increases, the system energy efficiency tends toward zero. This is
because the total power consumption presented in Equation (20) tends toward infinity as pu increases
indefinitely, but the unlimited growth in the transmit power cannot improve the achievable uplink
rates indefinitely.
Case 3: With a fixed number of quantization bits b, the number of RAUs per cell M and the
transmitted power per user pu , when N → ∞, the limiting rates of user k in cell l with both receivers
27
Electronics 2018, 7, 391
can be directly obtained by dividing the dominators and numerators of Equations (23) and (25) by N 2 ,
which are given by
2
α(∑m
M
=1 β l,m,l,k )
l,k = log2
Rmrc 1+ 2 2
(31)
M
α ∑i =l ∑ m =1 β1/2 1/2
l,m,l,k β l,m,i,k+(1−α)(∑m
M
=1 β l,m,l,k )
M
α ∑m =1 β l,m,l,k
l,k = log2 1 +
Rzf M
α ∑i =l ∑ m β l,m,i,k +(1−α) ∑m M . (32)
=1 =1 β l,m,l,k
Case 3 shows that, when the number of antennas per RAU grows without bound, the impacts of
quantization noise vanish. However, the achievable rates with both receivers tend toward certain and
limited values as N goes infinity. This results from the presence of pilot contamination. Furthermore,
since the power consumption of transceiver chains and linear processing at the base station are
proportional to N, they tend toward infinity when N → ∞. As shown in Equation (20), the total
power consumption also goes to infinity. Hence, the limited achievable rates and unlimited power
consumption lead to the fact that the global energy efficiency tends toward zero, that is φ(p) → 0
when N → ∞ while b, M, and pu are fixed.
4. Numerical Results
In this section, we verify the accuracy of the theoretical results in Section 3 by a series of Monte
Carlo simulations. A multi-cell distributed massive MIMO system is considered, which consists
of L = 7 cells, M = 7 RAUs per cell, K = 6 users per cell, and the cell radius D is normalized
to 1. In each cell, all users are uniformly
√ distributed, while RAUs have fixed locations with radiuses
r1 = 0, r2 = · · · = r7 = (3 − 3)/2 and angles θ1 = 0, θ2 = π/6, θ3 = π/2, θ4 = 5π/6, θ5 =
7π/6, θ6 = 3π/2, and θ7 = 11π/6. The path loss between the k-th user in the i-th cell and the m-th
RAU in the l-th cell λl,m,i,k is modeled as λl,m,i,k = d− ι
l,m,i,k , where dl,m,i,k is the corresponding distance,
and ι assumed as ι = 3.7 is the path loss exponent. Moreover, the length of pilot sequences is τ = K.
The coherence time of the channel is assumed as T = 196 symbols, and the power consumption
parameters are given in Table 1.
Parameters Values
Transmitted power per user pu 0.02 Watts
Power consumption per antenna at base station PBS 0.1 Watts
Power consumption per antenna at users PUE 0.1 Watts
Power consumption per local oscillator PSYN 1 Watts
Computational efficiency at base stations LBS 12.8 Gflops/Watt
Power amplified efficiency ξ 0.4
Fixed power consumption per backhaul P0 0.825 Watts
Traffic-dependent backhaul power PBT 0.25 Watts/(Gbit/s)
Parameters of ADC a0 &a1 10−3 & 0.02
We first prove the accuracy of the theoretical results given in Theorems 1 and 2. Figure 2 illustrates
the uplink spectral efficiency per cell versus the number of quantization bits with different numbers
of antennas per RAU. It can be seen that the closed-form expressions and simulation results match
well with each other using both MRC and ZF receivers. As the number of antennas per RAU increases,
the uplink spectral efficiency grows obviously for both receivers. Furthermore, for both receivers, the
uplink spectral efficiency increases rapidly with the increase in quantization bits b when b is small,
while the growth of b cannot improve the spectral efficiency further when b is large. It can be concluded
that low-resolution ADCs are acceptable in massive MIMO systems, and employing a large number
28
Electronics 2018, 7, 391
of antennas at each RAU can compensate for the performance degradation. In the following, the
closed-form expressions will be used for numerical work.
26
24
N=20
22
Spectral efficiency (bit/s/Hz)
20
18
N=10
16
14
N=5
12
ZF,Simulation
10 ZF,Theoretical
MRC,Simulation
MRC,Theoretical
8
1 2 3 4 5 6 7 8
Number of quantization bits
Figure 2. Spectral efficiency versus the number of quantization bits with different numbers of antennas
per RAU.
Next, Figure 3 illustrates the energy efficiency versus spectral efficiency with different numbers
of quantization bits and of antennas per RAU. It can be seen that, as the number of antennas and
the number of quantization bits increase, energy efficiency increases first and then decreases. This is
because the power consumption and spectral efficiency both increase with the increase in antennas
and quantization bits, but the improvement of spectral efficiency dominates first, and the power
consumption then dominates. The results illustrate that we cannot improve the spectral efficiency and
energy efficiency simultaneously without bound, and there needs to be a tradeoff between them, which
was investigated in [27–29]. Moreover, it can be seen in Figure 3b that b = 3 or b = 4 are preferable
under the system configuration mentioned above. If b increases further, the spectral efficiency can be
slightly improved while the energy efficiency decreases rapidly. It should be noted that the optimal
number of bits is dependent on system configuration and system parameters. Figure 3 also indicates
that low-resolution ADCs (b = 3 or 4 bits in our simulation results) are preferable in distributed massive
MIMO systems.
Finally, the energy efficiency against spectral efficiency with different numbers of quantization bits
and transmitted power per user is presented in Figure 4. The same conclusion about the relationship
between energy efficiency and spectral efficiency with different b can be obtained from Figure 4. As for
the transmitted power per user, it can be seen that, with its increase, the energy efficiency increases
first and then decreases. This results from the fact that the power consumption linearly increases with
the growth of transmitted power, but the spectral efficiency increases first and then tends toward a
certain value.
29
Electronics 2018, 7, 391
b=1 bits
1 b=2 bits
b=3 bits
b=6 bits
b=8 bits
b=9 bits
Energy efficiency 0.8 b=11 bits
0.6
0.4
0.2
0
0 5 10 15 20 25 30 35 40
Spectral Efficiency
(a)
N=4
0.9 N=6
N=8
N=12
0.8
N=16
N=22
0.7 N=30
Energy efficiency
0.6
0.5
0.4
0.3
0.2
0.1
0
5 10 15 20 25 30
Spectral efficiency
(b)
Figure 3. Energy efficiency versus spectral efficiency with different numbers of quantization bits b
and different numbers of antennas N per RAU. (a) Each line corresponds to different numbers of
quantization bits with b = [1, 2, 3, 6, 8, 9, 11] and the points on each line correspond to different numbers
of antennas per RAU with N= [1:1:6, 8, 10, 12, 15, 20, 30, 40, 50, 80, 100]. (b) Each line corresponds to
different numbers of antennas per RAU with N= [4, 6, 8, 12, 16, 22, 30] and the points on each line
correspond to different numbers of quantization bits with b = [1:1:12].
30
Electronics 2018, 7, 391
1
b=1 bits
b=2 bits
0.95 b=3 bits
b=4 bits
b=5 bits
0.9 b=6 bits
0.85
Energy efficiency
0.8
0.75
0.7
0.65
0.6
0.55
18 20 22 24 26 28 30 32
Spectral efficiency
Figure 4. Energy efficiency versus spectral efficiency with the number of quantization bits b = [1:1:6]
bits and transmitted power per user pu = [0.01, 0.02, 0.05, 0.08, 0.1, 0.15, 0.2, 0.3, 0.4, 0.6, 0.8, 1] W, where
each line corresponds to different b and the points on each line correspond to different pu .
5. Conclusions
In this paper, we analyzed the uplink spectral and energy efficiency simultaneously in distributed
massive MIMO systems with low-resolution ADCs. Furthermore, this paper considered a more
realistic scenario where the base station did not have CSI and it obtained the estimated CSI during
the pilot phase. In this case, the pilot contamination presents and degrades the system performance.
We first gave an additive quantization noise model and got the estimated CSI with pilot contamination.
Under the imperfect CSI, we derived the closed-form expressions for achievable uplink rates using
MRC and ZF receivers. Furthermore, we obtained the asymptotic performance with the number of
quantization bits, the per user transmit power, and the per RAU antenna number, respectively. The
theorems are verified by simulation. It can be noted that the increase in antennas can compensate for
the spectral efficiency degradation caused by quantization noise. Furthermore, the energy efficiency
with low-resolution ADCs are better than that with perfect ADCs. Numerical results imply that it is
preferable to use low-resolution ADCs in distributed massive MIMO systems.
We intend to extend our research considering the tradeoff between spectral efficiency and energy
efficiency, which involves multi-objective optimization. Furthermore, in order to make the system
more energy-efficient, we plan to extend our research considering RAU selection.
Author Contributions: Formal analysis, J.L.; Methodology, J.L.; Supervision, P.Z. and X.Y.; Validation, J.L. and
J.Y.; Writing original draft, J.L. and Q.L.; Writing review & editing, J.L. and Q.L.
Funding: This work was supported in part by National Natural Science Foundation of China (NSFC) (Grant No.
61501113, 61571120) and the Jiangsu Provincial Natural Science Foundation (Grant No. BK20150630, BK20180011).
Conflicts of Interest: The authors declare no conflict of interest.
31
Electronics 2018, 7, 391
Lemma A1 ([22]). Suppose { Xi } are independent Gamma distributed random variables, i.e., { Xi } ∼ Γ(k i , θi ).
Then the first two moments of the sum ∑i Xi can be given by
E ∑ i Xi = ∑i k i θi , (A1)
2 2
E ∑ i Xi = ∑i k i θi2 + ∑i k i θi . (A2)
Lemma A2 ([21]). For the p-dimensional non-isotropic channel vector x whose strength is distributed as
xH x ∼ Γ(k, θ ), when projected onto a s-dimensional subspace, the distribution of the projection power can be
approximated as Γ(sk/p, θ ).
Lemma A3 ([30]). If x is an N × 1 isotropic random vector and A is a constant matrix. Then we can have
tr(A)
Ex xH Ax = . (A3)
N
pu α2 E [ | aH
l,k ĝl,l,k | ]
2
Rl,k (p) ≈ log2 1+ . (A4)
E Il,k + pu α2 ∑i =l |aH l,k ĝl,i,k | + α al,k
2 2 H 2
Consider the MRC receiver, it can be seen from Equation (A4) that the following terms need to
be simplified:
E ĝl,l,k 4 = k̂ l,l,k θ̂l,l,k
2
+ (k̂ l,l,k θ̂l,l,k )2 (A5)
where
M
=1 β l,m,l,k
2
N (∑m
M
=1 β l,m,l,k )
2 ∑m
k̂ l,l,k = M
θ̂l,l,k = M
. (A6)
∑m =1 β2l,m,l,k ∑m =1 β l,m,l,k
where ( a) results from the fact that the channel strength is Gamma-distributed.
Because of the pilot contamination, ĝl,l,k and ĝl,i,k are dependent, we have
2
2 ( a) M M
E |ĝH
l,l,k ĝl,i,k | = N ∑m=1 β1/2 1/2
l,m,l,k β l,m,i,k + N ∑m=1 β l,m,l,k β l,m,i,k (A8)
where (a) is obtained due to the fact that the channel strength is Gamma-distributed and due to
Lemma A1.
Using the fact that ĝl,l,k and g̃l,i,j are independent, we have
M L K
∑(i,j) E l,l,k E g̃l,i,j g̃l,i,j ĝl,l,k = N ∑m=1 ∑i =1 ∑ j=1 β l,m,l,k ηl,m,i,j .
ĝH H
(A9)
32
Electronics 2018, 7, 391
Substituting Equations (A5), (A7)–(A9), and (A11) into Equation (A4) yields the closed-form
expression expressed by Equation (23). This completes the proof.
H
1 a
= aHl,k2 ĝl,l,k ∼ Γ MNMN
−K +1 k̂
l,l,k , θ̂l,l,k , (A12)
aH
l,k
2 l,k
which results form Lemma A2 and from the fact that, from the perspective of each user, an intended
beam lies in a subspace of dimension s = MN − K + 1 with ZF receivers. Thus,
MN −K +1
E aH
1
= k̂ l,l,k θ̂l,l,k . (A13)
l,k
2 MN
( a)
= ∑ i = l ∑ j = k E E aH H
l,k ĝl,i,j ĝl,i,j al,k
(b) 1
= E ĝHl,i,j ĝl,i,j
MN
(c) 1 M
= M ∑i =l ∑ j=k ∑m=1 βl,m,i,j . (A14)
where (a) results from the fact that al,k and ĝl,i,j are independent, (b) results from Lemma A3, and (c)
l,i,j ĝl,i,j ∼ Γ ( k̂ l,i,j , θ̂l,i,j ).
results from the fact that ĝH
Similarly, we have
aH L K M
∑(i,j) E ∑i=1 ∑ j=1 ∑m=1 ηl,m,i,j .
al,k
2E g̃l,i,j g̃H =
l,k 1
aH
(A15)
l,k aH
l,k
l,i,j 2 M
33
Electronics 2018, 7, 391
Due to the pilot contamination, al,k and ĝl,i,k are dependent, we have
2
aHl,k
∑i =l aHl,k l,i,k =
MN −K +1 M
E ĝ M ∑i =l ∑m=1 βl,m,i,k . (A16)
( a) MN −K +1 1
= pu k̂ l,l,k θ̂l,l,k + Δ
MN M l,k
References
1. Wang, J.; Zhu, H.; Gomes, N.J.; Wang, J. Frequency Reuse of Beam Allocation for Multiuser Massive MIMO
Systems. IEEE Trans. Wirel. Commun. 2018, 17, 2346–2359. [CrossRef]
2. Marzetta, T.L. Noncooperative Cellular Wireless with Unlimited Numbers of Base Station Antennas.
IEEE Trans. Wirel. Commun. 2010, 9, 3590–3600. [CrossRef]
3. Larsson, E.G.; Edfors, O.; Tufvesson, F.; Marzetta, T.L. Massive MIMO for next generation wireless systems.
IEEE Commun. Mag. 2014, 52, 186–195. [CrossRef]
4. Lu, L.; Li, G.Y.; Swindlehurst, A.L.; Ashikhmin, A.; Zhang, R. An Overview of Massive MIMO: Benefits and
Challenges. IEEE J. Sel. Top. Signal Process. 2014, 8, 742–758. [CrossRef]
5. Wang, D.; Zhang, Y.; Wei, H.; You, X.; Gao, X.; Wang, J. An overview of transmission theory and techniques of
large-scale antenna systems for 5G wireless communications. Sci. China Inf. Sci. 2016, 59, 081301. [CrossRef]
6. Zhang, J.; Wen, C.; Jin, S.; Gao, X.; Wong, K. On Capacity of Large-Scale MIMO Multiple Access Channels
with Distributed Sets of Correlated Antennas. IEEE J. Sel. Areas Commun. 2013, 31, 133–148. [CrossRef]
7. Rusek, F.; Persson, D.; Lau, B.K.; Larsson, E.G.; Marzetta, T.L.; Edfors, O.; Tufvesson, F. Scaling Up MIMO:
Opportunities and Challenges with Very Large Arrays. IEEE Signal Process. Mag. 2013, 30, 40–60. [CrossRef]
8. Li, J.; Wang, D.; Zhu, P.; Wang, J.; You, X. Downlink Spectral Efficiency of Distributed Massive MIMO
Systems With Linear Beamforming Under Pilot Contamination. IEEE Trans. Veh. Technol. 2018, 67, 1130–1145.
[CrossRef]
9. Björnson, E.; Matthaiou, M.; Debbah, M. Massive MIMO with non-ideal arbitrary arrays: Hardware scaling
laws and circuit-aware design. IEEE Trans. Wirel. Commun. 2015, 14, 4353–4368. [CrossRef]
10. Zhu, H. Performance Comparison Between Distributed Antenna and Microcellular Systems. IEEE J. Sel.
Areas Commun. 2011, 29, 1151–1163. [CrossRef]
11. You, X.H.; Wang, D.M.; Sheng, B.; Gao, X.Q.; Zhao, X.S.; Chen, M. Cooperative distributed antenna systems
for mobile communications. IEEE Wirel. Commun. 2010, 17, 35. [CrossRef]
12. Dai, L. A Comparative Study on Uplink Sum Capacity with Co-Located and Distributed Antennas. IEEE J.
Sel. Areas Commun. 2011, 29, 1200–1213. [CrossRef]
34
Electronics 2018, 7, 391
13. Bai, Q.; Nossek, J.A. Energy efficiency maximization for 5G multi-antenna receivers. Trans. Emerg.
Telecommun. Technol. 2015, 26, 3–14. [CrossRef]
14. Walden, R.H. Analog-to-digital converter survey and analysis. IEEE J. Sel. Areas Commun. 1999, 17, 539–550.
[CrossRef]
15. Singh, J.; Dabeer, O.; Madhow, U. On the limits of communication with low-precision analog-to-digital
conversion at the receiver. IEEE Trans. Commun. 2009, 57, 3629–3639. [CrossRef]
16. Fan, L.; Jin, S.; Wen, C.; Zhang, H. Uplink Achievable Rate for Massive MIMO Systems With
Low-Resolution ADC. IEEE Commun. Lett. 2015, 19, 2186–2189. [CrossRef]
17. Qiao, D.; Tan, W.; Zhao, Y.; Wen, C.; Jin, S. Spectral efficiency for massive MIMO zero-forcing receiver with
low-resolution ADC. In Proceedings of the 2016 8th International Conference on Wireless Communications
Signal Processing (WCSP), Yangzhou, China, 13–15 October 2016; pp. 1–6. [CrossRef]
18. Verenzuela, D.; Bjrnson, E.; Matthaiou, M. Hardware design and optimal ADC resolution for uplink
massive MIMO systems. In Proceedings of the 2016 IEEE Sensor Array and Multichannel Signal Processing
Workshop (SAM), Rio de Janerio, Brazil, 10–13 July 2016; pp. 1–5. [CrossRef]
19. Sarajlić, M.; Liu, L.; Edfors, O. When Are Low Resolution ADCs Energy Efficient in Massive MIMO?
IEEE Access 2017, 5, 14837–14853. [CrossRef]
20. Roth, K.; Nossek, J.A. Achievable Rate and Energy Efficiency of Hybrid and Digital Beamforming Receivers
With Low Resolution ADC. IEEE J. Sel. Areas Commun. 2017, 35, 2056–2068. [CrossRef]
21. Li, J.; Wang, D.; Zhu, P.; You, X. Uplink Spectral Efficiency Analysis of Distributed Massive MIMO With
Channel Impairments. IEEE Access 2017, 5, 5020–5030. [CrossRef]
22. Li, J.; Wang, D.; Zhu, P.; You, X. Spectral efficiency analysis of single-cell multi-user large-scale distributed
antenna system. IET Commun. 2014, 8, 2213–2221. [CrossRef]
23. Ngo, H.Q.; Larsson, E.G.; Marzetta, T.L. Energy and Spectral Efficiency of Very Large Multiuser MIMO
Systems. IEEE Trans. Commun. 2013, 61, 1436–1449. [CrossRef]
24. Zuo, J.; Zhang, J.; Yuen, C.; Jiang, W.; Luo, W. Energy-Efficient Downlink Transmission for Multicell Massive
DAS With Pilot Contamination. IEEE Trans. Veh. Technol. 2017, 66, 1209–1221. [CrossRef]
25. Ngo, H.Q.; Tran, L.; Duong, T.Q.; Matthaiou, M.; Larsson, E.G. On the Total Energy Efficiency of Cell-Free
Massive MIMO. IEEE Trans. Green Commun. Netw. 2018, 2, 25–39. [CrossRef]
26. Björnson, E.; Sanguinetti, L.; Hoydis, J.; Debbah, M. Optimal Design of Energy-Efficient Multi-User MIMO
Systems: Is Massive MIMO the Answer? IEEE Trans. Wirel. Commun. 2015, 14, 3059–3075. [CrossRef]
27. Pham, Q.; Hwang, W. Fairness-Aware Spectral and Energy Efficiency in Spectrum-Sharing Wireless
Networks. IEEE Trans. Veh. Technol. 2017, 66, 10207–10219. [CrossRef]
28. Deng, L.; Rui, Y.; Cheng, P.; Zhang, J.; Zhang, Q.T.; Li, M. A Unified Energy Efficiency and Spectral Efficiency
Tradeoff Metric in Wireless Networks. IEEE Commun. Lett. 2013, 17, 55–58. [CrossRef]
29. Xu, L.; Yu, G.; Jiang, Y. Energy-Efficient Resource Allocation in Single-Cell OFDMA Systems: Multi-Objective
Approach. IEEE Trans. Wirel. Commun. 2015, 14, 5848–5858. [CrossRef]
30. Heath, W.H., Jr.; Wu, T.; Kwon, Y.H.; Soong, A.C.K. Multiuser MIMO in Distributed Antenna Systems With
Out-of-Cell Interference. IEEE Trans. Signal Process. 2011, 59, 4885–4899. [CrossRef]
31. Lim, Y.; Chae, C.; Caire, G. Performance Analysis of Massive MIMO for Cell-Boundary Users. IEEE Trans.
Wirel. Commun. 2015, 14, 6827–6842. [CrossRef]
c 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access
article distributed under the terms and conditions of the Creative Commons Attribution
(CC BY) license (http://creativecommons.org/licenses/by/4.0/).
35
electronics
Article
Channel Sounding for Multi-User Massive MIMO in
Distributed Antenna System Environment
Seoyoung Yu † and Jeong Woo Lee *,†
School of Electrical and Electronics Engineering, Chung-Ang University, Seoul 06974, Korea;
[email protected]
* Correspondence: [email protected]; Tel.: +82-2-820-5734
† Current address: 84 Heukseok-ro, Dongjak-gu, Seoul 06974, Korea.
Abstract: We propose a generation scheme for a sounding reference signal (SRS) suitable for
supporting a large number of users in massive multi-input multi-output (MIMO) system with
a distributed antenna system (DAS) environment. The proposed SRS can alleviate the pilot
contamination problem which occurs inherently in the multi-user system due to the limited number
of orthogonal sequences. The proposed SRS sequence is generated by applying a well-chosen phase
rotation to the conventional LTE/LTE-A SRS sequences without requiring an increased amount
of resource usage. We also propose using the correlation-aided channel estimation algorithm as a
supplemental scheme to obtain more reliable and refined channel estimation. It is shown that the
proposed SRS sequence and the supplemental channel estimation scheme improve significantly the
channel estimation performance in multi-user massive MIMO systems.
Keywords: massive multi-input multi-output (MIMO); distributed antenna systems (DAS); sounding
reference signal (SRS); channel estimation
1. Introduction
It is expected that the amount of mobile wireless traffic in 2020 will be 1000 times higher than that of
2010 [1–3]. Along with the dramatic growth in the demand for wireless communications, performance
requirements for data rate, spectral efficiency and energy efficiency are also getting higher [4–6].
To meet these growing demand and performance requirements for wireless communications, massive
multi-input multi-output (m-MIMO) technology was proposed as one of the key technologies for the
next generation cellular networks, known as fifth generation (5G) systems [7–10]. It is known that
m-MIMO systems, whose transmitter or receiver is equipped with massive number of antennas, can
improve the spectral efficiency and save energy in wireless communication systems [10–12]. As a result,
m-MIMO systems have recently attracted many researchers and engineers in many aspects. Multi-user
m-MIMO technology, in which a base station (BS) uses a large number of antennas to serve many pieces
of user equipment (UE) simultaneously on the same time-frequency resource, is one example that is
actively studied to be practically adopted in 5G systems [13–15]. The distributed antenna system (DAS)
has also been considered a key technology for feasible deployment of 5G systems [16–18]. In DAS
configuration, there exist a digital unit (DU) and multiple radio units (RUs) in a cell, where each RU is
connected to DU via fiber optic links. The DU manages RUs in a centralized manner, by which RUs
can transmit and receive signals in a cooperative manner.
The key requirement for enjoying the benefit of m-MIMO technology is to obtain the accurate
channel state information (CSI) for each link at the BS, or at RUs in DAS environment. In the
frequency-division duplex (FDD) approach, UEs estimate downlink (DL) channels by using DL
pilot signals, or sounding reference signals (SRS), transmitted from the BS [19]. The required number
of DL pilots in an FDD based approach is proportional to the number of BS antennas multiplied by the
number of served UEs, which complicates the adoption of such a DL channel estimation in massive
MIMO environments. Thus, the m-MIMO systems typically employ the time-division duplex (TDD)
approach to estimate the DL channel. In the TDD approach, UEs estimate DL channels by sending
mutually orthogonal uplink (UL) pilot signals (SRS) to BS based on the DL and UL channel reciprocity
within the channel coherence interval [9]. The total number of UL pilots required in such a TDD based
approach is proportional to the number of served UEs irrespective of the number of BS antennas [7,9].
For a given length of sequence, say M, we may generate at most M orthogonal sequences.
Conventionally, pilot sequences are mutually orthogonal, so the maximum number of pilot sequences
is limited by M. If the number of UEs in simultaneous service exceeds M, we need to reuse some or
all of the already generated orthogonal sequences. This results in the so-called pilot contamination
problem [20] due to the violation of orthogonality between pilot sequences. The pilot contamination
mainly limits the performance improvement of m-MIMO systems [7,15]. Most of the prior works
that proposed to resolve this problem have considered the use of mutually orthogonal SRS sequences.
They include pilot signal coordination [21], blind channel estimation with data samples [22] and
cooperative multi-cell precoding in m-MIMO systems [23]. The number of mutually orthogonal SRS
sequences is mainly limited by the length of the base sequence. In Long Term Evolution (LTE)/Long
Term Evolution-Advanced (LTE-A) systems, the number of orthogonal SRS sequences is 16, which is
not suitable for serving a large number of users in m-MIMO environments [24,25]. Using long pilot
sequences may reduce the pilot contamination, but they replace data sequences and reduce channel
spectral efficiency and throughput. If pilot sequences are made too long, they may occupy even the
restricted band, which must be prevented. Thus, it may be desirable to generate a larger number of
SRS sequences without increasing the sequence length.
For this purpose, we propose a mechanism for generating SRS sequences with a lower level of
pilot contamination which is suitable for serving a large number of UEs. In the proposed mechanism,
a phase rotation is applied to the base sequence without increasing the sequence length. The resultant
SRS sequences may be mutually correlated and thus still incur a pilot contamination as the number
of UEs grows and exceeds the length of base sequence. This results in high channel estimation error
when a linear estimation based on the orthogonality of SRS sequences is used. To resolve this problem,
we propose a two-step channel estimation algorithm by which the least square (LS) estimation [26] is
first applied and the minimum mean squared error (MMSE) estimation [27] is additionally applied
only to the group of UEs using mutually correlated SRS sequences. The proposed correlation-aided
channel estimation shows the improved performance in the channel estimation. It is shown that the
proposed SRS combined with the supplemental channel estimation algorithm guarantees lower mean
squared error (MSE) in channel estimation, which alleviates the pilot contamination problem.
The rest of the paper is organized as follows. In Section 2, we introduce the system model in
consideration. In Section 3, a brief review of SRS sequences in conventional LTE/LTE-A systems is
provided. We introduce the generation of proposed SRS sequences and the analysis on the resultant
correlation in Section 4. In Section 5, we propose the two-step channel estimation algorithm composed
of an LS estimation followed by a supplemental MMSE estimation. We also formulate and analyze the
MSE obtained for SRS sequences with an LS estimation. We verify the performances in various aspects
by computer simulations in Section 6 and conclude this paper in Section 7.
2. System Model
Consider a cell having a DU and R RUs each of which serves K UEs as shown in Figure 1,
where each RU has NT transmit and receive antennas. We index RU by r ∈ {0, · · · , R − 1}, and index
the UE served by RU r as r (k ), k ∈ {0, · · · , K − 1}. We consider orthogonal frequency division
multiplexing (OFDM) communications with Nc subcarriers between UE and RU. We suppose the
channel reciprocity, by which the DL channel from RU to UE can be estimated by using the UL pilots
sent from UEs under the constraint that the time delay from the UL channel estimation to the DL
transmission is less than the coherence time of the channel [7]. Then, the UL channel estimated
38
Electronics 2019, 8, 36
by sending SRS sequence from each UE to RU is used as the DL channel estimation. Let sr(k) =
[sr(k) [0] · · · sr(k) [ M − 1]] T denote the SRS sequence of UE r (k), where the superscript T denotes a
transpose of a vector. We also let hr(k),r [m] ∈ C NT ×1 denote the channel gain between UE r (k ) and
RU r over the m-th subcarrier. Then, the channel gains corresponding to M subcarriers, hr(k),r [m],
m = 0, · · · , M − 1, are estimated by using an SRS sequence.
The signal received by RU r over the subcarrier m is denoted by yr [m] ∈ C NT ×1 and obtained by
K −1 R −1 K −1
yr [ m ] = ∑ hr(k),r [m]sr(k) [m] + ∑ ∑ hr (k),r [m]sr (k) [m] + nr [m], (1)
k =0 r =0,r =r k =0
where nr [m] ∈ C NT ×1 is the zero-mean additive white Gaussian noise vector with covariance matrix
σn2 I NT × NT . Note that hr(k),r [m] = β r(k),r gr(k),r [m], where β r(k),r represents the large scale fading
while each entry of gr(k),r [m] ∈ C NT ×1 denotes the small scale fading represented by an independent
and identically distributed (i.i.d.) zero mean complex Gaussian random variable with unit variance.
Note that the large scale fading factor β2r(k),r = dr−(ka),r , where dr(k),r is a distance between UE r (k ) and
RU r , and a is an attenuation factor.
&HOO
'8
58U
KU VU N
N U
8(U KU 58Uc
N Uc
UN
U
8(Uc
Uc N Uc
39
Electronics 2019, 8, 36
Note that base sequences are divided into groups, where u is the group number associated with
the physical cell ID and the length of the SRS sequence, and v is the base sequence number within the
group. The SRS sequence x (α) [m] of length M is defined by applying a cyclic shift α, α ∈ {0, 1 · · · , 7},
to the base sequence x̄ [m] as
α
x (α) [m] = e j2π Lc m x̄ [m], 0 ≤ m < M, (5)
j2π Lαc
where Lc > 7 to obtain distinct values of e for different α ∈ {0, 1 · · · , 7}. For any αi and α j chosen
from {0, · · · , 7}, two sequences x(αi ) = [ x (αi ) [0] · · · x (αi ) [ M − 1]] T and x(α j ) = [ x (α j ) [0] · · · x (α j ) [ M − 1]] T
1 ( αi ) H ( α j )
are orthogonal if Mx x = δi− j , where δi− j = 1 if i = j and 0 otherwise, and the superscript H
M −1 j L c ( α j − α i ) m 2π
denotes a conjugate transpose of a vector. This condition reduces to M 1
∑m =0 e = δi− j by
using Equation (5) and the property | x̄ [m]| = 1 for all m, which is clear from Equations (2)–(4).
2
This condition is satisfied only if (α j − αi ) LMc is an even integer, or, in other words, Lc is a factor
of (α j − αi ) M
2 , for αi = α j ∈ {0, · · · , 7}. Consequently, Lc must be an integer that is greater than 7 and
is a factor of 12 M.
5%
VXEFDUULHU
1 VF5%
/
IUHTXHQF\
WLPH 656V\PERO
Multiple SRS sequences are defined from a single base sequence by using different values of
α and the decimation factor. In LTE/LTE-A, the decimation factor of two is used and the signal
occupies every second subcarrier within the allocated sounding bandwidth. By using distinct SRS
sequences obtained with different values of α and by using distinct sets of subcarriers as a result of
decimation, multiple UEs can estimate their channel gains and can be served by RU simultaneously.
40
Electronics 2019, 8, 36
In conventional LTE/LTE-A systems using α ∈ {0, · · · , 7} and the decimation factor of two, we can
obtain only 16 orthogonal sequences. In the multi-user systems with higher number of UEs than 16,
we need to design a larger set of SRS sequences having low cross-correlation.
where 0 ≤ s < LMc and 0 ≤ p < L p , and the last equality comes from Equation (5). Note that Lc is an
integer greater than 7 which can divide 12 M as introduced in Section 3 and L p is a prime number which
is smaller than Lc . For a given SRS sequence length M, we may generate up to M orthogonal sequences.
However, by using Equation (5), we can generate only eight orthogonal sequences with varying
s α
α = 0, · · · , 7. Thus, we use the phase rotation e j2π M m , 0 ≤ s < LMc , together with e j2π Lc m , 0 ≤ α < Lc ,
to generate M orthogonal SRS sequences without incurring pilot contamination. If the number of
UEs exceeds M, we need to generate extra SRS sequences instead of reusing already generated ones.
p
j2π m
For this purpose, we apply additional phase rotation e L p , 0 ≤ p < L p , where L p needs to be
coprime with M and thus be also coprime with Lc in order to make the resultant sequences distinct
from already generated M sequences. We empirically found that a prime number L p smaller than Lc
results in a good performance.
By Equations (2)–(4), we can rewrite Equation (6) as
⎧ ! "
⎨exp j2π p α qm(m+1)
(α,s,p)
s
+ + Lc m − , for 0 ≤ m < Lz ,
x [m] = ! M Lp 2Lz " (7)
⎩exp j2π p α q(m− Lz )(m− Lz +1)
M + + Lc m − for Lz ≤ m < M.
s
Lp 2Lz ,
where sr(k) [m] = x (α,s,p) [m], sr ( j) [m] = x (α ,s ,p ) [m] and the superscript ∗ represents the complex
conjugate of a complex variable. The detailed derivation of Equation (8) is provided in Appendix A.
It is clear that Cr(k),r ( j) obtained with s = s , p = p and α = α corresponds to the auto-correlation of
M −1 j ·0
sr(k) because sr ( j) = sr(k) , where Cr(k),r(k) = M
1
∑m =0 e = 1. The cross-correlation Cr(k),r ( j) = 0 if
p = p and either s = s or α = α , while Cr(k),r ( j) = 0 if p = p by the following reason.
M−1 j2πφm
Consider f (φ) = M 1
∑m =0 e , where f (φ) = 0 if φM is a nonzero integer, and f (φ) = 0,
−s
otherwise. If p = p , the rightmost side of Equation (8) becomes f s M + α L−c α . If s = s or α = α ,
s −s α −α
M + Lc M is a nonzero integer because M is a multiple of Lc as introduced in Section 3, and thus
−s −s
Cr(k),r ( j) = f s M + α L−c α = 0. On the other hand, if p = p , s M + p L−p p + α L−c α M cannot be a
41
Electronics 2019, 8, 36
s −s p −p
nonzero integer because ( p − p) M is not divisible by L p , and, thus Cr(k),r ( j) = f M + Lp + α L−c α =
0, where L p and M are coprime and | p − p | < L p .
Consequently, for each p, we can generate a set of M orthogonal SRS sequences, where 0 ≤ s < LMc
and 0 ≤ α < Lc enables the generation of M orthogonal sequences. Applying the phase rotation
p
j2π m
e L p , 0 ≤ p < L p , in Equation (7) enables to obtain L p sets of M orthogonal sequences. Any two
sequences obtained with different values of p are mutually correlated.
Distinct SRS sequences generated with different α, s and p by Equation (7) are assigned to different
UEs. The first set of M orthogonal sequences generated with p = 0 are assigned to the first M UEs.
Then, the next set of M orthogonal sequences generated with p = 1 are assigned to the next M UEs.
This procedure is repeated by increasing p until all UEs are assigned SRS sequences.
5. Channel Estimation
M −1
1
ĥr(k),r [m] = ĥr(k),r = ∑ sr∗(k) [i ]yr [i ]. (9)
M i =0
where sr∗(k) [i ]sr(k) [i ] = |sr(k) [i ]|2 = 1 is used. If the channel is actually block faded with length M,
the channel gain hr(k),r [m] is identical for all m = 0, · · · , M − 1 and thus, we can represent the channel
gain by hr(k),r . Then, Equation (10) can be rewritten as
K −1 R −1 K −1 M −1
1
ĥr(k),r = hr(k),r + ∑ hr( j),r Cr(k),r( j) + ∑ ∑ hr ( j),r Cr(k),r ( j) + M ∑ sr∗(k) [i ]nr [i ]. (11)
j=0,j =k r =0,r =r j=0 i =0
In case that all SRS sequences of UEs in the cell are mutually orthogonal, we have zero
cross-correlation between any two SRS sequences so that Equation (11) is simplified as
M −1
1
ĥr(k),r = hr(k),r + ∑ sr∗(k) [i ]nr [i ]. (12)
M i =0
We define the normalized mean squared error (MSE) of the channel estimation between UE r (k )
and RU r at subcarrier m as
42
Electronics 2019, 8, 36
K −1 R −1 M −1
1
2
σ̄MSE =
KRM ∑ ∑ ∑ 2
σMSE,r (k) [ m ] (14)
k =0 r =0 m =0
and use this as the performance metric of channel estimation. Note that, if the channel is actually
block faded with length M, the normalized MSE can be denoted by σMSE,r
2
(k) and Equation (14) can be
simplified as σ̄MSE
2 = KR
1
∑kK=−01 ∑rR=−01 σMSE,r
2
(k) . Under the assumption of block fading with length M,
the average MSE can be obtained from Equations (11), (13) and (14) as
K −1 R −1 Eĥr(k),r − hr(k),r 2
1
2
σ̄MSE =
KR ∑ ∑ β2r(k),r NT
k =0 r =0
# (15)
K −1 R −1 K −1 R −1 K −1
1 1 1 2
=
KR ∑ ∑ β2r(k),r
∑ β2r( j),r |Cr(k),r( j) |2 + ∑ ∑ β2r ( j),r |Cr(k),r ( j) |2 + σ
M n
,
k =0 r =0 j=0,j =k r =0,r =r j=0
We consider the reuse of M orthogonal SRS sequences to UEs repeatedly without applying the
p
j2π m
phase rotation e L p in Equation (7). We suppose that each RU in the cell serves the equal number
of UEs and distinct set of M
R orthogonal sequences are assigned repeatedly to UEs in each RU. Then,
we obtain a correlation matrix C ∈ C KR×KR of SRS sequences whose (i, j)-th entry is defined by
1, if |i − j| = kM, k = 0, · · · , KR/M,
Cij = (17)
0, else.
Every other M R column and row of C correspond to UEs in the same RU. Due to the reuse of
M
R orthogonal sequences for each RU, SRS sequences used for different RUs are always mutually
orthogonal. It follows that Cr(k),r ( j) = 0 for all k, j and r = r and Equation (16) becomes
K −1 R −1 K −1
1 σn2
2
σ̄MSE =
KR ∑ ∑ ∑ |Cr(k),r( j) |2 +
β21 M
. (18)
k =0 r =0 j=0,j =k
From Equation (17) and the allocation rule of orthogonal sequences to UEs introduced above,
we obtain
⎧
1 K −1 R −1 K −1 ⎨0, if KR ≤ M,
∑ ∑ ∑
KR k=0 r=0 j=0,j =k
|Cr(k),r( j) | =
2
⎩ 1 ∑ M 2(KR − iM), if KR > M.
KR (19)
KR i =1
43
Electronics 2019, 8, 36
R −1 K −1 M −1
1
ĥr(k),r = ∑ ∑ hr ( j),r Cr(k),r ( j) + M ∑ sr∗(k) [i ]nr [i ], (21)
r =0 j =0 i =0
where 0 ≤ r < R and 0 ≤ k < K. This can be expressed by using matrix forms as
Ĥ = HC + N, (22)
where Ĥ ∈ C NT ×KR , H ∈ C NT ×KR and N ∈ C NT ×KR are matrices whose columns are ĥr(k),r , hr(k),r
and nr(k),r , respectively, 0 ≤ r < R and 0 ≤ k < K. We let C ∈ C KR×KR denote a correlation matrix
whose entries are Cr(k),r ( j) , where 0 ≤ r, r < R and 0 ≤ k, j < K. We define two classes of UEs.
The first one is the set of UEs that are assigned mutually orthogonal SRS sequences. UEs assigned
mutually correlated SRS sequences compose the second class.
We reorder Ĥ, H, C and N and partition
I 0
them as Ĥ = Ĥu Ĥc , H = Hu Hc , C = and N = Nu Nc , where submatrices
0 A
with subscripts u and c correspond to the first and the second class of UEs, respectively, and A is a
non-diagonal matrix. Then, Equation (22) can be rewritten in partitioned forms as
I 0
Ĥu Ĥc = Hu Hc + Nu Nc , (23)
0 A
which implies
Ĥu = Hu + Nu , (24)
Ĥc = Hc A + Nc . (25)
It is clear from Equation (24) that ĥr(k),r = hr(k),r + nr(k),r for the first class of UEs. So, ĥr(k),r
is considered to estimate sufficiently the actual channel hr(k),r of UE r (k ) in the first class. On the
other hand, as can be seen from Equation (25), for the second class of UEs, ĥr(k),r includes the linear
combination of other UEs’ channels as well. Thus, we need a supplemental procedure to obtain
the more reliable channel estimations for UEs in the second class. For this purpose, we propose the
correlation-aided channel estimation which applies the MMSE algorithm [27] to the estimated channels
for the second class of UEs as the supplemental procedure. We obtain the refined channel estimation
˜ for UEs in the second class by multiplying the MMSE nulling matrix W to Ĥ as
Ĥ c c
˜ = Ĥ W,
Ĥ (26)
c c
44
Electronics 2019, 8, 36
where −1
W = A H AA H + B−1 I (27)
and B is a diagonal matrix whose diagonal entries are β2r(k),r corresponding to UEs in the second
class. Then, after ˜
refinement of Ĥc to Ĥc by Equation (26), the channel estimation Ĥ is replaced by
˜
Ĥ = Ĥu Ĥc .
In summary, first, we apply the LS estimation to obtain ĥr(k),r for all UEs. Next, we define the
first and the second classes of UEs based on the correlation matrix C. The first class is defined by UEs
whose SRS sequences are mutually orthogonal. The remaining UEs form the second class. Then, for the
second class of UEs, we apply additionally the MMSE algorithm to the channel estimation obtained by
LS scheme as the supplemental procedure. Finally, we use the result of LS estimation for the first class
and the result of supplement estimation for the second class as the channel estimation. If the number
of UEs is less than or equal to the number of mutually orthogonal SRS sequences, we only need to
perform a LS estimation as introduced in Equation (9). Otherwise, we need to apply the proposed
correlation-aided supplemental estimation after performing the LS estimation.
6. Numerical Results
We evaluate the performances of the proposed SRS and the proposed supplemental channel
estimation algorithm in terms of MSE in the multi-user m-MIMO DAS environment. For comparison,
the MSE obtained by using conventional LTE/LTE-A SRS scheme is also shown. As the proposed
scheme, we consider the following three types:
(a) using repeatedly M orthogonal sequences generated by varying α and s without applying the
p
j2π L p m
phase rotation e in Equation (7),
p
j2π m
(b) using the phase rotation e L p together with varying α and s for SRS generation, but not using
the supplemental correlation-aided channel estimation,
p
j2π m
(c) using the phase rotation e L p together with varying α and s for SRS generation, and the
supplemental correlation-aided channel estimation as well.
We consider the large scale fading factor between UEs and serving RU as β 1 and the large scale
fading factor between UEs and other neighboring RUs as β 2 , i.e., β r(k),r = β 1 and β r(k),r = β 2 for all k,
r and r = r. We let β 2 /β 1 = 0.7692, which comes from the assumption that the distance of UE from
serving RU is 1.3 times shorter than the distance from other neighboring RUs, where the attenuation
factor is assumed to be a = 2. We let each RU serve the same number of UEs. We choose Lc = 8 in
p
j2π m
Equations (5) and (7), and L p = 3 when applying the phase rotation e L p in types (b) and (c) of
the proposed scheme. We consider a block fading channel, International Telecommunication Union
Radiocommunication Sector (ITU-R) Ped A and ITU-R Veh A channels [28], where ITU-R Ped A [29]
and ITU-R Veh A channels are examples of frequency selective channels. The simulation parameters
used for performance evaluation are listed in Table 1.
We depict the average MSE of channel estimation with respect to the total number of UEs
in the cell obtained for block fading channel, ITU-R Ped A channel and ITU-R Veh A channel in
Figures 3–5, respectively. In each figure, we plot the average MSE obtained by using conventional SRS
in LTE/LTE-A systems and the proposed scheme of three types.
We also plot the average MSE obtained by applying supplemental MMSE estimation to all UEs
instead of applying MMSE selectively only to UEs belonging to the second class, where the phase
p
j2π m
rotation e L p is applied and the corresponding curves are marked with ‘nonselective supplemental
MMSE.’ In Figure 3, we include the average MSE predicted analytically by Equation (20) for the
proposed scheme of type (a) over block fading channel.
45
Electronics 2019, 8, 36
Parameters Values
number of subcarriers (Nc ) 1024
RB )
number of subcarriers per RB (Nsc 12
48 (proposed scheme)
SRS length (M)
24 (LTE/LTE-A with a decimation factor of 2)
sampling time 0.1 μs
average SNR (β21 /σn2 ) 0 dB
channel model block fading, ITU-R Ped A, ITU-R Veh A
number of RUs (R) 4
number of RU antennas (NT ) 8
0
average MSE [dB]
-5
-10
LTE/LTE-A
proposed, type (a)
-15
proposed, type (b)
proposed, type (c)
analytic prediction for type (a)
nonselective supplemental MMSE
-20
0 20 40 60 80 100 120
total number of UEs
Figure 3. Average MSE of channel estimation with respect to the total number of UEs over block fading channel.
0
average MSE [dB]
-5
-10
LTE/LTE-A
-15
proposed, type (a)
proposed, type (b)
proposed, type (c)
nonselective supplemental MMSE
-20
0 20 40 60 80 100 120
total number of UEs
Figure 4. Average MSE of channel estimation with respect to the total number of UEs over ITU-R Ped A channel.
46
Electronics 2019, 8, 36
2
average MSE [dB]
-2
-4
-6
-8 LTE/LTE-A
proposed, type (a)
proposed, type (b)
-10 proposed, type (c)
nonselective supplemental MMSE
-12
0 20 40 60 80 100 120
total number of UEs
Figure 5. Average MSE of channel estimation with respect to the total number of UEs over ITU-R Veh
A channel.
It is observed that the channel estimation MSE obtained by using any SRS scheme rapidly
increases when the number of UEs in service exceeds the number of orthogonal SRS sequences, where
the conventional LTE/LTE-A scheme and the proposed scheme generate 16 and M (=48) orthogonal
SRS sequences, respectively. For more than 48 UEs, the proposed scheme also provides much lower
channel estimation MSE than LTE/LTE-A scheme. Even reusing repeatedly M (=48) orthogonal SRS
sequences generated in type (a) can lower significantly the channel estimation MSE for any number of
p
j2π m
UEs. It is observed that applying phase rotation e L p to SRS generation further lowers the channel
estimation MSE, and the use of supplemental selective MMSE estimation can even further improve
the channel estimation performance at the cost of increasing complexity. Note that the computational
complexity of supplemental selective MMSE estimation is, in general, O(n3 ) because it requires the
matrix inversion, where n is the dimension of a square matrix A. Considering, as a reference, the case
that M orthogonal sequences are repeatedly used to generate SRS sequences, the dimension of A is
determined from Equation (17) as n = min(KR, 2 max(0, KR − M)). Applying supplemental MMSE
estimation nonselectively to all UEs degrades the performance when the number of UEs is not high
enough. In Figure 3, the analytic prediction of MSE for block fading channel is observed to match
the numerical result very well. It is clear that the proposed scheme of type (c) shows the best channel
estimation performance in terms of MSE for all numbers of UEs. The MSE gain of the proposed scheme
of type (c) over LTE/LTE-A system is about 6 dB in block fading and ITU-R Ped A channels, and about
8 dB in ITU-R Veh A channel when serving 120 UEs through four RUs. The performance gain achieved
by using the proposed scheme can be observed in block fading channel as well as frequency selective
channel.
Conventional LTE/LTE-A systems may not effectively employ m-MIMO transmission for
the service of a large number of UEs mainly due to the pilot contamination problem. However,
the proposed SRS and the supplemental channel estimation can relieve the pilot contamination
problem in a significant level. It is expected that the proposed scheme can be effectively adopted in
multi-user m-MIMO systems.
7. Conclusions
In this paper, we proposed a generation of SRS sequences resulting in improved channel estimation
performance without increasing the sequence length. The proposed SRS can easily be generated
47
Electronics 2019, 8, 36
by imposing a phase rotation to the base sequence. Even though the proposed SRS sequences
are mutually correlated when the number of UEs is higher than the sequence length, it lowers
significantly the MSE of channel estimation and thus alleviates the pilot contamination problem.
We also proposed a supplemental correlation-aided channel estimation scheme to further improve
the channel estimation performance of multi-user m-MIMO technology in DAS environments. It is
expected that the proposed SRS scheme and the supplemental channel estimation scheme can be
effectively adopted in m-MIMO systems.
Author Contributions: Writing, J.W.L.; Software, S.Y.; Conceptualization, J.W.L.; Supervision, J.W.L.
Funding: This research was supported by the Chung-Ang University Graduate Research Scholarship in 2017
and by the Basic Science Research Program through the National Research Foundation of Korea funded by the
Ministry of Education (NRF-2016R1D1A1B03933174).
Conflicts of Interest: The authors declare no conflict of interest.
M −1
1 1
Cr(k),r ( j) s Hs = ∑ sr∗(k) [m]sr ( j) [m]
M r (k) r ( j) M m =0
L z −1 qm(m+1)
+ Lαc m− 2Lz
qm(m+1) p
M + Lp
s
+ Lαc m− 2Lz
p
1 − j2π M + Lp
s j2π
=
M ∑ e e
m =0
M −1 q(m− Lz )(m− Lz +1) p
+ Lαc m−
q(m− Lz )(m− Lz +1)
M + Lp
s
+ Lαc m−
p
1 − j2π M + Lp
s j2π
∑
2Lz
+ e 2Lz
e
M m= Lz
M−1 − j2π s−s +
p− p
+ α− α
1 m
∑
M Lp Lc
= e .
M m =0
K −1 R −1 Eĥr(k),r − hr(k),r 2
1
2
σ̄MSE =
KR ∑ ∑ β2r(k),r NT
k =0 r =0
$ $2
1 K −1 R −1 1 $ K −1 R −1 K −1
1 M −1 ∗ $
$ $
= ∑ ∑
KRNT k=0 r=0 β r(k),r
2
E $
$ ∑ h C
r ( j),r r (k ),r ( j) + ∑ ∑ h C
r ( j),r r (k ),r ( j) + ∑
M i =0
s r ( k ) [ i ] n r [ i ] $
$
j=0,j =k r =0,r =r j=0
1 K −1 R −1 1 K −1 R −1 K −1
= ∑ ∑
KRNT k=0 r=0 β r(k),r j=0,j =k
2 ∑ Ehr( j),r 2 |Cr(k),r( j) |2 + ∑ ∑ Ehr ( j),r 2 |Cr(k),r ( j) |2
r =0,r =r j=0
#
1 M −1
+ 2 ∑ Enr [i ] 2
M i =0
#
1 K −1 R −1 1 K −1 R −1 K −1
1 M −1
KRNT k∑ ∑ 2 ∑ NT βr( j),r |Cr(k),r( j) | + ∑ ∑ NT βr ( j),r |Cr(k),r ( j) | + M2 ∑ NT σn
= 2 2 2 2 2
=0 r =0 β r (k),r j=0,j =k r =0,r =r j=0 i =0
#
K −1 R −1 K −1 R −1 K −1
1 1 1
KR k∑ ∑ 2 ∑ β2r( j),r |Cr(k),r( j) |2 + ∑ ∑ β2r ( j),r |Cr(k),r ( j) |2 + M σn2 .
=
=0 r =0 β r (k),r j=0,j =k r =0,r =r j=0
48
Electronics 2019, 8, 36
References
1. Ge, X.; Tu, S.; Mao, G.; Wang, C.-X.; Han, T. 5G ultra-dense cellular networks. IEEE Wirel. Commun. 2016,
23, 72–79. [CrossRef]
2. Ge, X.; Cheng, H.; Guizani, M.; Han, T. 5G wireless backhaul networks: Challenges and research advances.
IEEE Netw. 2014, 28, 6–11. [CrossRef]
3. Nakamura, T.; Nagata, S.; Benjebbour, A.; Kishiyama, Y.; Hai, T.; Xiaodong, S.; Ning, Y.; Nan, L. Trends in
small cell enhancements in LTE Advanced. IEEE Commun. Mag. 2013, 51, 98–105. [CrossRef]
4. Chin, W.H.; Fan, Z.; Haines, R. Emerging technologies and research challenges for 5G wireless networks.
IEEE Commun. Mag. 2014, 21, 106–112. [CrossRef]
5. Angelis, C.T.; Chronopoulos, S.K. System performance of an LTE MIMO downlink in various fading
environments. In Proceedings of the Ambient Media and Systems, Berlin, Germany, 27–28 January 2011;
pp. 36–43._5. [CrossRef]
6. Votis, C.; Christofilakis, V.; Raptis, V.; Tatsis, G.; Chronopoulos, S.K.; Kostarakis, P. Design and analysis of a
multiple-output transmitter based on DDS architecture for modern wireless communications. AIP Conf. Proc.
2010, 1203, 421–426. [CrossRef]
7. Marzetta, T. Noncooperative cellular wireless with unlimited numbers of base station antennas. IEEE Trans.
Wirel. Comm. 2010, 9, 3590–3600. [CrossRef]
8. Rusek, F.; Persson, D.; Lau, B.; Larsson, E.; Marzetta, T.; Edfors, O.; Tufvesson, F. Scaling up MIMO:
Opportunities and challenges with very large arrays. IEEE Signal Process. Mag. 2013, 30, 40–60. [CrossRef]
9. Larsson, E.G.; Tufvesson, F.; Edfors, O.; Marzetta, T.L. Massive MIMO for next generation wireless systems.
IEEE Commun. Mag. 2014, 52, 186–195. [CrossRef]
10. Pitarokoilis, A.; Mohammed, S.K.; Larsson, E.G. On the optimality of single-carrier transmission in large-scale
antenna systems. IEEE Wirel. Commun. Lett. 2012. 1, 276–279. [CrossRef]
11. Lu, L.; Li, G.Y.; Swindlehurst, A.L.; Ashikhmin, A.; Zhang, R. An overview of massive MIMO: Benefits and
challenges. IEEE J. Sel. Top. Signal Process. 2013, 8, 742–758. [CrossRef]
12. Bjornson, E.; Larsson, E.G.; Marzetta, T.L. Massive MIMO: Ten myths and one critical question.
IEEE Commun. Mag. 2016, 54, 114–123. [CrossRef]
13. Zuo, J.; Zhang, J.; Yuen, C.; Jiang, W.; Luo, W. Multicell multiuser massive MIMO transmission with downlink
training and pilot contamination precoding. IEEE Trans. Veh. Technol. 2016, 65, 6301–6314. [CrossRef]
14. Ge, X.; Zi, R.; Wang, H.; Zhang, J.; Jo, M. Multi-user massive MIMO communication systems based on
irregular antenna arrays. IEEE Trans. Wirel. Commun. 2016, 15, 5287–5301. [CrossRef]
15. Gopalakrishnan, B.; Jindal, N. An analysis of pilot contamination on multi-user MIMO cellular systems with
many antennas. In Proceedings of the Signal Processing Advances in Wireless Communications (SPAWC),
San Francisco, CA, USA, 26–29 June 2011, doi:10.1109/SPAWC.2011.5990435.
16. Heath, R.; Peters, S.; Wang, Y.; Zhang, J. A current perspective on distributed antenna systems for the
downlink of cellular systems. IEEE Commun. Mag. 2013, 51, 161-167. [CrossRef]
17. Chen, X.; Zhang, Z.; Chen, H.-H. On distributed antenna systems with limited feedback precoding:
opportunities and challenges. IEEE Wirel. Commun. 2010, 17, 80-88. [CrossRef]
18. Yang, C.; Han, S.; Hou, X.; Molisch, A. How do we design CoMP to achieve its promised potential?
IEEE Wirel. Commun. 2013, 20, 67–74. [CrossRef]
19. Kobayashi, M.; Jindal, N.; Caire, G. Training and feedback optimization for multiuser MIMO downlink.
IEEE Trans. Commun. 2011, 59, 2228–2240. [CrossRef]
20. Elijah, O.; Leow, C.Y.; Abdul Rahman, T.; Nunoo, S.; Zakwoi Iliya, S. A comprehensive survey of pilot
contamination in massive MIMO-5G system. IEEE Commun. Surv. Tuts. 2016, 18, 905–923. [CrossRef]
21. Yin, H.; Gesbert, D.; Filippou, M.; Liu, Y. A coordinated approach to channel estimation in large-scale
multiple-antenna systems. IEEE J. Sel. Areas Commun. 2013, 31, 264–273. [CrossRef]
22. Ngo, H.Q.; Larsson, E.G. EVD-based channel estimations for multicell multiuser MIMO with very large
antenna arrays. In Proceedings of the 2012 IEEE International Conference on Acoustics, Speech and Signal
Processing (ICASSP), Kyoto, Japan, 25–30 March 2012; doi:10.1109/ICASSP.2012.6288608.
23. Jose, J.; Ashikhmin, A.; Marzetta, T.L.; Vishwanath, S. Pilot contamination and precoding in multi-cell TDD
systems. IEEE Trans. Wirel. Commun. 2011, 10, 2640–2651. [CrossRef]
49
Electronics 2019, 8, 36
24. 3GPP Technical Specification 36.211. Evolved Universal Terrestrial Radio Access (E-UTRA) Physical Channels
and Modulation. Available online: www.3gpp.org (accessed on 20 November 2018).
25. 3GPP Technical Specification 36.213. Evolved Universal Terrestrial Radio Access (E-UTRA) Physical Layer
Procedures. Available online: www.3gpp.org (accessed on 20 November 2018).
26. Coleri, S.; Ergen, M.; Puri, A.; Bahai, A. Channel estimation techniques based on pilot arrangement in OFDM
systems. IEEE Trans. Broadcast. 2002, 48, 223–229. [CrossRef]
27. Jafarkhani, H. Space-Time Coding: Theory and Practice; Cambridge University Press: Cambridge, UK, 2005;
pp. 229–230, ISBN 0-521-84291-3.
28. Recommendation ITR-R M.1225. Guidelines for Evaluation of Radio Transmission Technologies for IMT-2000;
Int. Telecommun. Union (ITU): Geneva, Switzerland, 1997.
29. Chronopoulos, S.K.; Christofilakis, V.; Tatsis, G.; Kostarakis, P. Performance of turbo coded OFDM under the
presence of various noise types. Wirel. Pers. Commun. 2016, 87, 1319–1336. [CrossRef]
c 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access
article distributed under the terms and conditions of the Creative Commons Attribution
(CC BY) license (http://creativecommons.org/licenses/by/4.0/).
50
electronics
Article
Efficient Pilot Decontamination Schemes in 5G
Massive MIMO Systems
Omar A. Saraereh 1 , Imran Khan 2 , Byung Moo Lee 3, * and Ashraf Tahat 1
1 Communication Engineering Department, King Abdullah II School of Engineering, Princess Sumaya
University for Technology PSUT, Amman 11941, Jordan; [email protected] (O.A.S.);
[email protected] (A.T.)
2 Department of Electrical Engineering, University of Engineering & Technology, Peshawar 814, Pakistan;
[email protected]
3 School of Intelligent Mechatronics Engineering, Sejong University, Seoul 05006, Korea
* Correspondence: [email protected]
Keywords: Massive MIMO; pilot decontamination; MSE; dynamic user scheduling; dynamic
pilot allocation
1. Introduction
With the advent of the era of big data and increasing demand by the explosion of growing
numbers of subscribers, the demand for communication networks has exploded, and the existing
mobile communication networks (4G) are increasingly unable to meet the needs of users for the
network [1]. Massive MIMO is a key technology for 5G wireless communications to increase the
spectral efficiency [2–4]. It has the ability to be deployed in various communications paradigms such as
multicarrier communication, Orthogonal Frequency Division Multiplexing (OFDM) and cooperative
52
Electronics 2019, 8, 55
the downlink reachability and rate of the entire system. Pilot contamination is a problem inherent in a
massive MIMO system. The method of solving pilot contamination in different situations is not unique.
Ideally, massive MIMO should use Fully Orthogonal Pilot Scheduling (FOPS) to assign orthogonal
pilots to each user, but the length of the pilot sequence and the pilot set size are limited by the
channel coherence time. In a typical scenario, the maximum number of orthogonal pilot sequences
in a 1ms coherence time is about 200 [23]. Therefore, a massive MIMO system generally adopts
Fully Reused Pilot Scheduling (FRPS). Since the pilot between users is non-orthogonal or identical,
pilot contamination is unavoidable [16]. When the number of BS antennas tends to be infinite and
there is no cooperation, the main factor affecting the system performance is inter-cell interference (ICI)
caused by pilot contamination [23], so pilot contamination is critical to the performance improvement
of massive MIMO systems.
At present, massive MIMO pilot decontamination is mainly carried out from three aspects:
channel estimation method [24–28], matrix precoding method case [29,30], and pilot allocation
strategy [21]. The authors in [25] utilize a diagonal Jacket matrix for pilot reduction which has
low complexity, excellent eigenvector and a constant diagonal treatment and an energy harvest.
The drawback of such a method is that it assumes perfect multipath fast channel estimation. In [26],
the authors propose a pilot mitigation algorithm using a low-rate coordination between cells during
the channel estimation phase itself. The coordination makes use of the additional second-order
statistical information about the user channels, which are shown to offer a power way of discriminating
across interfering user with even strongly correlated pilot sequences. The authors in [27] propose
a highly efficient Discrete Fourier Transform (DFT) based approximation of the Linear Minimum
Mean Square Error (LMMSE) estimator for reducing the pilot contamination problem. The authors
in [28] propose an eigenvalue-decomposition-based approach to channel estimation, that estimates
the channel blindly from the received data. The approach exploits the asymptotic orthogonality
of the channel vectors in massive MIMO systems. It also deploys a short training sequence to
resolve the multiplicative ambiguity of the received signals covariance. In [29], the authors propose
a zero forcing (ZF) time-shifted pilot scheme, which was known to mitigate the pilot contamination
effectively using conjugate beamforming. The authors in [30] proposed interreference-cancellation (IC)
precoding scheme for pilot contamination mitigation in massive MIMO systems. They investigated the
quality-of-service (QoS) guaranteed user-scheduling which is improved by deploying their proposed
scheme. The authors in [31] pilot contamination reduction scheme which is dependent on complex
exponential basis expansions. The Linear Time-Varying (LTV) channel is estimated and the optimal
pilot symbols are derived following the minimum mean square error (MMSE) criterion and it is shown
that the optimal pilot strategy is to group consecutive pilot tones together as a pilot cluster and to
distribute uniformly all pilot clusters in frequency-domain. Depending on the pilot allocation strategy,
the research found that: In [30], under the principle of maximizing Signal Leakage Noise Ratio (SLNR)
precoding, the same guidance is adopted for users with small ICI. Users with high frequency and
mutual interference user orthogonal pilots to improve the overall performance of the system in the
case of pilot contamination. However, the pilot scheduling scheme requires large-scale cooperation
between BSs and needs to know the large-scale fading factor of each user. Such a factor is very
difficult in a massive MIMO system with its own complex structure. The authors in [32] proposed
a pilot allocation strategy depending on power control, which makes the pilot transmit time slots
between cells with relatively large crossover gains staggered, but it is not easy to ensure pilot dynamic
synchronization of several cells. The choice of mechanism will directly affect the performance of
the strategy. The authors in [33] proposed a pilot coordinated allocation scheme, which allocates
pilot sequences by identifying pilot usage conditions and selects user multiplexed pilots that are least
affected by pilot contaminations, thereby reducing pilot contamination, but inter-cell cooperation to
system brings additional burdens and expenses. The authors in [34] proposed an improved strategy
depending on soft pilot multiplexing. On the basis of soft pilot multiplexing technology, packet
parameters are introduced for secondary grouping, but the path loss factor, shadow fading, and the size
53
Electronics 2019, 8, 55
of each user need to be known. The parameters such as the scale fading factor are more complicated,
and the computational complexity of implementing the secondary grouping is higher.
3. System Model
Figure 1. Massive MIMO multi-cell multi-user TDD system model.
54
Electronics 2019, 8, 55
At the beginning of each coherence time, all users in all cells simultaneously transmit pilot sequences.
At the beginning of each coherence time, all users in all cells simultaneously transmit pilot sequences.
Suppose ψi = (ψ1i , ψ2i , . . . , ψKi ) T is the K × τ dimension pilot sequence matrix of all users in ith
cell (τ is the sequence length), which satisfies ψi ψiH = IK , where IK is the unit matrix of order K × K.
Under the FRPS policy, the pilot matrix received by the uplink cell BS is:
L
√
∑ Hij ψi
p p
yi = pp + ni (3)
j =1
p
where, p p is the pilot signal transmission power and ni is the additive white Gaussian noise matrix
of order M × τ. After receiving the pilot signal, the BS starts uplink channel estimation. The channel
estimation value Ĥii of the target cell is obtained by using the LS channel estimation introduced earlier
which is expressed as:
1 p 1 p
Ĥii = √ yi ψiH = Hii + ∑ Hij + √ ni ψiH (4)
pp j =i
pp
It can be seen from Equation (4) that the ith cell channel estimation value Ĥii includes the
superposition of the channel propagation matrix of other cells to the target cell in addition to the
influence of the target channel and noise, which is pilot contamination.
After the user sends the pilot, all users send data signals to the BS and use the same time-frequency
resource. The signal received by the ith cell BS is defined as:
√ L K
yiu = pu ∑ ∑ hijk xujk + niu (5)
j =1 k =1
where x ujk is the transmitted data symbol of user k in jth cell; pu is the uplink user data symbol average
transmit power; hijk represents the channel transmission vector of user k to ith cellBS in jth cell, which
is the kth column of Hij ; niu is the additive white Gaussian noise vector of order M × 1. The BS uses the
channel estimation value Ĥii of the Equation (4) and the received signal vector yiu , and the MF detects
the original data symbol x̂ ujk transmitted by the user k in the ith cell is expressed as:
H
L
√ L K
x̂ ujk = ĥiik yiu = ∑ hijk
H
+ vik
H
pu ∑ ∑ hijk xujk + niu (6)
j =1 j =1 k =1
p √
where vik is the column vector of the matrix ni ψiH / p p . When the number of BS antennas M
approaches positive infinity, it is easy to know from Equation (2) that the channel of the massive MIMO
system exhibits progressive orthogonality, and the progressive expression of Equation (6) is:
√
u
x̂ik ≈ M pu β iik x ujk + ∑ β ijk x ujk (7)
j =i
It can be known from Equation (7) that when the BS antenna M → ∞ , the data symbol
u will not be affected by the small-scale fading factor and noise. Therefore, when M → ∞ ,
x̂ik
the signal-to-interference and noise ratio (SINR) of the user k uplink received signal in the ith cell can
be defined as:
β2iik
u
SI NRik = (8)
∑ j =i β2ijk
It can be seen from Equation (8) that due to the existence of pilot contamination, the uplink SINR
is limited by the large-scale fading factor of the same pilot user in the interfering cell. According to
Equation (8), the uplink achievable rate of user k in the target ith cell is given by:
55
Electronics 2019, 8, 55
u
Cik = (1 − μ0 ) E{lb(1 + SI NRik
u
)} (9)
where μ0 is the pilot overhead coefficient for full multiplexing, indicating the spectral efficiency loss
caused by the pilot sequence used for channel estimation, when other pilot allocation algorithms are
used, the adjustment in Equation (9) is to be appropriately made.
Let the central user set be Uc , the edge user set be Ue , the central user pilot set be ψc , and the edge
T
user pilot set be ψe , the pilot set of jth cell is ψj = ψ1j , ψ2j , . . . , ψKj , j = 1, 2, . . . L, then:
1 → ψki ⊆ ψc and ψkj ⊆ ψc
H
ψki ψkj = (11)
0 → otherwise
Then, the pilot vector received by the target ith cell BS is:
L K L
√
∑ ∑ ∑
p p
yi = pp hijk ψkj + hijk ψkj + ni (12)
j =1 k =1, k ∈Uc k =1, k ∈Ue
For the target ith cell user k, the Least Square (LS) channel estimation is used to obtain the target
channel estimation value:
1 p
ĥiik = √ yi ψkiH (13)
pp
Combining Equations (12) and (13), when the number of antennas M → ∞ the SINR of the central
user under the IPLUG algorithm is:
2
H
hiik hiik
uc
SI NRik = 2 (14)
H
∑ j =i, ψki ⊆ψc , ψkj ⊆ψc h jik h jik
The edge user is not affected by the pilot contamination, so when M → ∞ , the edger user
ue → ∞ .
SI NRik
Figure 3 illustrates the flowchart for the IPLUG algorithm.
56
Electronics 2019, 8, 55
Start
Initialize parameters
d ≥ λR
No
Yes
Add the user to Edger user Add the user to Center user
group group
End
57
Electronics 2019, 8, 55
3.4.1. Uplink
Assume that all users in any cell send a pilot sequence of length τ, if the pilot transmitted by K
users in the jth cell is ψj = φj1 φj2 . . . φjK , where φjk ∈ Cτ ×1 is the pilot transmitted by the kth user in
H φ = 1), when the average transmit power of the user is ρ , the pilot signal received by the
the cell (φjk jk r
lth cell BS is:
√ L
Yl = ρr τ ∑ Gjl ψjT + nl (15)
j =1
where Gjl = Hjl D jl is the channel between all users of the jth cell and the lth cell BS. Let β jlk
denote the large-scale fading coefficient of the user k of the jth cell to the lth cell BS, then D jl can be
expressed as a diagonal matrix, and the diagonal element is β jl = β jl1 , β jl2 , . . . , β jlK , nl denotes
the additive white Gaussian noise of the lth cell. The term Hjl can be determined from the following
matrix expression:
⎡ ⎤
h j1 l1 · · · h j1 l M
⎢ ⎥
Hjl = ⎣ ... ···
..
. ⎦ (16)
h jK l1 · · · h jK l M
The elements of Hjl are independently and identically distributed (i.i.d), and h jk lm represents a small-
scale fading coefficient between the kth user in the jth cell and the mth antenna of the BS in the lth cell.
The BS estimates the channel using the received pilot signal, which is known from the standard
results in the estimation theory [24]. When the number of BS antennas tends to infinity, the MMSE
estimate of the channel is:
−1
√ L
Ĝjl = ρr τYl I + ρr τ ∑ ψi∗ Dil ψiT ψ∗j D jl (17)
i =1
3.4.2. Downlink
Considering the BS of the lth cell, it is assumed that the information symbol transmitted by the lth
cell to the user is Sl = [Sl1 Sl2 . . . SlK ] T , and Al = f Ĝll is a linear precoding matrix of order M × K,
where f (·) represents a specific linear precoding technique at the BS. After precoding, the signal
matrix sent by the BS can be represented as Al Sl , and the BS satisfies the average power constraint,
+ ,
that is, tr E Al Sl SlH AlH ≤ P, then the data information received by the user of the jth cell can be
expressed as:
L
-
X j = ρ f ∑ GjlT Al Sl + n j (18)
l =1
where ρ f is the downlink transmit power and n j is the AWGN noise of the corresponding cell.
Assuming a simple MF precoding technique at the BS, i.e., Al = Ĝll∗ , Equation (25) can be expressed as:
⎛ ⎞∗
- L
- L L
Xj = ρf ∑ GjlT Ĝll∗ Sl + nj = ρ f ρr τ ∑ GjlT ⎝ρr τ ∑ Gj l ψjT + n l ⎠ Z ∗ Sl + n j (19)
l =1 l =1 j =1
−1
where Z = I + ρr τ ∑iL=1 ψi∗ Dil ψiT ψ∗j D jl . When the number of BS antennas increases and tends to
infinity then:
−1
1 √ L L
√ X j → ρr τ ∑ D jl ψlH ψj + ρr τ ∑ ψlH ψi Dil∗ ψiH ψj Dll∗ Sl (20)
M ρ f ρr τ l =1 i =1
58
Electronics 2019, 8, 55
It can be obtained from Equation (20) that when all cells multiplex the same set of orthogonal
pilots, the right side of Equation (20) can be equated to:
L L −1
At this time, the signal received by the jth cell will be interfered by the signals transmitted by
other cells, especially when the large-scale fading coefficient is large; this phenomenon is called pilot
contamination. When all cells use orthogonal pilots, the right side of Equation (20) can be equated to:
−1
D jj I + ρr τD ∗jj D ∗jj Sl (22)
In this case, the signal received by the user is a scaling of the signal transmitted by the BS of the cell,
and there is no pilot contamination. However, in actual communication, because the coherence interval
is limited, it cannot guarantee that all cells can adopt an orthogonal pilot. Therefore, pilot contamination
has become an important factor limiting the performance of massive MIMO systems.
where xi indicates the ith shift register; c0 = c1 = 1, for any i = 0 and i = n, ci ∈ {0, 1}.
Figure 4. Pseudo-random code linear feedback shifter register.
Since the pilot contamination is generated because different cells multiplex the same set of
orthogonal pilots and the pseudo-random sequences have good cross-correlation characteristics,
this paper uses this feature to improve the orthogonality of pilots between different cells. Depending
on the above considerations, this paper proposes a pilot design scheme depending on pseudo-random
code: in the case that all cells multiplex the same set of orthogonal pilots, the pseudo-random code is
used as the code sequence for distinguishing different cells, for each cell allocates a pseudo-random
code with different delays, and synchronously scrambles the pseudo-random code in each cell to the
user pilot of the corresponding cell to obtain a new pilot.
59
Electronics 2019, 8, 55
The pilot design scheme depending on the pseudo-random code is shown in Figure 5. Since the
flow of the pilot design of different cells is the same, in order to facilitate the representation without
loss of generality, the pilot design scheme is specifically described here. Assume that the pilot of the
kth user in a cell is φk = [φk1 , φk2 , . . . , φkτ ] T , and the pseudo-random sequence generated by the linear
feedback shifter register in the corresponding cell is m = [m1 , m2 , . . . , m N ]. The scrambler in Figure 5
indicates that the pseudo-random matrix is multiplied by the matrix of the corresponding user’s pilot
matrix. In general, a pseudo-random sequence is a 0–1 bit stream whose length is much larger than the
length of the pilot sequence.
Figure 5. Pseudo-random code pilot design scheme.
In order to ensure that the generated pseudo-random sequence can be used for scrambling,
it needs to process as follows:
1. The pseudo-random sequence generated by the linear feedback shift register is truncated, and the
method is to generate a rectangle depending on the pilot length τ and using a rectangular window
for pseudo-random sequences to make truncation.
2. Performing BPSK modulation on the truncated pseudo-random sequence.
3. The pseudo-random sequence is diagonalized after modulation, that is, τ pseudo-random
numbers are generated as pseudo-random matrices on the diagonals of the diagonal matrix.
The pseudo-random matrix used for scrambling after the above processing can be expressed as:
⎡ ⎤
m1 0 ··· 0
⎢ ⎥
⎢ 0 m2 ··· 0 ⎥
P=⎢
⎢ .. .. .. .. ⎥
⎥ (24)
⎣ . . . . ⎦
0 0 ··· mτ
After the pseudo-random matrix is scrambled by the scrambler to the user pilot, the pilot of the
kth user outputted by the scrambler is obtained as:
60
Electronics 2019, 8, 55
The scrambling method used in this paper is to use the same pseudo-random sequence to
synchronously scramble the pilots of all users in the cell, and the pilot of K users in the cell can be
obtained after the pilot design as:
⎡ ⎤
m1 φ11 ··· m1 φK1
⎢ .. .. ⎥
Ψ=⎣ . ··· . ⎦ (26)
mτ φ1τ ··· mτ φKτ
The designed pilot is transmitted to the BS through the channel, and the BS can obtain the required
Channel State Information (CSI) by using channel estimation.
When the pilot design is performed on users of other cells, considering the pilot contamination
is related to the distance between users, when the distance between two users is large enough,
the influence of pilot contamination is neglected. Therefore, the proposed pseudo-random code pilot
scheduling scheme only considers the pilot contamination of the target cell and ignores the pilot
contamination of the farther cell. Under the above assumptions, only the pseudo-random sequence
needs to be allocated to the target cell and the neighboring cell, and the pseudo-random sequence
can be reused for the farther cell, which greatly reduces the usage and implementation complexity
of the pseudo-random sequence and makes it easy to design a better pilot sequence. Depending on
the above considerations, the pilot design of all cells can be designed by using the proposed pilot
design scheme. The only difference is that the pseudo-random sequence used by the target cell and the
neighboring cell cannot be the same, which involves a pseudo-random sequence selection problem. Since
the cross-correlation between different pseudo-random sequences is different, if the cross-correlation
value between the selected pseudo-random sequences is large, even if the user pilot is designed, the pilot
contamination between the cells is still serious, so that the pilot design seems to be meaningless, so it is
necessary to follow the certain criteria when selecting the pseudo-random sequence as the code sequence
to distinguish difference cells. Theoretically, the cross-correlation values between pseudo-random
sequences are connected. Nearly 0, the effect of the pilot design is more obvious, but the complexity
of acquiring these pseudo-random sequences will become very high. In order to obtain the desired
effect within a certain complexity range, this paper adopts the following criteria to obtain the required
pseudo-random sequence. Assuming that the number of cells requiring different pseudo-random
sequences is M, the corresponding M pseudo-random sequences need to be satisfied:
ρ mi , m j < γi , j = 1, 2, . . . , M (27)
i=j
where ρ mi , m j = n1 ∑nk=1 mik m jk , is a function that represents a normalized cross-correlation value
between two pseudo-random sequences. γ ∈ [0, 1] is a constant values indicating the upper limit of
the normalized cross-correlation value between any two selected pseudo-random sequences.
It can be seen from Equation (27) that in the case where the pseudo-random sequence used is
determined, the setting of the value is an important factor affecting the impact of the pilot design
scheme. The smaller the value, the smaller the cross-correlation value between pseudo-random
sequences. After the pilot design, the orthogonality of user pilots between different cells is stronger,
and the effect of reducing pilot contamination is more obvious. Otherwise, it cannot effectively reduce
the pilot contamination. The BS and the user in each cell share a pseudo-random sequence, and the BS
can distinguish pilots from different cells by using detection techniques. At the same time, since each
cell has the same pseudo-random sequence to scramble a set of orthogonal pilots of all users in the
cell, after pilot design, the user pilots in each cell remain orthogonal, and in addition, no additional
intra-cell interference will be introduced. Moreover, increasing the pilot length is beneficial to improve
the efficiency of the pilot design. This is because the length of the pseudo-random sequence is the same
61
Electronics 2019, 8, 55
as the length of the pilot, and the longer the pseudo-random sequence, the better the orthogonality
between them, so increasing the pilot length is equivalent to improving the orthogonality between the
pseudo-random sequences. Thereby, the orthogonality of user pilots between different cells after the
pilot design is improved.
3.4.4. Mean Square Error (MSE) Performance Analysis of Expected Channel Estimation
This section will use the new pilot to derive and analyze the MSE of the channel estimate and
explore the pilot design method described in Section 3.3. It can be seen from [15] that when the
pilot contamination is relatively serious, the channel estimation performance of the system drops
sharply. In this case, even if a complex multi-cell MMSE precoding scheme is applied, the performance
improvement of the massive MIMO system is limited. In order to facilitate derivation and analysis,
this section only studies simple single-cell ZF precoding and its precoding matrix is defined as:
−1
Ĝll Ĝ H Ĝll
Al = . ll (28)
−1
tr ĜllH Ĝll
It can be found from (28) that a single-cell ZF precoding matrix can be designed as long as the
information of the desired channel is obtained. Therefore, the subsequent derivation and analysis in
this section are depending on the estimated expected channel.
As can be seen from Section 3.4, the user pilot of the jth cell is Ψj = Pj Ψ, then Equation (15) can be
expressed as:
√ L
√ L
Yl = ρr τ ∑ Gjl ΨjT + nl = ρr τ ∑ Gjl ψ T PjT + nl (29)
j =1 j =1
In the case where the number of BS antennas is limited, the MMSE estimation of the desired
channel is:
−1
√ L
Ĝll = ρr τ Yl Cn + ρr τ ∑ Pi∗ ψ∗ Cil ψ T PiT Pi∗ ψ∗ Cil (30)
i =1
+ ,
where Cn = E , is the autocorrelation matrix, representing the received noise nl ; Cjl =
nlH nl
! "
E GjlH Gjl is the autocorrelation matrix representing the channel transfer matrix Gjl .
The MMSE defining the expected channel estimation of the lth cell BS is:
! 2 "
Mmse E Ĝll − Gll F (31)
√ −1
Let R = ρr τ Cn + ρr τ ∑iL=1 Pi∗ ψ∗ Cil ψ T PiT Pi∗ ψ∗ Cil , then Ĝll = Yl R, substituting into (38) we
get: ! ! ""
Mmse = E tr (Yl R − Gll ) H (Yl R − Gll )
+ + , + , + , + ,, (32)
= tr R H E YlH Yl R − R H E YlH Gll − E GllH Yl R + E GllH Gll
Using the model shown in Equation (29), there is:
! " L
E YlH Yl = Cn + ρr τ ∑ Pi∗ ψ∗ Cil ψ T PiT (33)
i =1
! " √
E YlH Gll = ρr τPi∗ ψ∗ Cll (34)
! " √
E GllH Yl = ρr τCll ψ T PlT (35)
62
Electronics 2019, 8, 55
Substituting Equations (32)–(34) into Equation (35), we can rewrite Mmse as:
L √ √
Mmse = tr R H Cn + ρr τ ∑ Pi∗ ψ∗ Cil ψ T PiT R − ρr τR H Pi∗ ψ∗ Cll − ρr τCll ψ T PlT R + Cll
+ i =1 ,
√
= tr Cll − ρr τCll ψ T PlT R #
−1
L
= tr Cll − ρr τ Cll ψ T PlT Cn + ρr τ ∑ Pi∗ ψ∗ Cil ψ T PiT Pl∗ ψ∗ Cll
i =1
(36)
Since the matrix Pi is a diagonal matrix composed of diagonal elements 1 and −1, and ψ H ψ = I,
so ( Pi ψ) H ( Pi ψ) = I can be obtained according to the properties of the matrix conjugate transpose.
Using the matrix inversion principle, we can express Equation (36) as:
⎧ ⎫
⎨ L −1 ⎬
M mse
= tr Cll − ρr τ Cll ΨlT Cn Ψl∗ + ρr τ ∑ ΨlT Ψi∗ Cil ΨiT Ψi∗ Cll (37)
⎩ i =1
⎭
When the number of users in each cell is K = 1, ΨlT Ψi∗ = ( Pl ψ) T ( Pi ψ)∗ = γli , where γli
is a normalized cross-correlation value between the pseudo-random code in the lth cell and the
pseudo-random code in the ith cell, which can be obtained from equation (27). Assuming that the
large-scale fading coefficient Dil = [ β il1 ] at this time, then Equation (40) can be simplified as:
1 mse ρr τβ2ll1
M = β ll1 − (41)
M 1 + ρr τβ ll1 + ρr τ ∑i =l γli2 β li1
It can be seen from the results of Equation (41) that when the large-scale fading coefficient and the
uplink transmit power are constant and the pilot length is constant, the MSE performance of the channel
estimation that can be obtained by applying the pilot scheme depending on the pseudo-random code
and it is mainly limited by the cross-correlation performance between the pseudo-random sequences
used: when the normalized cross-correlation value between the pseudo-random sequences used is
large (|γli | is close to 1), the MSE performance of the channel estimation is relatively poor. When
the normalized cross-correlation value between the pseudo-random sequences used is small (|γli |
is close to 0), the MSE performance of the channel estimation is better. It can be concluded that if a
suitable pseudo-random sequence is selected, the above method is adopted. The proposed pilot design
scheme can effectively improve the performance of channel estimation, thus achieving the purpose
of reducing pilot contamination. Figure 6 shows the flowchart of the proposed pseudo-random pilot
code algorithm.
63
Electronics 2019, 8, 55
Start
Initialize parameters
Yes
End
4. Simulation Results
64
Electronics 2019, 8, 55
According to the above simulation parameters, the Monte Carlo method is used to simulate
the uplink SINR, and the normalized channel estimation MSE and the target cell achievable rate,
thereby comparing the performance between the PLUG algorithm, the IPLUG, and the FRPS
algorithm respectively.
65
Electronics 2019, 8, 55
It can be seen from Figures 8 and 9 that the channel estimation normalized MSE of the edge user
and the central user under both algorithms has a decreasing trend with the increase in the number
of antennas. However, the MSE and SINR curves of the central users in both algorithms overlap,
indicating that the center user multiplexes the pilots and causes the same pilot contamination. The edge
user allocates orthogonal pilots under the PLUG algorithm, so there is no pilot contamination. When
the number of antennas is 256, the NMSE performance of the edge users is increased by 10.88 dB,
and the SINR performance is improved by 3.23 dB. Figure 10 is a graph showing the average achievable
sum-rate performance of the center user and the edge user as the number of antennas increases when
66
Electronics 2019, 8, 55
the number of edge users is one. Figure 12 shows the performance variation of the uplink achievable
rate of the target cell of the central user and the edge user when the number of edge users is 1. It can
be seen from Figure 10 that the average achievable rate of the central user in the PLUG algorithm is
slightly lost. This is because the orthogonal pilot set is added under such algorithm, and the pilot
overhead is increased, resulting in a decrease in spectral efficiency (SE), but the performance of the
edge user is very good. With a large boost, the average achievable is increased by 0.42 bps/Hs when
the number of antennas is 256. It can be seen from Figure 12 that the uplink achievable rate curves
of the target cell are basically coincident under the two algorithms, which indicates that the PLUG
algorithm does not bring loss to the overall performance of the target cell, but only increases the
fairness of the central user and the edge use. It is also obvious from the results that the proposed PLUG
algorithm reduces the probability of edge user communication interruption. Figure 13 compares the
uplink target cell achievable rate in the case where the number of edge users of the FRPS and PLUG
algorithms is different and for the number of antennas of 256. As can be seen from Figure 13, when
the number of edge users increases, the achievable rate of the PLUG algorithm gradually decreases.
This is because when the number of edge users is greater than one, the cost of pilot overhead exceeds
the performance gains of the PLUG algorithm.
Figure 10. Comparison achievable rate of target cell of FRPS and PLUG algorithms.
67
Electronics 2019, 8, 55
Figure 14 compares the uplink target cell achievable rate for FRPS and IPLUG with different
decision parameters when the number of antennas is 256. As can be seen from Figure 14, when the
decision parameter increases to λ ≥ 0.68, the achievable rate of the IPLUG algorithm gradually
increases. Therefore, the achievable rate of the proposed IPLUG algorithm have significantly exceeded
the FRPS algorithm, which is a clear advantage of the IPLUG algorithm, avoiding the waste of pilot
overhead caused by some users with good channel conditions being misclassified as edge users,
or some users with poor channel conditions are being misclassified as the central user which results in
communication interruption.
68
Electronics 2019, 8, 55
Figure 12. Achievable rate comparison of FRPS and PLUG algorithms for user fairness analysis.
Figure 13. Comparison of achievable sum rate against a different number of edge users for FRPS and
PLUG algorithms.
69
Electronics 2019, 8, 55
Figure 14. Comparison of achievable sum rate against a different number of user grouping factor λ for
IPLUG and FRPS algorithms.
Figure 15. PDF comparison of the proposed pseudo-random code scheme with other cases.
Figure 16 depicts the trend of the system’s downlink transmission BER as the number of antennas
increases. The number of users per cell is K = 8, the number of BS antennas M = 100 and pilot length
τ = 32. It can be seen from Figure 16, as the number of BS antennas increases, the performance of the
system is improved. In the case of full pilot contamination, the pilot performance is too serious, and the
improvement of system performance is not obvious with the increase of the number of antennas.
Therefore, it can be foreseen that when the number of antennas reaches a certain value, the system
performance will not be improved.
70
Electronics 2019, 8, 55
Figure 16. Comparison of the BER of the proposed pseudo-random pilot design scheme with other
schemes against the number of BS antennas.
When the proposed pilot design scheme or no contamination is used, the performance of the
system is significantly improved with the increase of the number of antennas. Figure 17 depicts
the trend of the downlink BER of the system as the average transmit power ρ f of the BS increases.
The results are analyzed for different pilot lengths. The number of users per cell K = 8, the number
of antennas configured by the BS M = 100. As can be seen from Figure 17 that when the number
of BS antennas is large, the increase of the BS transmit power is effective for improving the system
performance, but when it exceeds a certain value, the system performance will not be improved,
and this value will be affected by the severity of pilot contamination. It can be seen from Figure 17 that
in the case of complete pilot contamination, the system performance will not be improved after the BS
transmission power reaches 25 dB. In the case of the proposed pilot design scheme (pseudo-random
code) or no contamination, this value is greater than 25 dB. At the same time, it can be found that in the
case of complete pilot contamination, the system performance will not be improved with the increase
of the pilot length τ; in the absence of pilot contamination, the increase of the pilot length can improve
the system performance to a small extent, but there is no need to sacrifice pilot overhead to boost such
tiny performance. After using the proposed pseudo-random pilot design scheme, the performance
of the system will increase significantly with the increase of the pilot length. Furthermore, when the
pilot length τ = 64, the performance achieved by the proposed pilot design scheme is very close
to the case of no pilot contamination. Figure 18 compares the BER of the proposed pseudo-random
pilot design scheme with other cases versus the number of cell users K. The number of antennas
configured by the BS is M = 100, and pilot length τ = 128. It can be seen from Figure 18 that in the
case where the number of BS antennas is large, the increase in the number of cell users may deteriorate
the performance of the system, but the performance of the system is still far better when the proposed
pseudo-random pilot design or no pilot contamination scheme is used. At the same time, when the
number of users is small and the pilot length τ = 128, the same performance of the system can be
almost achieved in the case of no contamination when the proposed pilot design scheme is adopted.
This is because the increase in the length of the pseudo-random code can only make them infinitely
close to the orthogonal but cannot completely remove the pilot contamination. In the case of a small
number of users, the pilot contamination is small enough to affect the performance of the entire system.
71
Electronics 2019, 8, 55
In the case of a large number of users, the impact on the overall system performance is considerable
due to the superposition of pilot contamination. Overall, the proposed pilot design scheme is very
obvious for the improvement of system performance.
Figure 17. Comparison of the BER of the proposed pseudo-random code and another scheme versus
the average BS power ρ f .
Figure 18. BER comparison against the number of cell users for the proposed pseudo-random pilot
design and other cases.
Figure 19 compares the NMSE of the proposed IPLUG algorithm with the conventional
state-of-the-art algorithms [29,30,32] wherein the IPLUG algorithm corresponds to a decision parameter
72
Electronics 2019, 8, 55
0.9. As can be seen from the figure that the proposed IPLUG algorithm shows better NMSE performance
than the conventional algorithms for increasing number of base station antennas. Figure 20 compares.
Figure 20. Comparison of the BER of the proposed pseudo-random pilot design scheme with other
schemes against the number of BS antennas.
73
Electronics 2019, 8, 55
Figure 20 illustrates the BER comparison of the proposed pseudo-random code scheme and
the conventional schemes as the number of antennas increases. The number of users per cell is
K = 8, the number of BS antennas M = 160 and pilot length τ = 32. It can be seen from Figure 20,
as the number of BS antennas increases, the performance of the system is improved. The proposed
pseudo-random code scheme shows better BER performance as compared with the conventional
algorithms with increasing number of antennas.
5. Conclusions
This paper proposes a robust approach for effective pilot decontamination in massive MIMO
systems. Two efficient pilot decontamination schemes are proposed. The first scheme is depending on
Path Loss to perform User Grouping (PLUG) method while the second scheme is depending on the
pseudo-random code. The PLUG scheme divides users into central and edge users. Edger users are
allocated orthogonal pilots, and central users are assigned multiplex pilots, which improves edge user
performance. The Improved PLUG scheme (IPLUG) is further proposed to overcome the deficiency of
the PLUG scheme as it dynamically selects and correctly classifies the edge users and central users so
that there is no wrong misclassification and therefore, the communication quality of service is improved.
The analytical and simulation results show that the proposed IPLUG scheme can avoid the waste of
pilot overhead caused by users with good channel conditions being misclassified as edge users, or the
users with poor channel conditions being misclassified as central users, resulting in communication
interruption, therefore, the proposed IPLUG scheme increases the fairness of communication for each
user. The MSE of the expected channel estimation after the proposed pseudo-random code pilot design
scheme is deduced and analyzed. It is found that this scheme can not only effectively improve the
performance of channel estimation, but a more accurate channel estimation can be obtained by selecting
an appropriate pseudo-random code and pilot length. Thereby improving the performance of the
entire downlink system. The above conclusions are verified by numerical simulation. The numerical
results also show that the proposed pilot design scheme depending on pseudo-random code can greatly
improve the performance of the entire system limited by pilot contamination. The proposed PLUG
and IPLUG schemes are focused only on the cells of the omnidirectional antenna, and other sector
cells are not considered. At the same time, the pilot allocation is performed in the target cell, and the
pilot allocation between the cells is independent of each other and has certain limitations. Therefore,
the future research direction will be PLUG and IPLUG schemes depending on pilot decontamination
in sectoral cells. The problem is to consider the mutual connection of each cell to reasonably allocate
pilots throughout the system.
Author Contributions: Conceptualization, O.A.S., I.K. and B.M.L.; Data curation, O.A.S. and I.K.; Formal analysis,
O.A.S., I.K. and A.T.; Funding acquisition, I.K. and B.M.L.; Investigation, O.A.S., I.K. and A.T.; Methodology, O.A.S.,
I.K., B.M.L. and A.T.; Project administration, O.A.S. and I.K.; Software, O.A.S., I.K. and A.T.; Writing—original
draft, O.A.S., I.K. and B.M.L.; Writing—review & editing, O.A.S., I.K., B.M.L. and A.T.
Funding: This work was supported by the Basic Science Research Program through the National Research
Foundation of Korea (NRF) funded by the Ministry of Education (grant number: NRF-2017R1D1A1B03028350).
Conflicts of Interest: The authors declare no conflict of interest.
References
1. Khan, I. A Robust Signal Detection Scheme for 5G Massive Multiuser MIMO Systems. IEEE Trans.
Veh. Technol. 2018, 67, 9597–9604. [CrossRef]
2. Khan, I.; Singh, D. Efficient Compressive Sensing Based Sparse Channel Estimation for 5G Massive MIMO
Systems. AEU Int. J. Electron. Commun. 2018, 89, 181–190. [CrossRef]
3. Khan, I.; Zafar, M.H.; Jan, M.T.; Lloret, J.; Basheri, M.; Singh, D. Spectral and Energy Efficient Low-Overhead
Uplink and Downlink Channel Estimations for 5G Massive MIMO Systems. Entropy 2018, 20, 92. [CrossRef]
4. Khan, I.; Singh, M.; Singh, D. Compressive Sensing-Based Sparsity Adaptive Channel Estimation for 5G
Massive MIMO Systems. Appl. Sci. 2018, 8, 754. [CrossRef]
74
Electronics 2019, 8, 55
5. Molisch, A.F.; Win, M.Z.; Winters, J.H. Space-time-frequency (STF) coding for MIMO-OFDM systems.
IEEE Commun. Lett. 2002, 6, 370–372. [CrossRef]
6. Bolcskei, H.; Paulraj, A.J. Space-frequency coded broadband OFDM systems. In Proceedings of the IEEE
Wireless Communications and Networking Conference (WCNC), Chicago, IL, USA, 23–28 September 2000;
pp. 1–6.
7. Darsena, D.; Gelli, G.; Paura, L.; Verde, F. Blind Channel Shortening for Space-Time-Frequency Block Coded
MIMO-OFDM Systems. IEEE Trans. Wirel. Commun. 2012, 11, 1022–1033. [CrossRef]
8. Redferm, A.J. Receiver window design for multicarrier communication systems. IEEE J. Sel. Areas Commun.
2002, 20, 1029–1036. [CrossRef]
9. Darsena, D.; Gelli, G.; Paura, L.; Verde, F. NBI-resistant zero-forcing equalizers for OFDM systems.
IEEE Commun. Lett. 2005, 9, 744–746. [CrossRef]
10. Mergen, B.S.; Scaglione, A. Randomized Space-Time Coding for Distributed Cooperative Communication.
IEEE Trans. Signal Process. 2007, 55, 5003–5017. [CrossRef]
11. Knopp, R.; Humblet, P.A. Information capacity and power control in single-cell multiuser communications.
In Proceedings of the IEEE International Conference on Communications (ICC), Seattle, WA, USA, 18–22 June
1995; pp. 331–335.
12. Mastronarde, N.; Verder, F.; Darsena, D.; Scaglione, A.; van der Schaar, M. Transmitting Important Bits and
Sailing High Radio Waves: A Decentralized Cross-Layer Approach to Cooperative Video Transmission.
IEEE J. Sel. Areas Commun. 2012, 30, 1597–1604. [CrossRef]
13. Rusek, F.; Persson, D.; Buon, K.L.; Larsson, E.G.; Marzetta, T.L.; Edfors, O.; Tufvesso, F. Scaling up MIMO:
Opportunities and challenges with very large arrays. IEEE Signal Process. Mag. 2013, 30, 40–60. [CrossRef]
14. Biswas, S.; Xue, J.; Khan, F.A.; Ratnarajah, T. Performance Analysis of Correlated Massive MIMO Systems
with Spatially Distributed Users. IEEE Syst. J. 2016, 12, 1850–1861. [CrossRef]
15. Khan, I. Channel modeling and analysis of OWC-massive MIMO Systems. Opt. Commun. 2018, 434, 209–217.
[CrossRef]
16. Marzetta, T.L. Noncooperative cellular wireless with unlimited numbers of base station antennas. IEEE Trans.
Wirel. Commun. 2010, 9, 3590–3600. [CrossRef]
17. Jose, J.; Ashikhmin, A.; Marzetta, T.L.; Vishwanath, S. Pilot contamination and precoding in multi-cell TDD
systems. IEEE Trans. Wirel. Commun. 2011, 10, 2640–2651. [CrossRef]
18. Ashikhmin, A.; Marzetta, T.L. Pilot contamination precoding in multi-cell large-scale antenna systems.
In Proceedings of the IEEE International Symposium on Information Theory Proceedings (ISIT), Cambridge,
MA, USA, 1–6 July 2012; Volume 1, pp. 1137–1141.
19. Liangbin, L.; Ashikhmin, A.; Marzetta, T.L. Pilot contamination precoding for interference reduction in
large-scale antenna systems. In Proceedings of the 51th Annual Allerton Conference on Communication,
Control, and Computing, Monticello, IL, USA, 2–4 October 2013; Volume 2, pp. 226–232.
20. Nguyen, T.M.; Le, L.B. Joint pilot assignment and resource allocation in multicell massive MIMO network:
Throughput and energy efficiency maximization. In Proceedings of the IEEE Wireless Communications and
Networking Conference (WCNC), New Orleans, LA, USA, 9–12 March 2015; pp. 393–398.
21. Cheng, H.V.; Björnson, E.; Larsson, E.G. Optimal Pilot and Payload Power Control in Single-Cell Massive
MIMO Systems. IEEE Trans. Signal Process. 2017, 65, 2363–2378. [CrossRef]
22. Dao, H.T.; Kim, S. Pilot power allocation for maximizing the sum rate in massive MIMO systems.
IET Commun. 2018, 12, 1367–1372. [CrossRef]
23. Elijah, O.; Leow, C.Y.; Rahman, T.A.; Nunoo, S.; Iliya, S.Z. A Comprehensive Survey of Pilot Contamination
in Massive MIMO—5G System. IEEE Commun. Surv. Tutor. 2015, 18, 905–923. [CrossRef]
24. Hoydis, J.; Brink, S.T.; Debbah, M. Massive MIMO in the UL/DL of cellular networks: How many antennas
do we need? IEEE J. Sel. Areas Commun. 2013, 31, 160–171. [CrossRef]
25. Khan, I.; Zafar, M.H.; Ashraf, M.; Bayati, A.K.S. Computationally Efficient Channel Estimation for 5G
Massive Multiple-Input Multiple-Output Systems. Electronics 2018, 7, 382. [CrossRef]
26. Yin, H.; Gesbert, D.; Filippou, M.; Liu, Y. A coordinated approach to channel estimation in large-scale
multiple-antenna systems. IEEE J. Sel. Areas Commun. 2012, 31, 264–273. [CrossRef]
27. Neumann, D.; Joham, M.; Weiland, L.; Utschick, W. Low-complexity computation of LMMSE channel
estimates in massive MIMO. In Proceedings of the 19th International ITG Workshop on Smart Antennas,
Ilmenau, Germany, 3–5 March 2015; pp. 1–6.
75
Electronics 2019, 8, 55
28. Ngo, H.; Larsson, E.G. EVD-based channel estimation in multicell multiuser MIMO systems with very
large antenna arrays. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal
Processing, Kyoto, Japan, 25–30 March 2012; pp. 3249–3252.
29. Jin, S.; Wang, Z.; Zheng, L.; Wong, K.; Huang, Y.; Tang, X. On massive MIMO zero-forcing transceiver using
time-shifted pilots. IEEE Trans. Veh. Technol. 2016, 65, 59–74. [CrossRef]
30. Xiong, X.; Jiang, B.; Gao, X.; You, X. QoS-Guaranteed User Scheduling and Pilot Assignment for Large-Scale
MIMO-OFDM Systems. IEEE Trans. Veh. Technol. 2015, 65, 6275–6289. [CrossRef]
31. Dai, X. Optimal training design for linearly time-varying MIMO/OFDM channels modeled by a complex
exponential basis expansion. IET Commun. 2007, 1, 945–953. [CrossRef]
32. Saxena, V.; Fodor, G.; Karipidis, E. Mitigating pilot contamination by pilot reuse and power control schemes
for massive MIMO systems. In Proceedings of the IEEE 81st Vehicular Technology Conference, Glasgow, UK,
11–14 May 2015; pp. 1–6.
33. Zhu, X.; Wang, Z.C.; Dai, L.L.; Qian, C. Smart pilot assignment for massive MIMO. IEEE Commun. Lett. 2015,
19, 1–4. [CrossRef]
34. Jin, S.; Li, M.; Huang, Y.; Du, Y.; Gao, X. Pilot scheduling schemes for multi-cell massive
multiple-input-multiple-output transmission. IET Commun. 2015, 9, 689–700. [CrossRef]
35. Litvinenko, L. OFDM signal PAPR reduction by pre-scrambling and clipping for frequency domain
comb-type channel estimation case. In Proceedings of the 23rd International Conference Radioelectronika,
Pardubice, Czech Republic, 16–17 April 2013; pp. 273–277.
36. Yang, B.; Deng, C.J.; Wu, P.H.; Xi, J.; Shi, L. Image encryption algorithm depending on two-one-dimensional
logistic chaotic inter-scrambling systems and m-sequence. In Proceedings of the IEEE International
Conference on Software Engineering and Service Science (ICSESS), Beijing, China, 27–29 June 2014;
pp. 521–524.
© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access
article distributed under the terms and conditions of the Creative Commons Attribution
(CC BY) license (http://creativecommons.org/licenses/by/4.0/).
76
electronics
Article
An Efficient Pilot Assignment Scheme for Addressing
Pilot Contamination in Multicell Massive
MIMO Systems, Shamala Subramaniam and Ali
Mohammed Mansoor
Ahmed S. Al-hubaishi 1, *, Nor Kamariah Noordin 1,2 , Aduwati Sali 1,2 , Shamala Subramaniam 3,4
and Ali Mohammed Mansoor 5
1 Department of Computer and Communication Engineering, Universiti Putra Malaysia, Serdang 43400, Malaysia;
[email protected] (N.K.N.); [email protected] (A.S.)
2 Wireless and Photonics Networks Laboratory, Universiti Putra Malaysia, Serdang 43400, Malaysia
3 Department of Communication Technology and Network, Universiti Putra Malaysia, Serdang 43400, Malaysia;
[email protected]
4 Sports Academy, Universiti Putra Malaysia, Serdang 43400, Malaysia
5 Faculty of Computer Science & Information Technology, University of Malaya, Kuala Lumpur 50603, Malaysia;
[email protected]
* Correspondence: [email protected]; Tel.: +60-967-777-888-129
Abstract: The reuse of the same pilot group across cells to address bandwidth limitations in a network
has resulted in pilot contamination. This causes severe inter-cell interference at the targeted cell. Pilot
contamination is associated with multicell massive multiple-input multiple-output (MIMO) systems
which degrades the system performance even when extra arrays of antennas are added to the network.
In this paper, we propose an efficient pilot assignment (EPA) scheme to address this issue by maximizing
the minimum uplink rate of the target cell’s users. To achieve this, we exploit the large-scale characteristics
of the fading channel to minimize the amount of outgoing inter-cell interference at the target cell. Results
from the simulation show that the EPA scheme outperforms both the conventional and the smart pilot
assignment (SPA) schemes by reducing the effect of inter-cell interference. These results, show that
the EPA scheme has significantly improved the system performance in terms of achievable uplink rate
and cumulative distribution function (CDF) for both signal-to-interference-plus-noise ratio (SINR), and
uplink rate.
Keywords: pilot contamination; massive MIMO; pilot assignment; large-scale fading coefficients
1. Introduction
Equipping the base station (BS) with a large number of antennas (also known as massive
multiple-input multiple-output (MIMO)) has been considered one of the fundamental technologies that
leads to 5G [1]. The introduction of this technology is to meet the increasing demand for mobile data
in 5G [2]. Although the use of massive MIMO systems increases spectral efficiency, enhances energy
efficiency, and reduces the effect of small scale fading [3–7], but invariably promotes pilot contamination.
In massive MIMO, time-division duplex (TDD) protocol is preferred over the frequency-division duplex
(FDD) [8,9], as the former allows channel estimation in one direction (i.e., uplink) and avoids the estimation
77
Electronics 2019, 8, 372
of the other side (i.e., downlink) due to channel reciprocity property. In other words, the use of TDD
based channel reciprocal minimizes the overhead signals used for channel estimation, which largely saves
network bandwidth. Although, the channel estimation ensures high utilization of TDD massive MIMO
via uplink transmission, but its channel coherence blocks are restricted in size (limited size). Therefore, the
orthogonal pilot sequences cannot be allocated for all users among the cells. To overcome this problem,
the orthogonal pilot sequences have to be reused across the cells. Although, pilot reuse approach is a
remarkable way forward in addressing the associated problem, however, the channel estimate obtained in
a given cell will be contaminated by pilots transmitted by users in other cells. Specifically, the inter-cell
interference exacerbates the estimation error and also makes sure the channel estimation of two or more
users sharing the same pilot sequence is correlated at a given cell [10]. Thus, with multicell massive MIMO
systems, its performance deteriorates during uplink and downlink transmission. This issue is referred to
as pilot contamination, and depicted in Figure 1.
Figure 1. The effect of pilot contamination in multicell massive MIMO systems at a cell a, where the solid
line represents the direct gain and the dotted line represents the inter-cell interference.
To address the issue associated with pilot contamination, several research methods have been
proposed to eliminate/relieve pilot contamination. Among these methods, the pilot assignment technique
is identified to be a potential technique for solving this problem. Smart pilot assignment (SPA) method
proposed by [11], focused on adjusting the combination between the users and pilot sequences, but did
not consider inter-cell interference which causes the pilot contamination. In this paper, we propose an
efficient pilot assignment mechanism to improve the performance of users with respect to intense pilot
contamination in multicell massive MIMO systems. We summarize our contributions below:
• We formulate the pilot assignment as an optimization problem and develop a heuristic algorithm, in
order to maximize the minimum throughput considering the reduction in the inter-cell interference
pilot contamination.
• We evaluate the performance of the proposed mechanism in terms of signal-to-interference-plus-noise
ratio (SINR), and uplink rate with an extensive MATLAB simulation.
• We compare our work with SPA and other conventional schemes.
The rest of this paper is organized as follows. The related work is summarized in Section 2, the system
model is described in Section 3, the pilot contamination phenomenon and the achievable uplink rate are
illustrated in Section 4, the EPA scheme is explained in Section 5, the simulation results are depicted in
Section 6, and finally, this paper is concluded in Section 7.
Notation: Throughout this paper, the bold lower case letters represent vectors and matrices are
represented by bold upper case letters. I M denotes the identity matrix of dimensions M × M. The operators
78
Electronics 2019, 8, 372
(.)−1 , (.) T , and (.) H are defined for inverse, transpose and conjugate transpose operations, respectively.
The expectation operator is represented by E . .
2. Related Work
Different traditional algorithms based on pilot assignment have been proposed for pilot
decontamination [12,13]. A vertex graph-coloring-based pilot assignment has been proposed in [12], the
pilot sequences are allocated to the users according to the inter-cell interference (ICI) graph. The evaluation
of ICI graph depends on both angle of arrival (AoA) correlation and distances between users. However,
this scheme requires a second order channel information to construct ICI graph. A deep learning-based
pilot allocation scheme (DL-PAS) is proposed in [13] to address the pilot contamination problem in massive
MIMO systems. This algorithm aims at learning from the relationship between pilot assignment and users’
location. However, the DL algorithm requires high data and subsequently takes a longer time to process
the data.
The authors in [14,15] developed the location-based pilot assignment approaches for pilot
decontamination. A new expression for line of sight (LOS) interference is derived in [14] which is
considered as the criteria for pilot allocation. Although, there was an improvement in the sum spectral
efficiency (SE), but the pilot assignment process takes a longer time to be implemented, especially in large
networks. The work in [15] characterizes the angular region of the targeted user, and the pilot assignment
process was implemented with the aim of making this region interference-free. This angular region is
characterized by both the number of BS antennas and the location of the targeted user. However, the pilot
assignment problem is formulated by the joint optimization problems which subsequently introduce high
computational complexity.
In [16,17], the pilot allocation based pilot reuse (reuse factor more than 1) is also considered for pilot
contamination’s elimination technique. A systematically-constructed pilot reuse method is proposed
in [16]. In this approach, the neighbor cells are allowed to use different sets of pilot sequences according to
the tree division. To improve performance, it ensures larger distance between cells that share similar pilot
sets, the depth of the tree is increased as the pilot contamination severity increases. This approach offers
an effective performance when the ratio of the channel coherence time to the number of users in each cell
is relatively large. For the purpose of improving the quality of service (QoS) of the edge users, a soft pilot
reuse (SPR) scheme was proposed by [17]. The channel quality for each user is initially compared with a
determined threshold before the pilot allocation procedure, but an increase in complexity was recorded
due to additional computational cost incurred by finding the optimal threshold value.
By considering a fairness among users in order to mitigate the pilot contamination, pilot allocation
schemes were proposed in [18,19]. Specifically, to maximize the sum rate of the system and guarantee
fairness among users, a pilot allocation scheme was proposed by [18]. An optimization problem is
formulated based on a max-product criterion, then both min-leakage algorithm and user- exchange
algorithm based on greedy (UEBG) pilot allocation were suggested to solve the optimization problem.
Although this scheme almost achieves the same performance as the optimal exhaustive search algorithm
(ESA), it still suffers a setback due to high complexity. For the purpose of pilot contamination mitigation
in [19], the pilot assignment scheme based on the harmonic SINR utility function was introduced to
regulate the fairness among users. However, the system complexity increases as the number of users and
network size grows (more than two cells).
Based on performance degradation of users, a pilot assignment scheme has been proposed in [20],
the degradation performance is initially evaluated for all users according to the value of the uplink
achievable rate. Therefore, the optimal pilot sequences were assigned to users who suffered from the
highest degradation in a greedy way. Obviously, this scheme is not effective in bad channel conditions.
79
Electronics 2019, 8, 372
In [11,21], the pilot allocation approaches aim at enhancing the performance of users who suffer from
bad SINR. The pilot allocation in [21] focused on maximizing the sum capacity of the whole system for
pilot decontamination. In this work, the pilot sequences were assigned initially to the users who have bad
channel condition. However, the complexity of the pilot assignment procedure increases as the network
size is increased. A SPA scheme is proposed in [11] to improve the performance of users with poor SINR.
Users with low channel quality were assigned to pilot sequences which resulted in a low interference.
However, the achievement of this scheme is limited as it did not consider inter-cell interference which
causes the pilot contamination.
Some authors have tried to make a combination of two schemes to get an improved performance
as shown in [22,23]. As such, a joint pilot assignment scheme has been proposed by [22], in which
time-shifted [24] and the SPA [11] schemes were combined in order to mitigate the effect of pilot
contamination. Inter-group interference is suppressed according to [24] strategy, whereas SPA is used
to reduce intra-group interference. Although an improved overall performance was recorded, the
mutual interference between downlink data and uplink pilot signals cannot be eliminated despite the
use of SPA scheme. New pilot assignment schemes such as greedy-based and swapping-based were
implemented together with pilot contamination precoding design (PCP) for massive MIMO downlinks [23].
This combination offers a considerable improvement over the random pilot assignment, but the PCP
matrix is changed according to the update in pilot assignment information.
By exploiting the channel sparsity for wideband massive MIMO system, the pilot contamination
can be removed with the help of pilot assignment policy in [25]. The pilot assignment policy is designed
to help identify the subspace of the desired channel. The difficulty in this approach, lies on how to
deal with the subspace estimation, which can be realized through multiple frames after randomizing the
pilot contamination.
Differing from the aforementioned works [11,20], we consider the source of inter-cell interference
throughout pilot assignment, which is essentially the cause of the pilot contamination. In some other
works [12–15], the availability of some factors (e.g., user location, AoA, or LOS interference) are needed
for pilot assignment which are not always easy to estimate, while our approach requires only large-scale
fading coefficients, which can be tracked easily as they do not frequently change during coherence interval.
Besides, comparing to previous works [17–19,21], our algorithm is not computationally intensive, and
therefore it can be applied for large-scale networks.
3. System Model
In this section, we describe the system model under which the TDD-massive MIMO systems are
implemented. In this model, the uplink comprises L cells, in which each cell contains a BS equipped with
M antennas. Furthermore, in each cell coverage area K single-antenna users communicate simultaneously
to their designated BS, assuming that M K [2,5]. The propagation channels connecting the k-th user
located in the j-th cell to the BS in i-th cell is modeled as Rayleigh block fading [26] and the channel vector
hijk ∈ C M×1 is denoted as:
hijk = gijk βij . (1)
k
where gij and βij denote the small scale-fading vector and large-scale fading coefficient, respectively.
k k
The small scale-fading vector has a complex Gaussian distribution with zero mean and unity variance, CN
(0, I M ) , while the large-scale fading coefficient is referred to the effect of both path-loss and shadowing
and it can be tracked easily as it changes slowly during coherence interval τc = Bc Tc [27–29]. We use
Bc and Tc to denote the coherence bandwidth and the coherence time, respectively. Figure 2 illustrates
the coherence block for TDD protocol. We also consider that large-scale fading coefficient is equal for
80
Electronics 2019, 8, 372
all antenna elements, assuming that the distance between user k and BS is significantly larger than the
distances between antenna elements.
Ki L Kj
φ √ √ φ
Ui = ρφ ∑ hiik φkT + ρφ ∑ ∑ hijk φkT + Ni (3)
k =1 j =1 k =1
j =i
φ
where ρφ denotes the pilot transmission power, and Ni ∈ C M×τp denotes the additive white Gaussian
noise (AWGN) matrix which is assumed independent and identically distributed (i.i.d) random variables
whose elements have zero mean and variance σN 2 . The received signal Uφ is called the observation, in which
i
the BS in the cell i can use it to estimate the channel responses. The first term in (3) represents the received
pilot signals from users in the serving cell, whereas, the middle term represents the inter-cell interference
signal from the neighbor cells, which causes the pilot contamination. Correspondingly, the received uplink
data uid ∈ C M at the BS in the i-th cell can be represented by:
√ L K
uid = ρu ∑ ∑ hijk xujk + niu (4)
j =1 k =1
where x ujk denotes the uplink transmitted symbol from user k located in the j-th cell, ρu denotes the power
of the uplink transmitted symbol with E | x ujk |2 = 1, and niu ∈ C M×τu denotes the AWGN vector with
variance σn2 and zero mean value. The minimum mean square error (MMSE) is exploited for the purpose
of the channel estimation ĥijk ∈ C M×1 [9]. Therefore, the MMSE estimated channel vector ĥijk based on the
φ
observation Ui in (2) can be given as [10]:
√ p
ĥijk = ρu Rijk Ψijk uijk (5)
and
−1
Ψijk = ∑ ρu τp Rijk + σu2 IK (6)
j,k
81
Electronics 2019, 8, 372
φ
where uijk = Ui φk∗ which is called the received proceed signal, Ψijk denotes the inverse of the
p
normalized correlation matrix, and Rijk denotes the spatial correlation matrix of the channel to be estimated,
Rijk = E[hijk hiH
jk ].
The estimated channel is then used to detect the uplink data symbol and precode the downlink data.
Herein, we consider both maximum ratio combining (MRC) and zero forcing (ZF) as a linear detectors at
the BS which are given by [30]:
Ĥii MRC
Ai = −1 (7)
Ĥii Ĥii H Ĥii ZF
The received detected signal is evaluated by multiplying the received uplink data signal uid by the
decoding vector aiH i
ik , which represents the k-th column of the matrix Ai and hik is the k-th column of the
matrix Hii . Therefore, the detected symbol of user k at a given BS located in a cell i can be expressed as:
√ L K
u
ziik = aiH
ik ui = aik
d iH
ρu ∑ ∑ hijk xujk + niu (8)
j =1 k =1
Ki Kj
√ √ √ L
u
ziik = ik hik xik +
ρu aiH i u
ρu ∑ aiH
ik hin xin +
i u
ρu ∑ ∑ aiH
ik h jk x jk + aik ni
i u iH u
(9)
n =1 j =1 k =1
n =k j =i
The first term in (9) represents the desired signal, the second one represents the intra-cell interference,
the third term is the effect of pilot contamination (inter-cell interference), and the last one represents the
uncorrelated noise.
Consequently, the average SINR of the k-th user in the target cell i can be evaluated as:
ρu |aiH
ik hik |
i 2
u
SI NRik = (10)
Kj υik
i
ρu ∑ Lj=1 ∑k=1 |aiH
ik h jk | +
i 2
ρu
j =i
and
Ki
i
υik = ρu 2 ∑ |aiH
ik hin | + ρu || aik ||
i 2 i 2
n =1
n =k
where υik
i denotes the intra-cell interference and uncorrelated noise, in which their effect is almost neglected
as the number of antennas increases (M → ∞) [5]. Then, the uplink SINR can be described by large-scale
fading coefficients βijk as follows:
2
βiik
u
SI NRik = 2
, when (M → ∞) (11)
∑ j =i βijk
It is clear from the above expression that the effect of small-scale fading and thermal noise are
averaged out as the number of antennas is increased [5]. Therefore, the ergodic achievable uplink rate of
the user k according to [26] is:
1
Tc ∑
u
Rik = (1 + SI NRik
u
) (12)
τu
82
Electronics 2019, 8, 372
u is calculated in bit/channel use and τ refers to the uplink duration. From (12), it is obvious that
where Rik u
the average uplink rate of multicell massive MIMO systems is limited due to pilot contamination and it
cannot be boosted by increasing either the number of serving antennas or both ρu and ρ p .
5. Proposed Scheme
In this section, an efficient heuristic algorithm is developed for addressing the multicell massive
MIMO associated problem. To do this, the assignment and the reuse of pilot group across cells in the
network is formulated as an optimization problem.
2
M→∞ βiik
P −−−→ P̂ : max
∀ k ∈i min 2
(13)
min
∀φk ∑ j =i βijk
The above optimization problem is based on the method proposed by [11]. In this method, it is
assumed the number of antennas is very large and as such make use of the large-scale fading coefficients
βijk . To address problems related to pilot contamination, this study concentrates on assigning the pilot
sequences for a specific cell in multicell massive MIMO systems. In the target cell, the number of possible
iterations is defined by the number of K users which is usually very high. In contrast, the conventional
scheme assigns the pilot sequences Φ = [φ1 , φ2 , ..., φK ] T randomly to K users.
The performance of multicell massive MIMO systems is much degraded by the effect of the strong
inter-cell interference from the neighbor cells and is exacerbated when the channel quality of the users
in target cell is poor. Specifically, in the SPA scheme, the set of users with the worst channel quality are
assigned pilot sequences with the lowest inter-cell interference. Although these pilot sequences have the
lowest interference, they are still considered high interference pilot sequences when used by users which
have bad channel quality. Therefore, the interference that is associated with such pilot sequences must
be minimized.
83
Electronics 2019, 8, 372
In order to achieve this, we propose a heuristic algorithm based on SPA to solve the optimization
problem in (13). Before illustrating the algorithm, we need to define a set of parameters η jk which
characterizes the squared cross gain of the interfering users from neighboring cells:
2
η jk = βijk , k = 1, 2, ..., K, j = 1, 2, ..., L and j = i
The interference that is produced by users who shared the same pilot sequence φk can be evaluated
at the target cell as:
ξ k = ∑ η jk (14)
j =i
In addition, the set of parameters k is used to characterize the square channel quality of the target
cell’s users which can be expressed by:
2
k = βiik , k=1,2,...,K
The proposed algorithm EPA is summarized in Algorithm 1 to solve the above optimization problem.
84
Electronics 2019, 8, 372
The available large scale fading coefficients are exploited to measure the interference from the neighbor
cells. From the above algorithm, the users in the neighbor cells are classified into different levels according
to the value of squared cross gain (η jk ), which gives an indication of the strength of the interference at
the target cell i. The users that cause the highest interference (which have the largest η jk ) are classified
as the level V1 users. This level involves the worst interfering users from each neighbor cell. The second
level V2 contains the users which cause less interference than that in V1 . This classification process will
continue until the last level VK , which contains the users that produce the smallest interference. The k-th
interference level can be represented by:
The amount of interference that is produced by the users in each level is described by (14). After that,
the interfering users in each level are assigned the same pilot sequence. For instance, the users in V1
and VK are assigned the pilot sequences φ1 and φK , respectively. As a result, the pilot sequence φ1 is
suffering from the highest interference, whereas φK is the one with the lowest interference. The remaining
pilot sequences have different levels of interference between φ1 and φK . After minimizing the inter-cell
interference at the serving BS, the second step is to assign pilot sequences to its users and this can be
achieved by solving the following formula:
M→∞ k
P −−−→ P̂ : max
∀ k ∈i min (17)
ξk
Obviously, from the EPA algorithm, the pilot assignment process for the users of the target cell
depends on, both the squared channel quality k and the minimized outgoing interference ξ k , which is
caused by users sharing the same pilot sequence in the level Vk . The optimization problem in (17) can be
solved with the help of the SPA algorithm. In this algorithm, users that suffer setbacks due to bad channel
quality are exempted from the pilot sequence as it will cause severe interference. Thus, the sets of users
with the worst channel quality are assigned pilot sequence with the lowest inter-cell interference. For the
remaining cells, the process will continue in a sequential way, excluding the cells that are already included
with the target cell.
Furthermore, our algorithm is not computationally intensive in the sense that it ultimately relies on
cell sorting, thus the time complexity it incurs is O (L K log K), and therefore it works faster if compared to
recent schemes. For example, EPA shows less computational complexity than the work in [19,21], which
incur O (L K3 ) and O (L2 K log K), respectively. In addition, the scheme in [17] incurs O (M(K2 e + K2 CS ),
where Ke denotes the number of edge users in the network, and KCS represents the number of users in the
largest cell. So apparently [17] is much more intense than EPA. The SPA scheme [11], as it is fundamentally
limited to only a target cell optimization, unsurprisingly it incurs only O ( K log K).
6. Simulation Results
The base code implemented is obtained from [26], while Monte Carlo simulation is used to evaluate
the performance of the EPA scheme. A typical hexagonal cellular network made up of L cells is considered
in the EPA scheme. Each of these cells comprises of a BS which is equipped with M number of antennas
and K users with single antennas under its coverage area [2,5]. A center cell surrounded by all other cells
is considered as a target cell. The system parameters are summarized in Table 1. The parameter βiik is
modeled in decibel as [10]:
dijk
βiik = Υ + 10 α log10 + Fjki (18)
1 km
85
Electronics 2019, 8, 372
where dijk (km) is the distance between the k-th user in the j-th cell and the BS in the i-th cell, α is the
path-loss exponent, Υ determines the median channel gain at 1 km as a reference distance which can be
calculated according to many propagation models [33], and Fijk N (0, σ2s f ) is the shadow fading which
creates log-normal random variations around the nominal value Υ + 10 α log10 (dijk /1 km).
We evaluate the SPA [11] and the conventional schemes [2,5] against the EPA scheme. Figure 3
depicts the average uplink rate per user of the EPA, SPA and conventional schemes against the number
of BS’s antennas using the ZF as a linear detector. Obviously, the average uplink rate of the EPA scheme
outperforms the other schemes. This improvement can be attributed to the policy implemented for pilot
assignment in the neighbor cells. This implemented policy ensures a significant reduction of the inter-cell
interference at the serving BS, which invariably leads to a better throughput. Due to the pilot assignment
in the target cell which was executed according to the users’ channel quality, the SPA scheme achieves
better performance than the other conventional scheme. However, the performance of both SPA and
conventional schemes changes slightly when the number of antennas exceeds certain points (e.g., greater
than 150).
3.5
Conventional Scheme
SPA
3
EPA
Average Rate per User [bit/channel use]
2.5
1.5
0.5
0
50 100 150 200 250 300 350 400 450 500
Number of Receive Antennas per Cell
Figure 3. The average uplink rate per user with zero forcing (ZF) for different numbers of antennas.
86
Electronics 2019, 8, 372
Figure 4 shows the impact of the EPA scheme when using the MRC as a linear detector. It can be
clearly observed that the average uplink rate per user (bits/channel use) is substantially enhanced by
the EPA scheme when the number of antennas is increased. The superiority of the EPA scheme over
other schemes, arose as a result of the minimization of the inter-cell interference that comes from the
neighbor cells. This is achieved by allowing the users in each interference level Vk to share the same pilot
sequence. Consequently, EPA scheme has shown a low interference from the neighbor cells compared to
the other schemes.
3
Conventional Scheme
SPA
EPA
2.5
Average Rate per User [bit/channel use]
1.5
0.5
0
50 100 150 200 250 300 350 400 450 500
Number of Receive Antennas per Cell
Figure 4. The average uplink rate per user with maximum ratio combining (MRC) for different numbers
of antennas.
Figure 5 depicts the performance of the EPA scheme when compared with both conventional and SPA
schemes in terms of cumulative distribution function (CDF) of the average SINR. When the number of BS’s
antennas is 64 with ZF detector, the probabilities of the average uplink SINR being less than −10 dB for
the conventional, the SPA, and the proposed EPA schemes are almost 80%, 26.25%, and 10%, respectively.
The improvement is achieved because the effect of the interference, which is associated with the pilot
sequences, on channel quality of the users in the target cell became slight, which effectively increased the
SINR of the system.
Figure 6 depicts the CDF of the minimum SINR when M is 64. It is evident that the minimum SINR
of the EPA scheme is significantly improved when compared with SPA and conventional schemes. For
example, the probability of the minimum SINR to be less than −20 dB for the EPA scheme is approximately
16.25%, while this probability is about 34.6% and 79.6% for the SPA and the conventional schemes,
respectively. The reason behind this improvement is due to assigning the pilots of the users with the
lowest interference, in the neighbor cells, to the users who have bad channel quality in the target cell.
In consequence, the performance of these users was improved due to the reduction of their interference.
87
Electronics 2019, 8, 372
1
Conventional Scheme
0.9 SPA
EPA
0.8
0.7
0.6
CDF
0.5
0.4
0.3
0.2
0.1
0
-25 -20 -15 -10 -5 0 5 10 15
Average SINR
Figure 5. The cumulative distribution function (CDF) of the average signal-to-interference-plus-noise ratio
(SINR) when M = 64 using ZF.
1
Conventional Scheme
0.9 SPA
EPA
0.8
0.7
0.6
CDF
0.5
0.4
0.3
0.2
0.1
0
-70 -60 -50 -40 -30 -20 -10 0 10
Min. SINR
Figures 7 and 8 depict the CDF of average and minimum SINR, respectively, using MRC detector when
M is 64. As observed from Figures 7 and 8, the EPA scheme outperforms the SPA and the conventional
88
Electronics 2019, 8, 372
schemes. As shown in Figure 7, the EPA scheme increases the average SINR by 1.8 dB over the SPA scheme,
whereas it increases up to 4.69 dB for minimum SINR, as illustrated in Figure 8.
1
Conventional Scheme
0.9 SPA
EPA
0.8
0.7
0.6
CDF
0.5
0.4
0.3
0.2
0.1
0
-30 -25 -20 -15 -10 -5 0 5
Average SINR
1
Conventional Scheme
0.9 SPA
EPA
0.8
0.7
0.6
CDF
0.5
0.4
0.3
0.2
0.1
0
-70 -60 -50 -40 -30 -20 -10 0
Min. SINR
From Figures 5–8, the minimum SINR always achieves a better performance. In other words,
the performance of edge users is significantly enhanced. This is due to the fact that the inter-cell interference
has been greatly reduced at the target cell while the users with poor channel quality are assigned to the
89
Electronics 2019, 8, 372
suitable pilot sequences in order to maximize its SINR. Moreover, the results obtained as a result of using
ZF and MRC linear detections, are approximately comparable when run on the same parameters setting.
This is because the inter-cell interference is greatly reduced by the EPA scheme that runs before the process
of signal detection.
By using ZF detector, the performance of the EPA scheme has been examined in terms of the CDF of
the average uplink rate when M is 64, as shown in Figure 9. It can be seen that the performance of the
CDF in the conventional scheme is highly influenced by the pilot contamination. The assignment of the
pilot randomly, has led to the worst performance compared to the SPA and the EPA schemes. On the other
hand, the EPA scheme outperforms the SPA and the conventional schemes, since the effect of users who
cause the highest interference is considered weak compared to users having good channel quality when
they are assigned the same pilot sequence. As a result, these interfering users are excluded from sharing
the same pilots of users with bad channel quality.
0.9
Conventional Scheme
0.8 SPA
EPA
0.7
0.6
CDF
0.5
0.4
0.3
0.2
0.1
0
0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5
Average Throughput
Figure 9. The CDF of the average uplink rate when M = 64 using ZF.
Result of evaluation for the CDF of the minimum uplink rate is depicted in Figure 10. It is clear that
the EPA scheme performs better than the other schemes. For example, the minimum uplink rate of the
EPA scheme is doubled when compared to the SPA scheme. This improvement has been achieved because
the interference associated with pilot sequences, which is allocated to users with bad channel quality, was
reduced effectively by the EPA scheme.
Figures 11 and 12 represent the CDF of the average and the minimum uplink rate, respectively, when
the MRC is utilized and M is 64. The EPA scheme achieves the highest performance when compared
with other schemes, especially in the minimum uplink rate. Specifically, the achieved gain in minimum
uplink rate is doubled while it is 1.2 times in average uplink rate in comparison with SPA. The reason for
this improvement in the minimum uplink rate is due to the priority given to the users having the worst
channel quality during the pilot assignment process.
90
Electronics 2019, 8, 372
0.9
Conventional Scheme
SPA
0.8 EPA
0.7
0.6
CDF
0.5
0.4
0.3
0.2
0.1
0
0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 2
Min. Throughput
Figure 10. The CDF of min. uplink rate when M = 64 using ZF.
0.9
0.8
Conventional Scheme
SPA
0.7
EPA
0.6
CDF
0.5
0.4
0.3
0.2
0.1
0
0 0.5 1 1.5 2 2.5
Average Throughput
Figure 11. The CDF of average uplink rate when M = 64 using MRC.
In order to verify the effectiveness of the EPA scheme, the average uplink rate against the
number of antennas has been evaluated in Figure 13 with different parameters, considering ZF detector.
These parameters, which are shown in Table 1, increase the interference severity at the target cell. Obviously,
the average uplink rate of EPA schema is higher than other schemes, despite the intensity of interference.
91
Electronics 2019, 8, 372
0.7
0.6
CDF
0.5
0.4
0.3
0.2
0.1
0
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8
Min Throughput
Figure 12. The CDF of min. uplink rate when M = 64 using MRC.
Conventional Scheme
2 SPA
EPA
Average Rate per User [bit/channel use]
1.5
0.5
0
50 100 150 200 250 300 350 400 450 500
Number of Receive Antennas per Cell
Figure 13. The average uplink rate per user with ZF for different numbers of antennas, K = 20 , and
R = 300 m.
92
Electronics 2019, 8, 372
7. Conclusions
In this paper, we propose a new pilot assignment approach to address the pilot contamination
in multicell massive MIMO systems. An optimization problem is formulated in order to improve the
minimum uplink rate for users in the target cell. Our approach to solving this optimization problem
ensures an overall reduction of the outgoing inter-cell interference of neighbor cells. This reduction is
achieved by assigning the pilot sequences to the neighbor cell’s users and maximizing the minimum
uplink rate of the target cell’s users based on SPA algorithm. The numerical results have clearly shown
that the EPA scheme is more effective than the other schemes in both MRC and ZF linear detections.
Additionally, using such an efficient assigning approach entitles the new EPA scheme to achieve significant
performance when the typical parameter M is 64 compared to the SPA and the conventional schemes.
Likewise, the minimum uplink rate is greatly enhanced by the new EPA scheme than the SPA scheme.
Furthermore, the proposed scheme has also proved high effectiveness and performance even in severe
interference environments.
Author Contributions: Conceptualization, A.S.A.-h. and N.K.N.; Formal analysis, A.S.A.-h. and N.K.N.; Methodology,
A.S.A.-h., N.K.N., A.S., S.S. and A.M.M.; Writing—review & editing, A.S.A.-h. and N.K.N.
Funding: This work was funded by [Advancing the State of the Art of MIMO: The Key to Successful Evolution of
Wireless Networks (ATOM)] grant number [690750-ATOM-H2020-MSCA- RISE-2015, UPM: 6388800-10801] and The
APC was funded by [Research Management Centre (RMC)].
Conflicts of Interest: The authors declare no conflicts of interest.
References
1. Boccardi, F.; Heath, R.W.; Lozano, A.; Marzetta, T.L.; Popovski, P. Five disruptive technology directions for 5G.
IEEE Commun. Mag. 2014, 52, 74–80. [CrossRef]
2. Marzetta, T.L. Noncooperative cellular wireless with unlimited numbers of base station antennas. IEEE Trans.
Wirel. Commun. 2010, 9, 3590–3600. [CrossRef]
3. Lim, Y.G.; Chae, C.B.; Caire, G. Performance analysis of massive MIMO for cell-boundary users. IEEE Trans.
Wirel. Commun. 2015, 14, 6827–6842. [CrossRef]
4. Larsson, E.G.; Edfors, O.; Tufvesson, F.; Marzetta, T.L. Massive MIMO for next generation wireless systems.
IEEE Commun. Mag. 2014, 52, 186–195. [CrossRef]
5. Rusek, F.; Persson, D.; Lau, B.K.; Larsson, E.G.; Marzetta, T.L.; Edfors, O.; Tufvesson, F. Scaling up MIMO:
Opportunities and challenges with very large arrays. IIEEE Signal Process. Mag. 2013, 30, 40–60. [CrossRef]
6. Björnson, E.; Larsson, E.G.; Debbah, M. Optimizing multi-cell massive MIMO for spectral efficiency: How many
users should be scheduled? In Proceedings of the 2014 IEEE Global Conference on Signal and Information
Processing (GlobalSIP), Atlanta, GA, USA, 3–5 December 2014. [CrossRef]
7. Rajatheva, N.; Osseiran, A.; Monserrat, J.F.; Marsch, P. 5G Mobile and Wireless Communications Technology;
Osseiran, A., Monserrat, J.F., Marsch, P., Eds.; Cambridge University Press: Cambridge, UK, 2016;
ISBN 978-1-107-13009-8.
8. Björnson, E.; Larsson, E.G.; Marzetta, T.L. Massive MIMO: Ten myths and one critical question. IEEE Commun. Mag.
2016, 54, 114–123. [CrossRef]
9. Jose, J.; Ashikhmin, A.; Marzetta, T.; Vishwanath, S. Pilot Contamination and Precoding in Multi-Cell TDD
Systems. IEEE Trans. Wirel. Commun. 2011, 10, 2640–2651. [CrossRef]
10. Björnson, E.; Hoydis, J.; Sanguinetti, L. Massive MIMO Networks: Spectral, Energy, and Hardware Efficiency.
IFound. Trends Signal Process. 2017, 11, 154–655. [CrossRef]
11. Zhu, X.; Wang, Z.; Dai, L.; Qian, C. Smart Pilot Assignment for Massive MIMO. IIEEE Commun. Lett. 2015,
19, 1644–1647. [CrossRef]
93
Electronics 2019, 8, 372
12. Dao, H.T.; Kim, S. Vertex Graph-Coloring-Based Pilot Assignment With Location-Based Channel Estimation for
Massive MIMO Systems. IEEE Access 2018, 6, 4599–4607. [CrossRef]
13. Kim, K.; Lee, J.; Choi, J. Deep Learning Based Pilot Allocation Scheme (DL-PAS) for 5G Massive MIMO System.
IIEEE Commun. Lett. 2018, 22, 828–831. [CrossRef]
14. Akbar, N.; Yan, S.; Yang, N.; Yuan, J. Location-Aware Pilot Allocation in Multicell Multiuser Massive MIMO
Networks. IEEE Trans. Veh. Technol. 2018, 67, 7774–7778. [CrossRef]
15. Muppirisetty, L.S.; Charalambous, T.; Karout, J.; Fodor, G.; Wymeersch, H. Location-Aided Pilot Contamination
Avoidance for Massive MIMO Systems. IEEE Trans. Wirel. Commun. 2018, 17, 2662–2674. [CrossRef]
16. Sohn, J.; Yoon, S.W.; Moon, J. On Reusing Pilots Among Interfering Cells in Massive MIMO. IEEE Trans.
Wirel. Commun. 2017, 16, 8092–8104. [CrossRef]
17. Zhu, X.; Wang, Z.; Qian, C.; Dai, L.; Chen, J.; Chen, S.; Hanzo, L. Soft Pilot Reuse and Multi-Cell Block
Diagonalization Precoding for Massive MIMO Systems. IEEE Trans. Veh. Technol. 2016, 65, 3285–3298. [CrossRef]
18. ZLiu, J.; Li, Y.; Zhang, H.; Guo, S. Efficient and Fair Pilot Allocation for Multi-cell Massive MIMO Systems.
In Proceedings of the 2017 9th International Conference on Wireless Communications and Signal Processing
(WCSP), Nanjing, China, 11–13 October 2017. [CrossRef]
19. Ma, S.; Xu, E.L.; Salimi, A.; Cui, S. A Novel Pilot Assignment Scheme in Massive MIMO Networks.
IEEE Commun. Lett. 2018, 7, 262–265. [CrossRef]
20. Wang, H.; Huang, Y.; Jin, S.; Du, Y.; Yang, L. Pilot Contamination Reduction in Multi-cell TDD Systems with Very
Large MIMO Arrays. Wirel. Pers. Commun. 2017, 96, 5785–5808. [CrossRef]
21. Du, T.; Wang, Y.; Wang, J.; Du, Y.; Yang, L. Multi-cell Joint Optimization to Mitigate Pilot Contamination for
Multi-Cell Massive MIMO Systems. In Proceedings of the 2017 IEEE 85th Vehicular Technology Conference
(VTC Spring), Sydney, NSW, Australia, 4–7 June 2017. [CrossRef]
22. Wang, P.; Zhao, C.; Zhang, Y.; Stuber, G.L. A Novel Pilot Assignment Approach for Pilot Decontaminating in
Massive MIMO Systems. In Proceedings of the 2017 IEEE Wireless Communications and Networking Conference
(WCNC), San Francisco, CA, USA, 19–22 March 2017. [CrossRef]
23. Zhao, M.; Zhang, H.; Guo, S.; Yuan, D. Joint Pilot Assignment and Pilot Contamination Precoding Design for
Massive MIMO Systems. In Proceedings of the 2017 26th Wireless and Optical Communication Conference
(WOCC), Newark, NJ, USA, 7–8 April 2017. [CrossRef]
24. Fernandes, F.; Ashikhmin, A.; Marzetta, T.L. Inter-Cell Interference in Noncooperative TDD Large Scale Antenna
Systems. IEEE J. Sel. Areas Commun. 2013, 31, 192–201. [CrossRef]
25. Chen Z.; Yang, C. Pilot decontamination in wideband massive MIMO systems by exploiting channel sparsity.
IEEE Trans. Wirel. Commun. 2016, 15, 5087–5100. [CrossRef]
26. Björnson, E.; Matthaiou, M.; Debbah, M. Massive MIMO with Non-Ideal Arbitrary Arrays: Hardware Scaling Laws
and Circuit-Aware Design. IEEE Trans. Wirel. Commun. 2015, 14, 4353–4368.. [CrossRef]
27. Li, M.; Jin, S.; Gao, X. Spatial Orthogonality-Based Pilot Reuse for Multi-cell Massive MIMO Transmission.
In Proceedings of the 2013 International Conference on Wireless Communications and Signal Processing,
Hangzhou, China, 24–26 October 2013. [CrossRef]
28. Ashikhmin, A.; Marzetta, T. Pilot Contamination Precoding in Multi-Cell Large Scale Antenna
Systems. In Proceedings of the 2012 IEEE International Symposium on Information Theory Proceedings,
Cambridge, MA, USA, 1–6 July 2012. [CrossRef]
29. Muller, R.; Cottatellucci, L.; Vehkapera, M. Blind pilot decontamination. IEEE J. Sel. Top. Signal Process. 2014,
8, 773–786. [CrossRef]
30. NGO, H.Q.; Larsson, E.G.; Marzetta, T.L. Energy and Spectral Efficiency of Very Large Multiuser. IEEE Trans.
Wirel. Commun. 2013, 61, 1436–1449. [CrossRef]
31. Sesia, S.; Toufik, I.; Baker, M. Network Architecture and Protocols. In LTE-The UMTS Long Term Evolution: From
Theory to Practice; John Wiley & Sons Ltd.: Chichester, UK, 2011; ISBN 978-0-470-66025-6.
94
Electronics 2019, 8, 372
32. Zhu, X.; Dai, L.; Wang, Z. Graph coloring based pilot allocation to mitigate pilot contamination for multi-cell
massive mimo systems. IEEE Commun. Lett. 2015, 19, 1842–1845. [CrossRef]
33. Sarkar, T.K.; Ji, Z.; Kim, K.; Medouri, K.; Salazar-Palma, M. A survey of various propagation models for mobile
communication. IEEE Antennas Propag. Mag. 2003, 45, 51–82. [CrossRef]
© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article
distributed under the terms and conditions of the Creative Commons Attribution (CC BY)
license (http://creativecommons.org/licenses/by/4.0/).
95
electronics
Article
Downlink Channel Estimation in Massive
Multiple-Input Multiple-Output with Correlated
Sparsity by Overcomplete Dictionary and
Bayesian Inference
Wei Lu 1, *, Yongliang Wang 1 , Xiaoqiao Wen 1 , Shixin Peng 2 and Liang Zhong 3
1 Air Force Early Warning Academy, Wuhan 430019, China; [email protected] (Y.W.);
[email protected] (X.W.)
2 National Engineering Research Center for E-Learning, Central China Normal University, Wuhan 430079,
China; [email protected]
3 Department of communication system, China University of Geoscience, Wuhan 430074, China;
[email protected]
* Correspondence: [email protected]; Tel.: +86-027-85695800
Abstract: We exploited the temporal correlation of channels in the angular domain for the downlink
channel estimation in a massive multiple-input multiple-output (MIMO) system. Based on the slow
time-varying channel supports in the angular domain, we combined the channel support information
of the downlink angular channel in the previous timeslot into the channel estimation in the current
timeslot. A downlink channel estimation method based on variational Bayesian inference (VBI)
and overcomplete dictionary was proposed, in which the support prior information of the previous
timeslot was merged into the VBI for the channel estimation in the current timeslot. Meanwhile
the VBI was discussed for a complex value in our system model, and the structural sparsity was
utilized in the Bayesian inference. The Bayesian Cramér–Rao bound for the channel estimation mean
square error (MSE) was also given out. Compared with other algorithms, the proposed algorithm
with overcomplete dictionary achieved a better performance in terms of channel estimation MSE
in simulations.
1. Introduction
Massive multiple-input multiple-output (MIMO) is the key technology for next generation wireless
communication. The large number of antennas enable high spectrum efficiency and lower power
consumption [1]. To get these benefits, the base station (BS) needs to acquire the channel stated
information (CSI) for uplink and downlink. Pilot-based channel estimation is widely used in wireless
communication systems. In the time division duplex (TDD) system, the channel reciprocity is used to
get the CSI by only estimating the uplink channel at BS. In the frequency division duplex (FDD) system,
the channel reciprocity cannot be used directly. In FDD massive MIMO system it is challenging to get
the downlink CSI with the conventional feedback scheme. In the conventional feedback scheme each
user estimates its channel and then feeds back the estimated CSI to the BS. The pilot and feedback
overheads are high for massive MIMO, since they are scaling linearly with the number of antennas.
Hence, it is important to design an efficient downlink channel estimation and feedback scheme for a
FDD massive MIMO system.
By exploiting the sparsity in massive MIMO channel, compressed sensing (CS) was applied in
the channel estimation and feedback. The users could feed the compressed training measurements
back to the BS, and an orthogonal matching pursuit (OMP) was used for downlink CSI recovery in [2].
In [3] the modified basis pursuit (MBP) was proposed by utilizing the partial priori signal support
information to improve the recovery performance. In [4] the support information of a signal in the
discrete fourier transform domain was incorporated into the weighted l1 minimization approach for CS
recovery, which could reduce the number of measurements by the size of the known part of support.
In [5] a three-level weighting scheme based on the support information was used for the weighted
l1 minimization and the simulation results showed superiority. In [6] we exploited the reciprocity
between uplink and downlink channels in the angular domain, and diagnosed the supports of the
downlink channel from the estimated uplink channel, and proposed a weighted subspace pursuit
(SP) channel estimation algorithm for FDD massive MIMO. It can be seen that CS was effective in the
channel estimation for massive MIMO.
However, most of these algorithms need the sparsity level in the estimation algorithm, which is
not practical in engineering scenarios. The Bayesian framework can be applied to the compressive
channel estimation. In [7], Bayesian estimation of sparse massive MIMO channel was developed in
which neighboring antennas shared among each other their information about the channel support.
In [8] a variational expectation maximization strategy was used for massive MIMO channel estimation,
and a Gaussian mixture prior model was designed to capture the individual sparsity for each channel
and the joint sparsity among users. In [9] a sparse Bayesian learning algorithm was proposed for FDD
massive MIMO channel estimation with arbitrary 2D-array. By the Bayesian framework in compressive
channel estimation the sparsity level is unnecessary, and it has relatively better recovery performance.
Additionally, there exists angular reciprocity in massive MIMO. For example, the channel covariance
matrices for uplink and downlink are reconstructed by making use of the angle reciprocity between
uplink and downlink channels in [10]. Hence it is promising to apply the angular reciprocity and
Bayesian framework in the compressive massive MIMO channel estimation.
Additionally, there exists angular reciprocity in the FDD massive MIMO. There is also time
correlation of channels. In [11] a differential compressive feedback in FDD massive MIMO was proposed
based on the channel impulses response (CIR) between timeslots, which were slow time-varying and
sparse, and the differential CIR between two CIRs in adjacent timeslots was sparse. Inspired by the
sparsity in the angular domain and time correlation of channels, the correlated angular sparsity can
also be exploited for massive MIMO channel estimation.
In this paper we proposed a downlink channel estimation in a TDD/FDD massive MIMO system.
The timeslots were divided into groups. In each group the estimated channel support information
of the previous timeslot was utilized by the following timeslot. The correlated angular sparsity
between timeslots in the downlink channel was utilized in the Bayesian inference for channel recovery.
We transformed the complex sparse vector to the real sparse vector recovery by Bayesian inference,
and the structural sparsity of the transformed real sparse vector was utilized. Meanwhile, the prior
support information from the estimated channel in the previous timeslot was made use of in modeling
the hidden hyperparameters in the Bayesian model. A Bayesian Cramér–Rao bound analysis is
presented, and simulations are given out to verify the performance of the proposed algorithm.
The main contributions were as follows: (1) a group-based channel estimation scheme was proposed,
in which previous estimated channel support information was used as the priori information in
the following timeslot due to the sparsity correlation; (2) priori information was merged into the
Bayesian inference algorithm for channel recovery; (3) the Bayesian Cramér–Rao bound for the channel
estimation mean square error (MSE) was analyzed.
The system model is illustrated in Section 2, while the proposed channel estimation algorithm
based on Bayesian inference is presented in Section 3. The Bayesian Cramér–Rao bound (BCRB) for the
channel estimation of mean square error (MSE) is given out in Section 4. Simulations and conclusions
are presented in Sections 5 and 6.
In the paper, we used the following notations. Scalars, vectors and matrices were denoted by
lower-case, boldface lower-case and boldface upper-case symbols. The probability density function
98
Electronics 2019, 8, 473
of a given random variable was denoted by p(·). Gamma(x|a, b) was the Gamma probability density
function (PDF) with shape parameters a and b for x, while Normal(x|c, d) was the Gaussian PDF with
parameters mean c and variance d for x. Γ(·) was the Gamma function, and ln(·) was the logarithm
function. Tr(·) stood for the trace operator. a (·) denoted the expectation operation with the PDF of
variable a.
2. System Model
We considered a massive MIMO TDD/FDD system with a single user, and assumed that the BS
was equipped with N antennas and the user terminal (UT) had a single antenna. For the downlink
channel estimation in the massive MIMO system, the BS transmitted the pilots to UT. The UT received
the pilots and fed back the received signal to the BS directly. The received signal yd (t) at the UT in the
t-th timeslot was written as
yd (t) = ρd Ahd (t) + nd (t) (1)
where hd (t) ∈ CN×1 is the downlink channel, A ∈ CTd ×N is the downlink pilots, Td is the pilot length,
ρd is the downlink received power, nd ∈ CTd ×1 is the received noise with each element to be i.i.d
Gaussian with mean 0 and variance σ2 , yd (t) ∈ CTd ×1 is the received signal at UT.
Since in the massive MIMO there existed sparsity, when Dd ∈ CN×M was the channel dictionary
for downlink channel which could be unitary dictionary or overcomplete dictionary (M > N,
their column vector had the form of steering vector with a different sampling angle), hda (t) was
the sparse representation with hd (t) = Dd hda (t). In this paper we applied the overcomplete dictionary
to present the sparse angular channel to get a better recovery performance. In the downlink channel
d
estimation, we needed to obtain ĥa (t) the estimated downlink channel in the angular domain in the
t-th timeslot.
By utilizing the sparse channel representation we then had
yd ( t ) = ρd ADd hda (t) + nd (t) (2)
For simplicity, the timeslot mark is omitted in the following equations. Since yd (t), hda (t), and nd (t)
are complex number vectors, we could rewrite Equation (2) into real number vectors as
⎡ ⎤
⎢⎢ Re( ρd ADd ) ⎥⎥
Re(yd ) ⎢⎢ −Im( ρd ADd ) ⎥⎥ Re(hda (t)) Re(nd (t))
= ⎢⎢⎢ ⎥⎥
Im(yd ) ⎣ Im( ρd ADd ) ⎥⎦ Im(hd (t)) + Im(nd (t)) (3)
Re( ρd ADd ) a
where Re(·) and Im(·) denote the real and imaginary parts respectively. For simplicity, we rewrote
Equation (3) as
y = Ah + n (4)
⎡ ⎤
⎢⎢ Re( ρd ADd ) −Im( ρd ADd ) ⎥⎥
Re(yd ) ⎢⎢⎢ ⎥⎥⎥, h = Re(hda (t))
where y = , A = ⎢ ⎥ and n =
Im(yd ) ⎢⎣ ⎥ Im(ha (t))
d
Im( ρd ADd ) Re( ρd ADd ) ⎦
Re(nd (t))
.
Im(n (t))
d
On the other hand, we considered the meaning of sparse angular channel representation hda (t).
If the transmission angles were allocated exactly at the sampling points in the channel dictionary Dd ,
then the corresponding coefficient in the hda (t) was nonzero. If the path number was smaller than
the antenna number, then hda (t) was sparse. However, there was leakage effect induced by dictionary
mismatch which will have deteriorated the sparsity of the angular channel representation [12]. When the
movement velocity of UT was not very high, e.g., v = 12 km/h, and the typical timeslot duration
τ = 0.5 ms, the movement distance of UT in one timeslot was 0.017 m. When the distance of UT and BS
99
Electronics 2019, 8, 473
was 200 m, the angle change for the line of sight (LoS) transmission in one timeslot was 0.0049◦ which
was much smaller than the sampling interval in the dictionary. For the non-LoS (NLoS) transmission,
the angle change was also small which is discussed in Section 4.1. Hence the transmission angle change
between two timeslots is very small if the transmission environment doesn’t change dramatically,
and there is correlation in the angular channel sparsity between adjacent timeslots. In other words,
the information regarding the estimated angular channel in the previous timeslot could be utilized in
the current channel estimation.
It was proven that the prior support information could improve the channel recovery
performance [3–6]. Hence in this paper we made use of the prior support information from the previous
timeslot to improve the Bayesian channel estimation. In the following section we have discussed how
to merge the prior information into the Bayesian inference algorithm for channel estimation.
3. Proposed Algorithm
We designed a three-layer hierarchical graphical model as shown in Figure 1. In the first layer,
h was assigned a Gaussian prior distribution
2N
p(h|α) = p(hi αi ) (5)
i=1
where hi and αi are the i-th entry in h and α respectively, p(hi αi ) = Normal(hi 0, αi ) and αi is the
inverse variance of the Gaussian distribution. When hi is close to 0, then αi is very large, and vice versa.
In the second layer, we assumed a Gamma distribution as hyperpriors over the hyperparameters
αi , and it can be presented as
2N
p(α) = Gamma(αi ai , bi ) (6)
i=1
where Gamma(·) is the Gamma PDF, and the parameters ai and bi characterize the shape of Gamma
PDF. For fixed ai , the larger bi is, the smaller αi is; then hi tends to be nonzero. In the sparse Bayesian
learning ai and bi were set to be very small for non-informative hyperprior over αi [13].
In our model, we set ai to be constant with a predefined value, and we modeled bi as random
parameters. In Figure 1 it could be found that the entries of y were divided into two sets by their
indices, i.e., S, S+N and S, S+N c , where S was the set with channel support indices from the previous
timeslot, and S+N was the set with each index in S added by N, since we converted the complex
system model to the real system model as Equation (3). S, S+N c was the complementary set of S, S+N .
For example, in the (t − 1)-th timeslot, the positions of nonzero entries or called supports in ha (t − 1)
d
were S = {4, 5, 6}, then S+N = {4 + N, 5 + N, 6 + N}. The probable supports for hda (t) in the current
t-th timeslot can be assumed to be the same as those for previous (t − 1)-th timeslot for simplicity.
On the other hand, we could have also diagnosed the probable channel supports further by taking the
angle deviation and leakage effects into consideration. In this paper we adopted the support diagnosis
algorithm, and the details can be found in [6].
For y j , j ∈ S, S+N , we employed a Gamma distribution over the hyperparameters bi in the third
layer as
Gamma(bi |c, d) = Γ(c)−1 dc bc−1
i e
−dbi
(7)
where c and d characterize the shape of Gamma PDF. By the system model and assumptions for massive
MIMO, we could use a Bayesian inference to perform the sparse channel recovery.
100
Electronics 2019, 8, 473
Figure 1. Graphical model for the channel estimation with Bayesian inference. The nodes with
double circle, single circle and square correspond to the observed data, hidden variables and
parameters, respectively.
According to the standard Bayesian inference [14], let z h, α, b , we have
where constant is a constant used for p(zi ) normalization, p( y, z) is the joint pdf for h and z, and zi can
be h, α, andb. We have p(y, z) = p(z y)p( y) . We assume p(z y) posterior independence among the
hidden variables z, then p(z y) ≈ p(z) , and p(z) is the product of PDF of h, α, and b.
In order to make use of the prior support information from the previous timeslot and the structure
sparsity in Equation (4), we needed to make some modifications to the standard Bayesian inference.
The main considerations for the modifications were as follows:
(I) Since we rewrote Equation (2) as Equation (4), if hda,i was nonzero, then hi and hi+N were
nonzero simultaneously. Hence it was wise to assume that bi and bi+N were the same;
(II) In the standard Bayesian learning ai and bi were set to be very small for non-informative
hyperprior over αi . This assumption was valid if no prior information was provided. If the prior
support information was available, such as that the support information of the previous timeslot could
be used for channel estimation in the coming timeslot by sparsity correlation, it was wise to assume
that the supports between adjacent timeslots were partially common. If the i-th element in the angular
channel vector was nonzero, then the hyperparameter bi and bi+N tended to be variables rather than to
be fixed small numbers, which meant only for the indices from the prior support set S the third layer
prior model was adopted.
It can be seen that the consideration (II) was similar to [15]. However, our proposed algorithm
was extended for a complex number system and the structure sparsity was considered. However,
on the other hand, the overcomplete dictionary was adopted in our algorithm.
The proposed uplink-aided downlink channel estimation based on Bayesian inference was
as follows:
(i) Update of p(h)
According to Equation (8), by ignoring the terms which are independent of h, we have
101
Electronics 2019, 8, 473
ln p(h) ∝ Eα,b ln p(y h) + ln p(h α) + ln p(b)
∝ Eα,b ln p(y h) + ln p(h α) (9)
−1 T T
= 2σ2
(y − Ah) (y − Ah) − 12 h Λh
where Λ = diag Eα [αi ] , σ2 is the noise variance in the
system model, the vectors b and α are
comprised
by bi and αi respectively. Since p(y h) and p h α are a Gaussian distribution, then p h follows a
Gaussian distribution with the mean μ and covariance φ given by
1 T
μ= ΦA y (10)
σ2
1 T
Φ= A A+Λ (11)
σ2
⎧ ⎛ ⎞ ⎫
⎪ ⎪ )⎟ ⎪
⎪
2 2
⎨ ⎜⎜ E (h +h
⎪ ( + − ) α − ⎜bi + h,b i i+N ⎟⎟⎟αi ⎬
⎜
⎪
c⎩
a i 0.5 1 ln i ⎝ 4 ⎠ ⎪
⎪
⎭
i∈{S,S+N }
where S is the estimated support set from the previous timeslot. Since the complex system model was
converted in Equation (4). By (II), S+N {si + N} was also the support set in the converted system
model in Equation (4). For i ∈ S, S+N , bi is variable number, bi and bi+N were assumed to be the same,
we used 0.5Eh,b (bi + bi+N ) to present Eh,b (bi ). The same assumption was applied to hi and hi+N with
2 2 2
Eh,b (hi ) = 0.5Eh,b (hi + hi+N ). In this way the structural sparsity was utilized.
Since p(α|a, b ) is the Gamma distribution and p h α is the Gaussian distribution, p(α) is the
ai and '
Gamma distribution. Then p(αi ) is also the Gamma distribution with the updated parameters ' bi
given by
'ai = ai + 0.5 (13)
⎧
⎪
⎪ Eh,b (bi +bi+N )
2 2
E ( hi + hi + N )
⎪
⎪ + h,b 4 , i ∈ S, S+N
⎪
⎨
'
bi = ⎪
2
(14)
⎪
⎪
⎪ 2 2
⎩ b + Eh,b (hi +hi+N ) , i ∈ S, S c
⎪
i 4 +N
102
Electronics 2019, 8, 473
According to Equation (8), by ignoring the terms which are independent of b, we have
ln p(b{S,S+N } ) ∝ Eα,h ln p(y h) + ln p(h α) + ln p(α a, b) + ln p(b c, d)
where b{S,S+N } is comprised by the entries indicated by S, S+N in b. In (15) the α, a, b, c, d are also
comprised by their indicated S, S+N , the subscript S, S+N is omitted for simplicity. As shown
in Figure 1, b{S,S+N } was modelled as a Gamma distribution. Since p(αi |ai , bi ) and p(bi |ci , di ) were a
~
Gamma distribution, p(b{S,S } ) was Gamma('
+N
bi∈{S,S } ' +M
ci and d'i were given by
ci , d'i ), and the updated '
'
ci = ai + ci (16)
Then the Bayesian inference for the channel estimation was executed iteratively among (i), (ii),
and (iii). The details of the algorithm are summarized in step 3 of Algorithm 1. When the estimated
channel vector h was recovered, we needed to convert it to the complex vector hda according to
Equation (3).
Algorithm 1 Downlink channel estimation with variational inference algorithm and overcomplete dictionary.
Input: A, y, σ2
Output: h
1. Divide the timeslots into groups, and with each group comprised by tg timeslots.
2. For the first timeslot in the group, use variational Bayesian inference (VBI) for channel estimation, and
obtain the angular channel supports.
3. For the rest of the timeslots in the group, utilize the support information from the previous timeslot for
channel estimation one by one. The recovery algorithm in each timeslot is as follows:
3.1. Initialize α, a, b, c, d.
T T 2
3.2. μ = σ12 φA y, φ = ( σ12 A A + Λ), Eh,b (hi ) = μ2i + φi,i , where Λ = diag Eα [αi ] , μi is the i-th
entry in μ, and φi,i is the i-th diagonal entry in φ.
3.3. Update ' ai and ' ai and '
bi according to Equations (13) and (14) in (ii) (' bi are the updated ai and bi ,
and ai and bi are the results from last iteration); then according to the property of the Gamma
ai /'
distribution variable, Eα (αi ) = ' bi .
3.4. Update 'c and d'according to Equations (16) and (17) in (iii) ('
c and d'are the updated c and d, and c
and d are the results from last iteration); then according to the property of the Gamma
distribution variable, Eα ('
bi ) = '
c/d.'
3.5. Go to step 3.2 until stop criteria meets.
3.6. Then h = μ.
4. Go back to step 2 for a new group of timeslots.
In a practical massive MIMO system, the transmission environment may change suddenly, in this
way the correlation of sparsity between adjacent timeslots will deteriorate, and the previous channel
support information cannot be utilized. On the other hand, the error will accumulate if the previous
channel support information is utilized timeslot by timeslot. Hence, the initialization is important for
the robustness and efficiency of the algorithm. As shown in Figure 2 divided the timeslots into groups,
and each group was comprised of several timeslots. During the channel estimation for each group,
103
Electronics 2019, 8, 473
the VBI was used for the channel estimation in the first timeslot, and then the proposed algorithm was
executed for the remaining timeslots in which the channel support information of the previous timeslot
was made use of by the current timeslot. This procedure is detailed in steps 1, 2 and 4 in Algorithm 1.
Figure 2. Channel estimations by group. Each block represents one timeslot, and the block filled with
grey is the timeslot with variational Bayesian inference (VBI) for the channel estimation, while the
blank blocks are the timeslots with the proposed algorithm for channel estimation.
4. Discussion
Figure 3. Ellipse geometry channel model for line of sight (LoS) and non-LoS (NLoS) transmission.
In order to illustrate the angle change Δθ during one timeslot, we assumed that dNLoS was 800 m,
the velocity of UT was 14.4 km/h, and the typical timeslot duration τ = 0.5 ms, then the movement
distance of UT in one timeslot was 0.02 m. By changing the distance between BS and reflector, as shown
in Figure 4, the angle change was not more than 0.025◦ . It should be noted that when the LoS distance
104
Electronics 2019, 8, 473
and the dNLoS were fixed, BS and reflector distance could not be arbitrary vales due to triangle inequality.
Hence, the angle of arrival or departure changed slowly and then there was sparsity correlation among
the angular channels for adjacent timeslots.
/R6GLVWDQFH
/R6GLVWDQFH
/R6GLVWDQFH
DQJHOFKDQJH
%6DQGUHIOHFWRUGLVWDQFH P
Figure 4. Transmission angle change during one timeslot with a different LoS distance and different
distances between the base station (BS) and reflector.
According to the system model in Section 2, h, σ are independent, the Fisher information matrix J
is block diagonal. We can rewrite p(y, z) as
Then the BCRB on the MSE of the estimated channel vector h is given by
( 2
) * +
E h − h ≥ tr J−1 (21)
hi h j
( )
∂2 log p(y,z)
where Jh h = Ez − is the fisher information sub-matrix. Thus, we can obtain the Bayesian
i j ∂hi ∂h j
Cramér–Rao bound of the minimum mean square error for the estimated channel h as shown in
Proposition 1.
Proposition 1. The BCRB of MSE for the channel estimation h is represented as
105
Electronics 2019, 8, 473
( ) ⎛* * + +−1 ⎞ ,
2 ⎜⎜ 1 1 T ⎟⎟ 1 , 1
E h − h ≥ tr⎜⎜⎝ diag(E ) + 2 A A ⎟⎟⎠ = + (22)
αi σ 1+c
+ λi bi
+ λi
i∈S adi σ iS a σ
T T
where S is the diagnosed support set, λi is the eigenvalues of A A, and A A ∈ R2N×2N , and a, bi , c, and di are
T
the parameters in the Bayesian model in Figure 1. When Td , M → ∞ and Md = β, according to the random
matrix theory, we have
( )
ˆ H ˆ 1
E h−h h−h ≥ |S| · |S| 1+c
1
λ + (N − |S|) · 1
(N−|S|)
1
max(b) λi
i∈S a min(d)
++ σi iS a +σ
(23)
amin(d) F(snr1 ,β) F(snr2 ,β)
→ |S| 1+c 1− 4βsnr1 + (N − |S|) maxa(b) 1 − 4βsnr2
* +2
amin(d)
√ 2 √ 2
where snr1 = (1+c)σ
, snr2 = a
σmax(b)
, F(x, z) = x 1 + z + 1 − x 1 − z + 1 , min(d) and
max(b) are the minimum and maximum entries in d and b.
The proof of proposition 1 is presented in Appendix B. From proposition 1, we can see that the
MSE lower bound is related to the priori support size |S|, (1 + c)/min(d) and max(b) for the massive
MIMO channel estimation.
5. Simulations
In the simulation, the support diagnosis algorithm in [6] was adopted, and we assumed that
the transmission angle change between timeslots was within 1 degree. The pilot length was 50,
and antenna number at the BS was 100. The channel was generated according to the spatial model as
defined in 3GPP TR25.996. We compared our proposed algorithm with a unitary dictionary with a
size of 100 and the overcomplete dictionary with a size of 150, 200, and 250, and compared this with a
Bayesian sparse learning (SL) [16], weighted subspace pursuit (WSP) [6], weighted l1 minimization
(W-l1 min) [5], weighted iteratively reweighted least square(W-IRLS), IRLS [17], compressive sampling
matched pursuit (COSAMP) in [11], and l1 minimization (l1 min) [18].
In order to evaluate the channel estimation performance, we used a normalized mean-square
error (MSE) between true and estimated channel vectors as follows:
-- -
d -2
1 , -ĥ − h -
d
MSE = -- --2 (24)
T -hd -
T
d
where T is the number of trials, ĥ and hd are the estimated and original channel vector, respectively
for each trial. In the simulations the trial number T was 250.
In Figure 5 the overcomplete dictionary size was 150 in the proposed algorithm. It could be seen
that when the unitary dictionary was used, our proposed algorithm outperformed WSP, COSAMP
and IRLS, but was a little worse than W-l1 with a small gap. However, when the overcomplete
dictionary was used, our proposed algorithm outperformed other algorithms, but almost had the
same performance as SL with a little performance improvement which could be seen in the zoomed-in
subfigure. The overcomplete dictionary in the proposed algorithm can dramatically improve the MSE
performance due to the fact that there are more atoms in the overcomplete dictionary than in the
unitary dictionary which can improve the sparsity in the angular channel; however, it doesn’t mean
that the larger the overcomplete dictionary size is, the better performance it has, which is shown in
Figure 6.
106
Electronics 2019, 8, 473
Figure 5. Comparisons of channel estimation mean square error (MSE) for different algorithms.
Figure 6. Comparisons of channel estimation of MSE for the proposed algorithm with different
dictionary sizes.
We compared the performance of the proposed algorithm with different dictionary sizes in
Figure 6. It can be seen that in the high SNR region the performance improved when an overcomplete
dictionary was used, but the MSE performance gain did not improve when increasing the dictionary
107
Electronics 2019, 8, 473
size. For example, the algorithm with a dictionary size of 150 had a relatively better performance than
with a dictionary size of 100. However, the performances with a dictionary size of 200 and 250 almost
gave the same trends as that with a dictionary size of 150. This was because the larger dictionary would
induce angel ambiguity because the correlation of atoms increased. Hence, in the practical engineering,
the dictionary size is not recommended to be very large. A large dictionary size is computationally
expensive and the benefit is limited. It also should be noted that in the low SNR region the MSE
performance with a larger dictionary size did not always do better than those with a smaller dictionary
size. For example, when the SNR was 0 dB, they hadsimilar performance. The reason was that in
the low SNR region the estimated channel support of the previous timeslot was not accurate enough,
and on the other hand larger dictionary size would have deteriorated the dictionary incoherence.
We compared the runtime and convergence performance of the proposed algorithm with a
different dictionary size in Figure 7. The relative error was defined as the ratio of the difference of
adjacent iteration results to the previous iteration result. It can be seen that the proposed algorithm with
dictionary size 150 converged fast than with a dictionary size of 100. However, the improvement had
its price, and the runtime for the proposed algorithm with dictionary size 150 was longer which meant
that the computational complexity was higher with a larger dictionary size. Based on the simulation
results shown in Figures 6 and 7, when the antenna at BS is 100, the dictionary size is recommended to
be set at 150 or so to balance the performance improvement and computation complexity.
Figure 7. Comparisons of runtime and convergence performances of the proposed algorithm with
orthogonal dictionary (size is 100) and overcomplete dictionary (size is 150).
6. Conclusions
In this paper we proposed a downlink channel estimation algorithm based on overcomplete
dictionary and variational Bayesian inference. We converted the complex system model to a real model
and exploited the correlation of angular channel sparsity in adjacent timeslots. In the algorithm we
divided the timeslots into groups and made use of the channel support information of the previous
timeslot to the channel estimation in the current timeslot within each group. The sparsity correlation
and Bayesian Cramér–Rao bound for the MSE of channel estimation was analyzed. Compared with
other recovery algorithms, such as WSP, IRLS, WIRLS, l1 min, W-l1 min and COSAMP, our proposed
algorithm with overcomplete dictionary had a relatively better performance. Moderate overcomplete
dictionary can improve the MSE performance of channel estimation to balance the computational
complexity and performance gain.
Author Contributions: Conceptualization and methodology, W.L.; validation, X.W., S.P. and L.Z.; formal analysis,
W.L.; writing—original draft preparation, W.L.; supervision, Y.W.
108
Electronics 2019, 8, 473
Funding: This work was supported in part by the National Science Foundation of China (No.61601509 and
61601334), the China Postdoctoral Science Foundation Grant (No.2016M603045 and 2018M632889) and the
self-determined research funds of CCNU(CCNU18QN007) from the colleges basic research and operation of MOE.
Acknowledgments: Thank you to the reviewers for their valuable comments.
Conflicts of Interest: The authors declare no conflicts of interest.
Appendix A
Since d2Δ is very small compared with d1 and dLoS , by the first-order approximation we have
* +
d21 +d2LoS −(dNLoS −d1 )2 ∓2dΔ (dNLoS −d1 )
θ ± Δθ ≈ arccos 2d1 dLoS
Then we have
2dΔ (dNLoS − d1 ) 1
Δθ ≈ √ . (A5)
2d1 dLoS 1 − cos2 θ
Appendix B
Proof of Proposition 1.
Let z h, σ , the we have
Ez (z − ẑ)(z − ẑ)T ≥ J−1 . (A6)
Since h, σ are independent, the Fisher information matrix J is block diagonal, and can be presented as
⎡ ⎤
⎢⎢ J 0 ⎥⎥⎥
J = ⎢⎢⎣ h,h ⎥⎦. (A7)
0 Jσ,σ
109
Electronics 2019, 8, 473
Because p(y, z) = p y z p(h α)p(α b)p(b)p(σ) , we have
2
∂ log p(y,z)
J = Ez − ∂z ∂z
2
i j
( ) 2
∂ log p(y|z) ∂2 log p(h α) ∂ log p(α|b)
= Ez − ∂z ∂z + Ez − ∂z ∂z + Ez − ∂z ∂z +
i j i j i j (A9)
2 2
∂ log p(b) ∂ log p(σ)
Ez − ∂z ∂z + Ez − ∂z ∂z
i j i j
Since we mainly focus on the MSE of h, we only need to analyze Jh,h . We discuss the above
formula part by part as follows:
2
∂ log p(y|z)
1) Let Jh,h (y) = Ez − ∂z ∂z , according to the Bayesian model in Figure 1, we have
i j
T T
A A A A
p y z ∼ Normal(y|Ah, σI)thenJh,h (y) = Ez { }= .
σ σ
( )
∂2 log p(h α) .
2N
2) Let Jh,h (h) = Ez − ∂z ∂z , and we have P h α = Normal(hi 0, αi ) , then we get Jh,h (h) =
i j
i=1
Ez α1i .
2 2 2
∂ log p(α|b) ∂ log p(b) ∂ log p(σ)
3) Because Ez − ∂z ∂z , Ez − ∂z ∂z and Ez − ∂z ∂z are independent with h, they are
i j i j i j
T
all 0. Then in summary, we get Jh,h = diag(E α1 ) + σ12 A A.
i
Since the priori support set information is used in our proposed algorithm, a three-layer model is
constructed for the elements belonging to the priori support
set, and a two-layer model is used for the
elements not belonging to the priori support set, so EZ α1 has different expressions for the two cases.
i
EZ α1 in the two cases are discussed as follows:
i
1) When i belongs to the priori support set, according to the three-layer graph model we have
2N
p(α) = Gamma(αi a, bi ) , (A10)
i=1
Then we get /∞
p(αi ) = 0
p(αi bi )p(bi )dbi
/∞
= Γ(a)−1 bai αa−1 e−bi αi Γ(c)−1 dci bc−1 e−di bi dbi
0 i i . (A12)
Γ (a+c)
= Γ(a)−1 Γ(c)−1 αa−1 dci
i (ai +di )a+c
Accordingly, we have
/∞ −1 −1 a−1 c Γ(a+c)
E 1
αi = 0 αi Γ (a) Γ (c) αi di (ai +di )a+c dαi
1
/∞ −a−c , (A13)
1 Γ(a+c) αi a−1 αi α
= 0 αi Γ(a)Γ(c) d1 d1 +1 ddi
1
Γ (a+c)
α a−1 α −a−c
where Γ(a)Γ(c) d i di + 1
i
satisfies the probability density function of Beta prime
i
distribution. According to the properties of the Beta prime distribution, when −a < −1 < c,
we have
110
Electronics 2019, 8, 473
( )
ˆ H ˆ 1
E h−h h−h ≥ |S| · |S| 1+c
1
λ + (N − |S|) · 1
(N−|S|)
1
max(b) λi
i∈S a min(d)
++ σi iS a +σ
. (A14)
amin(d) F(snr1 ,β) F(snr2 ,β)
→ |S| 1+c 1− 4βsnr1 + (N − |S|) maxa(b) 1 − 4βsnr2
2) When i does not belong to the priori support set, according to the high-order moment properties
for the general gamma distribution, we have
( )
1 bi
E = . (A15)
αi a
T T
where S is the diagnosed support set, λi is the eigenvalues of A A,and A A ∈ R2M×2M .
2π
When overcomplete dictionary is as Dd = √1 e−j M kn , k ∈ {1, · · · , M}, n ∈ {1, · · · , N}, and A is
N n,k
1
Gaussian random matrix with each element is mean 0 and variance Td , then ADd is complex Gaussian
ρd
random matrix. Then A is Gaussian random matrix with mean 0 and variance 2T .
d
According to the random matrix theory, for N × K dimensional random matrix H with each element
K
is independent and is variable with mean 0 and variance 1/N, when K, N → ∞ and N → β ,then the
+
empirical distribution of eigenvalues of H H converges almost surely as fβ (x) = 1 − β1 δ(x) +
T
√
(x−a)+ (b−x)+ 0 2 0 2
2πβx , where (x)+ = max(0, x), a = 1 − β , b = 1 + β .
ρd
Since A ∈ R2Td ×2M , and its element is Gaussian random variable with mean 0 and variance 2Td By
applying the above results for the empirical distribution of eigenvalues of HT H, when Td , M → ∞ and
Td T
M = β, the empirical distribution of eigenvalues λ of A A converges almost surely as
* ++ (λ − a)+ (b − λ)+
1
fβ ( λ ) = 1 − δ(λ) + (A17)
β
2πβλ ρd
0 2 0 2
where a = ρd 1 − β , b = ρd 1 + β . When s , M → ∞ and s
M = μ, we have
( )
ˆ H ˆ 1
E h−h h−h ≥ |S| · |S| 1+c
1
λ + (N − |S|) · 1
(N−|S|)
1
max(b) λi
i∈S a min(d)
++ σi iS a +σ
, (A18)
amin(d) F(snr1 ,β) F(snr2 ,β)
→ |S| 1+c 1− 4βsnr1 + (N − |S|) maxa(b) 1 − 4βsnr2
* +2
amin(d)
√ 2 √ 2
where snr1 = (1+c)σ
, snr2 = a
σmax(b)
and F(x, z) = x 1+ z +1− x 1− z +1 .
Then the proofs are complete.
111
Electronics 2019, 8, 473
References
1. Lu, L.; Li, G.Y.; Swindlehurst, A.L.; Ashikhmin, A.; Zhang, R. An overview of Massive MIMO: Benefits and
challenges. IEEE J. Sel. Top. Signal Process. 2014, 8, 742–758. [CrossRef]
2. Rao, X.; Lau, V.K.N. Distributed compressive CSIT estimation and feedback for FDD multi-user massive
MIMO systems. IEEE Trans. Signal Process. 2014, 12, 3261–3271.
3. Vaswani, N.; Lu, W. Modified-CS: Modifying compressive sensing for problems with partially known
support. IEEE Trans. Signal Process. 2010, 9, 4595–4607. [CrossRef]
4. Borries, R.V.; Miosso, C.; Potes, C. Compressed sensing using priori information. In Proceedings of the 2nd
IEEE International Workshop on Computational Advances in Multi-Sensor Adaptive, St. Thomas, VI, USA,
12–14 December 2007; pp. 121–124.
5. Tseng, C.C.; Wu, J.Y.; Lee, T.S. Enhanced compressive downlink CSI Recovery for FDD Massive MIMO
systems using weighted Block l1 minimization. IEEE Trans. Commun. 2016, 3, 1055–1066. [CrossRef]
6. Lu, W.; Wang, Y.; Fang, Q.; Peng, S. Downlink compressive channel estimation with support diagnosis in
FDD massive MIMO. J. Wirel. Commun. Netw. 2018, 115, 1–12. [CrossRef]
7. Masood, M.; Afify, L.H.; Al-Naffouri, T.Y. Efficient coordinated recovery of sparse channels in massive MIMO.
IEEE Trans. Signal Process. 2015, 1, 104–118. [CrossRef]
8. Cheng, X.; Sun, J.; Li, S. Channel estimation for FDD multi-user massive MIMO: A variational Bayesian
inference-based approach. IEEE Trans. Wirel. Commun. 2017, 11, 7590–7602. [CrossRef]
9. Dai, J.; Liu, A.; Lau, V.K.N. FDD massive MIMO channel estimation with arbitrary 2D-Array Geometry.
IEEE Trans. Signal Process. 2018, 10, 2584–2599. [CrossRef]
10. Xie, H.; Gao, F.; Jin, S.; Fang, J.; Liang, Y. Channel Estimation for TDD/FDD Massive MIMO Systems with
Channel Covariance computing. IEEE Trans. Wirel. Commun. 2018, 6, 4206–4218. [CrossRef]
11. Shen, W.; Dai, L.; Shi, Y.; Shim, B.; Wang, Z. Joint Channel Training and Feedback for FDD Massive MIMO
Systems. IEEE Trans. Veh. Technol. 2016, 10, 8762–8767. [CrossRef]
12. Tauböck, G.; Hlawatsch, F.; Eiwen, D.; Rauhut, H. Compressive Estimation of Doubly Selective Channels in
Multicarrier Systems: Leakage Effects and Sparsity-Enhancing Processing. IEEE J. Sel. Top. Signal Process.
2010, 2, 255–271. [CrossRef]
13. Tipping, M.E. Sparse Bayesian learning and the relevance vector machine. J. Mach. Learn. Res. 2001, 1,
211–244.
14. Christopher, M. Bishop. Bayesian Linear Regression. In Pattern Recognition and Machine Learning;
Springer-Verlag: Heidelberg, Germany, 2006; pp. 152–160.
15. Fang, J.; Shen, Y.; Li, F.; Li, H.; Chen, Z. Support knowledge-aided sparse Bayesian learning for compressed
sensing. In Proceedings of the 2015 IEEE International Conference on Acoustics, Speech and Signal Processing,
Brisbane, Australia, 19–24 April 2015; pp. 3786–3790.
16. Ji, S.; Xue, Y.; Carin, L. Bayesian compressive sensing. IEEE Trans. Signal Process. 2008, 6, 2346–2356. [CrossRef]
17. Lu, W.; Wang, Y.; Fang, Q.; Peng, S. Compressive Channel Estimation Based on Weighted IRLS in FDD
Massive MIMO. Wirel. Pers. Commun. 2018, 2, 1–10. [CrossRef]
18. Candes, E.J.; Romberg, J.K.; Tao, T. Stable signal recovery from incomplete and inaccurate measurements.
Commun. Pure Appl. Math. 2006, 8, 1207–1223. [CrossRef]
© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access
article distributed under the terms and conditions of the Creative Commons Attribution
(CC BY) license (http://creativecommons.org/licenses/by/4.0/).
112
electronics
Article
Multiple-Symbol Non-Coherent Detection for
Differential QAM Modulation in Uplink Massive
MIMO Systems
Hieu Trong Dao and Sunghwan Kim *
School of Electrical Engineering, University of Ulsan, 93 Daehak-ro, Nam-gu, Ulsan 44610, Korea;
[email protected]
* Correspondence: [email protected]; Tel.: +82-52-259-1401
Abstract: In this paper, we propose a novel multiple-symbol detector based on maximum likelihood
metric for differential quadrature amplitude modulation in massive multiple-input multiple-output
(MIMO) systems. While current research on differential modulation in massive MIMO has focused
on two consecutive symbols, our proposed detector is based on multiple-symbol, which is larger
than or equal to two. Moreover, we derive new distance based on the proposed detector. To encode
and decode data, we apply existing look-up table algorithm using the proposed distance, which is
known as optimum encoding algorithm for differential modulation. Simulation results show the
improvement based on the bit-error-rate performance since the proposed detector and distance vary
according to the channel statistic information.
1. Introduction
Massive multiple-input multiple-output (MIMO) transmission technique has gained a lot of
attention in recent decades [1–13], since it can achieve significant improvement in terms of the energy
and spectral efficiency while using simple signal processing [1–4]. Massive MIMO systems prefer
operating in the time division duplex (TDD) mode in which users must synchronously send mutually
orthogonal pilot signals to the corresponding base station (BS) so that the BS can estimate the channels.
This method uses the estimated channels to perform signal processing [5–13], so that pilot signals
account for a significant part of the total coherent interval, which decreases the spectral efficiency.
In addition, when the number of users is large, the orthogonal pilot set has to be reused in every cell,
which leads to pilot contamination problems; this is considered as a performance bottleneck in massive
MIMO systems.
The authors of [8] investigate the power allocation to improve the spectral efficiency, this require
a large information exchange in backhaul of system between BSs or between BSs and users. Besides,
the algorithm to optimize the power is quite complex. Some semi-blind and blind channel estimation
methods in uplink massive MIMO have been proposed. In [9], the authors proposed an eigenvalue
decomposition-based method to blindly estimate the uplink channel from the data signal. However,
they assumed that the number of antennas was very large such that the channel vectors become
mutually orthogonal. The authors of [10] derived a new channel estimator based on subspace projection.
However, this channel estimation algorithm relies heavily on the eigenvalues of the channel matrix.
Interestingly, the authors of [11] proposed an energy detection scheme in which data symbols could
be detected without relying on estimated channels. The scheme used in [11] requires designing
unique modulated signal constellation for each user in the system. Another promising technique
that does not require estimated channels is differential modulation, but it has not received much
attention in the massive MIMO research field until now. In [12], differential quadrature amplitude
modulation (DQAM) was proposed for massive MIMO systems. The modulation scheme used in [12]
was based on the asymptotic behavior of the channel when the number of BS antennas goes to
infinity; however, the authors of [12] did not show the detector when the number of BS antennas is
finite. The authors of [13] generalized the QAM detector in [12] and proposed a new detector and
non-coherent distance with better performance when the number of BS antennas is not very large.
The differential encoding part of [13] was done via the look-up table algorithm used in [14,15]; this is
known as the optimum encoding algorithm for differential modulation. Besides, both [12,13] can only
detect two consecutive symbols at a time. Recently, the authors of [16] developed a new differential
detector based on multiple-symbol differential detection (MSDD) and the generalized likelihood ratio
test (GLRT) criterion. However, the authors of [16] only consider the case of M-ary Phase Shift Keying
modulation (M-ary PSK).
In this paper, we propose a novel multiple-symbol detector for DQAM based on the maximum
likelihood metric, which can detect more than two symbols at a time and varies following channel
condition to adapt better with the change in environment. In addition, we propose a novel distance
which can be used to encode and decode data by using the look-up table algorithm in [14,15] for DQAM
encoding. Since the proposed scheme varies following the change in channel statistic information
while the schemes in [12,13] are unchanged, they adapt better to the change of environment and show
significantly better performance when compared to previous works [12,13].
where ρ presents the average signal-to-noise ratio (SNR); x = [ xt , xt+1 , ..., xt+ L−1 ] T is the transmit signal
vector with length L( L ≤ T ) and E[||x||2 ] = L, where elements of x are taken from the conventional
QAM constellation as shown in Figure 1a, nm is the additive white Gaussian noise vector at the mth BS
antenna whose entries follow CN (0, 1); and hm is the channel coefficient with CN (μh , σh2 ). An example
of the block fading model and signal vector is illustrated as in Figure 1b.
(a) (b)
Figure 1. Conventional 16-QAM constellation and illustration of block fading. (a) 16-QAM constellation;
(b) an illustration of block fading length T = 7 and signal vector length L = 3.
For simplicity, we normalize the channel so that μ2h + σh2 = 1. Since Rayleigh and Rician fading
models are very popular in evaluating system performance on both massive and regular MIMO
114
Electronics 2019, 8, 693
where Kr represents the Rician factor; In a special case, when Kr = 0, the channel becomes a Rayleigh
fading channel. Additionally, hLOS = [1, exp− jπ sin(θ ) , ..., exp− jπ ( M−1) sin(θ ) ] T ∈ C M×1 with the arrival
angle θ; is the light-of-sight (LOS) component when the antenna spacing is a half of wavelength.
hNLOS ∈ C M×1 denotes the non-light-of-sight (NLOS) component whose elements follow i.i.d
Gaussian variables with zero mean and unit variance.
In [12,13], the authors considered two consecutive tth and (t − 1)th instants, with channel vectors
ht ; and ht−1 , and assumed ht ≈ ht−1 . The received signal vector at the tth instant is given as
√
yt = ρht xt + nt . (3)
where xt is taken from a 16-DQAM constellation based on [19]. With a very large number of BS
antennas M, they have
1 H 1 H
lim ht ht−1 = 1, lim nt ht−1 = 0,
M→∞ M M→∞ M
(4)
1 H 1 H
lim ht nt−1 = 0, lim nt nt−1 = 0,
M→∞ M M→∞ M
1 T ∗
rt = y y = ρxt xt∗−1 for very large M, (5)
M t t −1
in which xt xt∗−1 can be mapped back to the information symbol by the encoding rule of [19]. However,
the authors of [12] did not propose a detector for when the number of BS antennas M is finite.
The authors of [13] generalized the detector in [12] as ([13], Equation (6)), which can be applied for any
value of M. After that, they proposed a new two consecutive-symbol detector based on the conditional
probability and a new non-coherent distance as ([13], Equations (9) and (10)). The new distance in [13]
is used for look-up table algorithm in [14,15] for differential encoding.
Particularly, the authors of [14] had already proved that any differential encoding techniques can
be transform equivalently to a differential encoding via a look-up table. Using the algorithm in [14,15]
to create look-up table for encoding and decoding 16-DQAM signal, a brief explanation of a look-up
table is as follows. The readers should refer to [14,15] for the details of the algorithm.
• Sort all possible codeword pair c x and cy in descending order based on their proposed distance.
• Arrange all groups: Take codeword pair one by one from the sorted list L. Put two codewords
of a pair into the same group if possible; Otherwise, put them into two different groups and the
two groups are written into L, which is called the list of near group pairs (near groups means
the distance between members in two groups are small, which means there is a big chance that
these two groups may be wrongly estimated as each other). Two codewords cannot be in the same
group if their first symbols are identical.
• Assign modulated symbol to each group G1 , G2 . . . , G16 based on list L: Take group pairs one by
one in order from list L. The number of different bits assigned for the considered two groups
taken from L should be as small as possible (since two near groups have big change to be wrongly
estimated as each other, this step’s goal is to minimize the bit error).
115
Electronics 2019, 8, 693
Moreover, this look-up table can be optimized by using algorithms proposed in [14,15]. In this
paper, we propose new non-coherent detector, new non-coherent distance and apply the algorithm
in [14,15] to generate the look-up table. We compare the performance of the proposed detector and
distance to the existing detectors and distances proposed in [12,13]. Due to the limitation of length,
we would like to skip the detail of the look-up table algorithm and refer interested readers to [14,15].
where det( Ry ) is the determinant of Ry , and Ry is the covariance matrix of ym , which can be
calculated as
Ry = E (ym − ȳm )(ym − ȳm ) H
√
= E ρ(hm − μh )xx H (hm − μh ) H + ρ(hm − μh )xnm
H
(8)
√
+ ρ( hm − μh ) H nm x H + nm nm
H
= (ρσh2 xx H + IN ).
Since the proposed detector aims to maximize the summation of the conditional probability
of received signal vector ym at all BS antennas 1 ≤ m ≤ M, given transmitted signal vector x,
the estimated signal vector x̂ can be calculated as
M
x̂ = arg max
x∈χ
∑ p ( y m | x ), (9)
m =1
where χ presents the vector space of all possible transmitted signal vectors x. Since the natural
logarithm function is monotonically increasing, maximizing p(ym |x) is equivalent to maximize
ln p(ym |x). Finally, the proposed detector is given as
M
x̂ = arg max ∑ −(ym − ȳm ) H Ry−1 (ym − ȳm ) − ln(det( Ry )) , (10)
x∈χ
m =1
Specially, when the channel is Rayleigh fading, we have ȳm = 0, and the proposed
detector becomes M
x̂ = arg max ∑ −ymH R−y 1 ym − ln(det( Ry )) ,
x∈χ,xt ∈ Q1
(11)
m =1
116
Electronics 2019, 8, 693
where Q1 is the first quadrant. This means the number of decision values that need to be calculated is
L
reduced by a factor of four, from N L to N4 , with N-QAM. The reduction occurs since when ȳm = 0,
for any codeword x = [ xt , xt+1 , ..., xt+ L−1 ] T with the first symbol xt belongs to the first quadrant of
the constellation, there are also three other codewords which have the first symbol xt belong to three
other quadrants, that have the same estimated vector x̂ as x. In other words, we only need to calculate
the decision values of codewords which have the first symbol xt belongs to the first quadrant of the
constellation in case of Rayleigh fading. To calculate the proposed non-coherent distance, we propose
the distance from x1 to x2 and the distance from x2 to x1 as in Equations (12) and (13) based on the
proposed detector, Equation (10), as below:
d(x1 → x2 ) = ||[−ρ(ch x1 ) H R− H −1
x1 ( c h x1 ) − ln(det( R x1 ))] − [− ρ ( c h x1 ) R x2 ( c h x1 ) − ln(det( R x2 ))]||;
1
(12)
d(x2 → x1 ) = ||[−ρ(ch x2 ) H R− H −1
x2 ( c h x2 ) − ln(det( R x2 ))] − [− ρ ( c h x2 ) R x1 ( c h x2 ) − ln(det( R x1 ))]||.
1
(13)
The Equation (12) is the non-coherent distance from x1 to x2 , which is based on the assumption
that we did send the codeword x1 but the detector wrongly estimated that x2 was sent. Inversely,
the Equation (13) is the non-coherent distance from x2 to x1 in which, the x2 was actually sent but
the detector wrongly estimated that x1 was sent. In other words, Equations (12) and (13) can be
used by likelihood estimator as a distance between two codewords x1 and x2 . The larger the values
of Equations (12) and (13) are, the less chance the detector wrongly estimates between x1 and x2 .
Eventually, the proposed non-coherent distance is calculated as
In the differential encoding part, we apply the look-up table algorithm for DQAM as in [14,15] by
using the proposed non-coherent distance in Equation (15).
The main contribution of these above steps and equations are summed up as follow.
• Equation (10) is our proposed detector which is derived based on the conditional probability
of received signal and contains channel statistical information, so that it adapts better with the
change in environment.
• Equation (11) is a simplified detector when the channel is Rayleigh fading.
• Equations (12) and (13) are derived based on Equation (10), and they are used to evaluate the
distance between two signal vectors (a small distance means that it is easy to wrongly decode
between two signal vectors).
• d(x1 → x2 ) is distance from x1 to x2 which is used to evaluate how likely it is to wrongly estimate
x1 as x2 . Similar explanation is applied to d(x2 → x1 ).
• Eventually, we use Equation (15) to take the minimum value between d(x1 → x2 ) and d(x2 → x1 )
as the distance between x1 and x2 .
4. Numerical Results
In simulations, we use the conventional 16-QAM constellation and apply the look-up table
algorithm in [14,15] to differentially encode the information. Particularly, one 4-bit information symbol
is encoded into two consecutive 16-QAM points; thus, the non-coherent distance as in Equation (15)
calculated with the length of transmitted signal vectors x1 , x2 is 2. Finally, the look-up table for
16-DQAM has 16 rows presenting 16 different groups; each group contains 16 different vectors x,
and all transmitted signal vectors in the same group correspond to the same information symbol.
The look-up table for the proposed 16-DQAM scheme with Rician fading channel Kr = 1 and an
117
Electronics 2019, 8, 693
average SNR = −4 dB is given in Table 1 as an example. Notice that, in [14,15] after generating the
look-up table, there is one more step that maps information bit to each group. This step adds a little
more improvement in bit error rate performance since groups with small non-coherent distance are
mapped to information bit symbols with a small difference in the number of bits. However, in this
simulation, we focus on comparing the performance of detector and distance between our proposed
ones and previous ones in [12,13] so that we skip this step and add the information bit symbols
sequentially from first group to last group in the look-up table.
Figure 2 illustrates the simulation results of the proposed 16-DQAM scheme with different lengths
of estimated signal vector L = 2 and 3, as well as the 16-DQAM schemes in [12,13] where the number
of BS antennas are M = 128 and 500, and the coherent length T = 7. Since the authors of [12]
did not show the detector when M is finite, we suppose that the 16-DQAM scheme in [12] uses the
generalized detector as in [13], Equation (6), and the corresponding non-coherent distance in [13].
As previously shown in [12,13], the 16-DQAM scheme of [12] shows an error floor when M is not very
large. The proposed scheme significantly outperforms the schemes in [12,13] for both M = 128 and 500.
Figure 2. Performance comparison between the proposed 16-DQAM scheme and the previous works
of References [12,13] under Rayleigh fading.
118
Electronics 2019, 8, 693
Since the schemes in [12,13] can only detect two consecutive symbols at a time, we simulate our
proposed scheme when the length of signal vector is L = 2. We can clearly see that, with the same
channel condition and signal vector’s length, the proposed scheme outperforms the other schemes
for M = 128 with nearly 3 dB when BER = 10−4 . With not so large number of BS antenna M = 128,
the significant improvement of the proposed scheme shows a huge potential that it can be deployed in
real system. When M increases to 500, the BER performance increases much more further with nearly
5 dB at BER = 10−4 , which also shows the advantages of massive MIMO with very large number of
BS antennas.
Noticeably, the performance of the proposed scheme is improved significantly when the length
of the estimated signal vector L increases, regardless of the value of M. The gain is nearly 1.5 dB at
BER = 10−5 when the length L increases from 2 to 3. When M = 500, the scheme of [13] only performs
better than [12] at low BER (≤ 10−4 ) while the performance of the proposed scheme is remarkably
better than both [12,13]; the gain is approximately 3 dB at BER = 10−5 in comparison with [13].
Figure 3 shows the simulation results for the aforementioned schemes under Rician fading with
T = 7, L = 2, M = 128 and different values of the Rician factor Kr = 0, 1, and 10. Notice that when
Kr = 0, the Rician channel becomes a Rayleigh channel. The error floor still happens when Kr = 1
in the case of [12]. However, when the LOS component of the Rician channel becomes stronger with
Kr = 10, the error floor seems to disappear and the performance of [12] is improved much more
than [13]; the performance gap is nearly 1 dB at BER = 10−5 . The performances of scheme of [13] are
nearly the same with different values of Kr , this is because the detector in [13] cancels out the channel
coefficient between two consecutively received symbols. The performance of the proposed scheme
is the best among three schemes. Even with Kr = 0 (i.e., Rayleigh fading), the performance of the
proposed scheme remains better than the other schemes with Kr = 1 or 10. When Kr is increased from
0 to 1 and 10, the performance of the proposed scheme is improved significantly with gains 5.5 dB
and 9.5 dB, respectively. In summary, we conclude that the proposed scheme shows much better
performance in Rician channels than in Rayleigh channels.
Figure 3. Performance comparison between the proposed 16-DQAM scheme and the previous works
of References [12,13] under Rician fading with different Rician factors.
5. Conclusions
In this paper, we propose a new detector and non-coherent distance for differential QAM
modulation in massive MIMO systems. We also apply the well-known look-up table algorithm
119
Electronics 2019, 8, 693
for DQAM encoding using the proposed non-coherent distance. The proposed detector can detect
multiple symbols (≥ 2) at a time. The proposed scheme varies following the change in channel
information statistics, allowing them to adapt better to the change in environment. Additionally, they
can be applied in a wide class of channels with a not too large number of base station antennas. This
paper focuses on massive MIMO system with single cell. Therefore, as a future work, it will be very
attractive to investigate the performance and how to improve the proposed scheme in the multiple-cell
environment where there is the presence of interference between users in nearby cells.
Author Contributions: All authors discussed the contents of the manuscript and contributed to its presentation.
H.T.D. designed and implemented the proposed scheme, analyzed the simulation results and wrote the paper
under the supervision of S.K.
Funding: This research was funded by the Research Program through the National Research Foundation of Korea
(NRF-2016R1D1A1B03934653, NRF-2019R1A2C1005920).
Conflicts of Interest: The authors declare no conflict of interest.
Abbreviations
The following abbreviations are used in this manuscript:
References
1. Marzetta, T.L. Noncooperative cellular wireless with unlimited numbers of base station antennas. IEEE Trans.
Wirel. Commun. 2010, 9, 3590–3600. [CrossRef]
2. Kim, K.J.; Choi, K.J.; Lee, S.R.; Kim, K.S. Multi-user massive MIMO for next-generation WLAN systems.
IET Electron. Lett. 2015, 51, 792–794. [CrossRef]
3. Lu, L.; Li, G.Y.; Swindlehurst, A.L.; Ashikhmin, A.; Zhang, R. An overivew of massive MIMO: Benefits and
challenges. IEEE J. Sel. Top. Signal Process. 2014, 8, 742–758. [CrossRef]
4. Hassan, N.; Fernando, X. Massive MIMO Wireless Networks: An Overview. Electronics 2017, 6, 63. [CrossRef]
5. Trong Dao, H.; Kim, S. Worst Cell Based Pilot Allocation in Massive MIMO Systems. Electronics 2018, 7, 197.
[CrossRef]
6. Nyarko, J.K.N.; Mbom, C.A. A Performance Study of Massive MIMO Heterogeneous Networks with
Ricean/Rayleigh Fading. Electronics 2018, 7, 79. [CrossRef]
7. Zhu, X.; Dai, L.; Wang, Z.; Wang, X. Weighted-graph-coloring-based pilot decontamination for multicell
massive MIMO systems. IEEE Trans. Veh. Technol. 2017, 66, 2829–2834. [CrossRef]
8. Dao, H.T.; Kim, S. Pilot power allocation for maximising the sum rate in massive MIMO systems.
IET Commun. 2018, 12, 1367–1372. [CrossRef]
9. Ngo, H.Q.; Larsson, E.G. EVD-based channel estimation in multicell multiuser MIMO systems with very
large antenna arrays. In Proceedings of the 2012 IEEE International Conference on Acoustics, Speech and
Signal Processing (ICASSP 2012) , Kyoto, Japan, 25–30 March 2012; pp. 3249–325.
10. Muller, R.R.; Cottatellucci, L.; Vehkapera, M. Blind pilot decontamination. IEEE J. Sel. Top. Signal Process
2014, 8, 773–786. [CrossRef]
11. Manolakos, A.; Chowdhury, M.; Goldsmith, A. Energy-Based Modulation for Noncoherent Massive SIMO
Systems. IEEE Trans. Wirel. Commun. 2016, 15, 773–786. [CrossRef]
12. Kong, D.; Xia, X.; Jiang, T. A Differential QAM Detection in Uplink Massive MIMO Systems. IEEE Trans.
Wirel. Commun. 2016, 15, 6371–6383. [CrossRef]
120
Electronics 2019, 8, 693
13. Wei, R.; Wang, X. Differential 16-QAM and 16-APSK for Uplink Massive MIMO Systems. IEEE Wirel.
Commun. Lett. 2017, 7, 170–173. [CrossRef]
14. Wei, R.-Y. Differential encoding by a look-up table for quadrature amplitude modulation. IEEE Trans. Commun.
2011, 59, 84–94. [CrossRef]
15. Wei, R.-Y.; Chen, L.-T. Further results on differential encoding by a table. IEEE Trans. Commun. 2012, 60,
2580–2590. [CrossRef]
16. Wang, Y.; Tian, Z. Multiple Symbol Differential Detection for Noncoherent Communications With Large-Scale
Antenna Arrays. IEEE Wirel. Commun. Lett. 2018, 7, 190–193. [CrossRef]
17. Angelis, C.T.; Chronopoulos, S.K. System Performance of an LTE MIMO Downlink in Various Fading
Environments. In International Conference on Ambient Media and Systems; Springer: Berlin, Germany, 2011;
pp. 36–43.
18. Chronopoulos, S.K.; Christofilakis, V.; Tatsis, G.; Kostarakis, P. Preliminary BER Study of a TC-OFDM system
operating under noisy conditions. J. Eng. Sci. Technol. Rev. 2016, 9, 13–16. [CrossRef]
19. Egri, R.G.; Horrigan, F.A. A finite group of complex integers and its application to differentially coherent
detection of QAM signals. IEEE Trans. Inf. Theory 1994, 40, 216–219. [CrossRef]
c 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access
article distributed under the terms and conditions of the Creative Commons Attribution
(CC BY) license (http://creativecommons.org/licenses/by/4.0/).
121
electronics
Article
Throughput Enhancement in Downlink MU-MIMO
Using Multiple Dimensions
Jong-Gyu Ha, Jae-Hyun Ro and Hyoung-Kyu Song ∗
Department of Information and Communication Engineering, uT Communication Research Institute,
Sejong University, Gunja-dong 98, Gwangjin-gu, Seoul 05006, Korea
* Correspondence: [email protected]; Tel.: +82-2-3408-3890
Abstract: This paper focuses on the throughput performance enhancement in the single cell multi-user
MIMO (MU-MIMO) downlink system model. For better quality of service, this paper proposes the
scheme that increases system throughput and improves the spectral efficiency. Specifically, the signal
transmission and detection schemes are proposed by using multiple dimensions. At the transmitter
side, two dimensions (power and space) are adopted at the same time. To achieve multiple access
(MA), the space domain is exploited by using a block diagonalization (BD) precoding technique,
and the power domain is exploited to transmit more data symbols. At the receiver, the signal detection
structure corresponding to a transmitter is also proposed. In the simulation results, comparisons
of throughput performance are presented in various aspects. As a result, the proposed scheme
outperforms the conventional schemes using only one dimension in terms of throughput. This paper
shows strong performance in MU-MIMO senarios by adopting multiple dimensions.
Keywords: multi-user MIMO; space division multiple access (SDMA); block diagonalization (BD);
non-orthogonal multiple access (NOMA); broadcast channel
1. Introduction
In the future, in order to handle explosive data traffic, studies will probably aim to increase
channel capacity and data rate in the overall wireless communication system [1]. Multiple-input
multiple-output (MIMO) has been studied in wireless systems since it has dramatic gains in channel
capacity [2]. Also, multiuser MIMO (MU-MIMO) has been studied widely as a potential for improving
the overall throughput [3–5]. In downlink broadcasting (BC) channel, MU-MIMO is accomplished by
multiuser beamforming that eliminates the multiuser interference (MUI) completely [6–9]. A number
of users can be served by one base station (BS) simultaneously and the spectral efficiency can be
increased. Therefore, the use of space-division multiple access (SMDA) in the downlink channel
provides a considerable gain in system capacity. The sum rate of the MU-MIMO broadcast channel
is achieved by dirty paper coding (DPC). However, the critical drawback of DPC is extreme high
complexity to implement in practice [10]. Another promising technique in an MU-MIMO system is
block diagonalization (BD). In this paper, the proposed scheme considers BD as a generalization of the
channel inversion [11,12]. BD supports the multiple data stream with low complexity and approaches
the sum capacity of DPC using user selection algorithms [13].
On the other hand, non-orthogonal multiple access (NOMA) is one of the most promising
techniques for improving the overall spectral efficiency [14,15]. NOMA shares the same resources
with multiple users by exploiting the domains. The well-known NOMA schemes can be divided
into power-domain and code-domain NOMA. Power-domain NOMA multiplexing is achieved by
different allocated power for users according to the channel conditions. Symbols are superposed
to each user and receivers perform successive interference cancellation (SIC) [16]. Furthermore,
1. First, the proposed scheme is written in a different view from the existing MU-MIMO papers.
Recently, many researches in MU-MIMO and NOMA have been mainly focused on increasing
spectral efficiency by using user clustering, power allocation, and beamforming. User clustering
means that close and far users are clustered within each beam and result in intra-beam interference.
Intra-beam interference is a major cause of performance degradation. In existing papers,
the resource reuse approach is highly affected by intra-beam interference and is critical to overall
system performance. However, the proposed scheme does not need to consider intra-beam
interference. The proposed scheme makes a single user in each beam and power domain is
exploited in a single user. In conclusion, power and spatial domains are exploited within a
single user.
2. Second, the proposed scheme improves performance through joint design that uses dimensions
appropriately according to wireless communication systems. In a wireless communication system,
it is important to decide which technique to use according to the system requirements and
characteristics along with cost and complexity constraints. While there are various combinations
of techniques that make up the wireless communication system, it is difficult to get a clear
answer as to the best scheme for a complex multiuser system under a range of typical operating
conditions. This paper proposes a technique that can appropriately bring out the merits of various
combinations of techniques.
3. Finally, the proposed idea can be described from a different perspective and provides insights into
other system implementations. The key idea here is to use multiple dimensions appropriately. The
beauty of this approach is that it can even be applied to other multiple access systems. The various
system models can be implemented by using additional dimensions. This paper demonstrates
that the system throughput can be increased by using multiple dimensions. This concept can
be extended further. If the additional dimensions are used without degradation or with a little
tradeoff, the system throughput can be increased. The idea presented in this paper provides a
multitude of interesting avenues for future research.
The remainder of this paper is organized as follows. Section 2 describes the problems of the
existing scheme and explains the solution and the reasonability of the proposed scheme. Section 3
describes the overall system model and the proposed scheme. Also, this section presents the
performance of the proposed scheme. Section 4 presents simulation results and compares with
the conventional scheme under various conditions. Section 5 considers some extensions about the
implementation of the proposed scheme. Finally, conclusion is provided in Section 6.
2. Motivation
The novelty of the proposed scheme is to use multiple dimensions as much as possible.
Figure 1a,b represent examples of resource usage in NOMA and BD schemes using only one dimension,
respectively. Figure 1c represents example of resource usage for the proposed scheme. In Figure 1a,b
124
Electronics 2019, 8, 758
which represent existing techniques, the two techniques cannot be simply combined since all existing
schemes utilize each technique for multiple access. However, in the proposed scheme, the SC is used to
separate each symbol by utilizing the power domain. In conclusion, it is possible to utilize both power
and space domain at the same time and improve performance by changing usage in SC application.
In addition, through proper combination, the proposed scheme compensates the shortcomings of
existing schemes and has advantages. First, overhead of the BS is reduced. A dynamic user scheduling
and grouping strategy needs the feedback information at the BS. The proposed scheme can reduce the
feedback overhead significantly by not considering intra-beam interference. Second, proper fairness
among users is assured. The interference is treated as noise to users and does not guarantee fairness
among users. In the proposed scheme, a certain error probability is ensured for all users. Third,
the complexity is reduced for each user. In the case of the existing NOMA scheme, it is necessary
to decode even if it is not the user’s own signal. However, in the proposed scheme, all the decoded
symbols become the user’s own data. Therefore, it is possible to reduce the complexity of the user in
demodulating unnecessary data. As a result, the proposed scheme allows more flexibility in spreading
user signals over the multiusers.
(a)
(b)
Figure 1. Cont.
125
Electronics 2019, 8, 758
(c)
Figure 1. Examples of resource usage in various dimensions (a) Example of resource usage for
conventional NOMA; (b) Example of resource usage for conventional BD; (c) Example of resource
usage for proposed scheme.
By adopting the multiple dimensions, the same resources can be shared at each domain. In this
paper, both the power and space domain are exploited. Figure 2 shows an example of additional
dimension usage. Four dimensions (frequency, time, power, space) are used and the same resources
can be appropriately shared. Therefore, the more data symbols can be transmitted to more users.
This paper considers a downlink single-cell MU-MIMO system as shown in Figure 3. In the
proposed scheme, multiple access is accomplished by SDMA and the power domain is exploited to
transmit more data symbols to each user. As a result, the proposed scheme has significant potential
to improve spectral efficiency and provide better wireless services to many users. Also, this paper
offers advantages in various design issues to meet the requirements and characteristics of the system
by using multiple dimensions.
126
Electronics 2019, 8, 758
3. Proposed Scheme
This section describes the proposed scheme from data transmission to signal detection. Using the
proposed system, more data symbols can be transmitted and more users can be served simultaneously.
First, the transmission model explains how two additional domains can be used to improve spectral
efficiency. Second, the signal detection model explains how each signal is detected and reliability can
be satisfied. Additionally, in the received SINR, some considerations for the proposed scheme are
presented. Finally, the sum throughput between the proposed scheme and conventional schemes is
compared. As a result, the throughput performance of the proposed scheme is superior to that of the
conventional scheme.
K
yk = Hk Wk sk + ∑ Hk W j s j +nk , k = 1, · · · , K, (2)
2 34 5 j=1,j =k
desired signal 2 34 5
undesired signals
where k and j are user indices, Wk is Nt × Nr precoding matrix for user k, sk is a Nr × 1 data symbol
vector, xk is a Nt × 1 precoded signal vector for the k-th user. yk is a received signal vector for the
k-th user and nk is Nr × 1 zero-mean additive white Gaussian noise (AWGN) vector with variance σ2 .
In the Equation (2), the first term denotes signal in the intended direction (desired user sk ) and the
second term denotes multi-user interference caused due to undesired signals (undesired users s j )
127
Electronics 2019, 8, 758
where i is a receiving antenna index, N is the number of symbols in one superposed signal. s̃ are
the symbols in one superposed signal. The proposed scheme allocates optional power to symbols to
detect each symbol, leading P1 < P2 < · · · < PN . Then, transmit signals for the k-th user are defined
as follows,
⎡ ⎤ ⎡ √ √ √ ⎤
s1 ( k ) P1 s̃1,1 (k ) + P2 s̃1,2 (k) + · · · + PN s̃1,N (k )
⎢ ⎥ ⎢ √ √ √ ⎥
⎢ s2 ( k ) ⎥ ⎢ P1 s̃2,1 (k ) + P2 s̃2,2 (k) + · · · + PN s̃2,N (k ) ⎥
sk = ⎢
⎢ .. ⎥ =
⎥ ⎢
⎢ .. ⎥.
⎥ (4)
⎣ . ⎦ ⎣ . ⎦
√ √ √
s Nr (k ) P1 s̃ Nr ,1 (k) + P2 s̃ Nr ,2 (k) + · · · + PN s̃ Nr ,N (k )
128
Electronics 2019, 8, 758
For exploiting space domain, the precoding matrix W should be designed in SDMA. In the scheme,
BD beamforming is adopted to suppress the MUI since BD method shows a strong performance in
terms of capacity and has a good flexibility. The objective of the BD method is to completely elminate
the MUI by employing the precoding matrix W. Then, the precoding matrix design for MUI elimination
can be defined as follows,
H̃k Wk = 0, k = 1, · · · , K. (5)
where H̃k is represented as the channel matrix for all users except for user k,
With the help of singular value decomposition (SVD), the precoding matrix for eliminating the
MUI is designed. SVD is used to decompose a matrix into matrices representing rotation and scaling.
By applying the SVD, the H̃k is defined as follows
(1) (0)
H̃k = Uk Λk [Vk Vk ] H , (7)
(0)
where Λk is the diagonal matrix of which the diagonal elements are singular value of H̃k . Vk contains
(1) (0)
vectors of the zero singular values, and contains vectors of the non-zero singular values. Vk is
Vk
an orthogonal basis for the null space of H̃k and the required precoding matrix. As a result, intended
user’s channel is projected on the null space in order to have the transmission under the constraint of
(0)
zero-interference (i.e.,H̃k Ṽk = 0 , k = 1, · · · , K.)
The received signal of the k-th user after eliminating MUI is defined as follows,
yk = Hk W ⎡ k sk + nk ⎤= H⎡e f f ,k sk + n⎤
k
s1 ( k ) n1 ( k )
⎢ ⎥ ⎢ ⎥
⎢ s2 ( k ) ⎥ ⎢ n2 ( k ) ⎥
= He f f ,k ⎢
⎢ .. ⎥+⎢
⎥ ⎢ .. ⎥
⎥
⎣ . ⎦ ⎣ . ⎦
⎡ s Nr (k ) n (k) ⎤ ⎡ ⎤ (8)
√ √ √ Nr
P1 s̃1,1 (k ) + P2 s̃1,2 (k ) + · · · + PN s̃1,N (k) n1 ( k )
⎢ √ √ √ ⎥ ⎢ ⎥
⎢ P1 s̃2,1 (k ) + P2 s̃2,2 (k ) + · · · + PN s̃2,N (k) ⎥ ⎢ n2 ( k ) ⎥
= He f f ,k ⎢
⎢ .. ⎥+⎢
⎥ ⎢ .. ⎥.
⎥
⎣ . ⎦ ⎣ . ⎦
√ √ √
P1 s̃ Nr ,1 (k ) + P2 s̃ Nr ,2 (k) + · · · + PN s̃ Nr ,N (k ) n Nr (k )
where He f f ,k denotes the effective channel of the k-th user. According to the Equation (8), the MUI is
perfectly eliminated and the k-th user receives its own data. Finally, the users can be considered as
point-to-point MIMO.
As a result, by exploiting both the spatial and power domains additionally, more data symbols
can be transmitted in the same resource (frequency/time). In the proposed scheme, as the number of
symbols at each superposed signal is increased, sum throughput at each user is linearly improved.
Although the allocated power to each symbol is reduced, the total throughput is improved since the
number of transmit symbols is increased. Unlike other techniques that use only one dimension, such
as space or power domains, the proposed scheme can achieve significant gains in overall system
throughput by using additional domains.
129
Electronics 2019, 8, 758
(ZF) and minimum mean squared error (MMSE) can be applied simply. If the ZF detection scheme is
used, the filter matrix for the k-th user is as follows,
The detected superposed signals of the k-th user using ZF MIMO detection can be represented
as follows,
In the Equation (10), sk can be detected since it satisfies Gk He f f ,k = I. However, if the linear
detection algorithms are used, the bit error rate (BER) performance is too poor since the power of
symbols in one superposed signal is low. Low performance detection techniques in terms of BER
can cause a negative effect on performing SIC. Therefore, the non-linear detection algorithms can
be applied such as maximum likelihood (ML), ordered successive interference cancellation (OSIC),
decision feedback equalizer (DFE), QRD-M.
From Equation (10), the estimated superposed signals of the k-th user after performing MIMO
detection can be reconstructed as follows,
⎡ ⎤ ⎡ √ √ √ ⎤
ŝ1 (k ) P1ˆ˜s1,1 (k) + P2ˆ˜s1,2 (k) + · · · + PNˆ˜s1,N (k )
⎢ ⎥ ⎢ √ √ √ ⎥
⎢ ŝ2 (k ) ⎥ ⎢ P1ˆ˜s2,1 (k) + P2ˆ˜s2,2 (k) + · · · + PNˆ˜s2,N (k ) ⎥
ŝk = ⎢
⎢ .. ⎥=⎢
⎥ ⎢ .. ⎥,
⎥ (11)
⎣ . ⎦ ⎣ . ⎦
√ √ √
ŝ Nr (k ) P1ˆ˜s Nr ,1 (k) + P2ˆ˜s Nr ,2 (k) + · · · + PNˆ˜s Nr ,N (k )
where ˆ˜s are the symbols at each estimated superposed signal. In Equation (11), the superposed signal
can be decoded by conducting SIC. Therefore, SIC is performed at each receiving antenna. In estimated
√
superposed signal, the strong symbol, i.e., p N s̃ Nr,N is first decoded. The first decoded symbol can be
represented as follows, ⎡ d ⎤
ˆ˜s (k)
1,N
⎢ ˆ˜sd k ⎥
⎢ ( ) ⎥
ˆ˜sdk = ⎢ 2,N. ⎥. (12)
⎢ . ⎥
⎣ . ⎦
ˆ˜sdN ,N (k )
r
Finally, all the symbols in the superposed signal can be decoded by performing SIC. The receiver
model is described in Figure 6.
130
Electronics 2019, 8, 758
where i is received antenna index, λi is a channel gain in one MIMO parallel channel. In this case,
P1 < P2 subject to total power constraint. Therefore, s̃i,1 is a weak symbol and s̃i,2 is a strong symbol.
Then the received SINR for the strong symbol is defined as follows,
λi P2
SINRs̃2 = . (15)
σ2 + λi P1
The weak symbol can be decoded by employing SIC. If it assumed that perfect SIC decoding is
conducted, the received SINR for weak symbol is defined as follows,
λi P1
SINRs̃1 = . (16)
σ2
In the proposed scheme, SINR shows two negative effects in terms of system throughput. First,
the weak symbol acts as an interference for received SINR of the strong symbol. As shown in Equation
(15), λi P2 term is considered as noise. Second, strong symbols is likely be decoded incorrectly.If a strong
symbol is not properly decoded, it can adversely affect the decoding of weak symbol. The negative
effects for the system throughput are restated in the simulation result. Although there are some
degradations in terms of throughput on exploiting the power domain, the system throughput is
increased since the number of transmit symbols is linearly increased at high SNR.
4. Simulation Results
This section shows simulation results to demonstrate the throughput gain of the proposed scheme
and compares the results with other conventional schemes. Simulation results also provide some
considerations for the proposed scheme. The simulations are performed in seven multi-path Rayleigh
fading and time-invariant channel model. OFDM overcomes the frequency selectivity of the wideband
channel and multiple carriers enable the high rate data transmission [22,23]. The OFDM symbol is
composed of 128 FFT size, four pilots and the 108 data subcarriers based on specification for IEEE
802.11n. The remaining 16 subcarriers are zero padding and OFDM symbol duration is 4 microseconds.
131
Electronics 2019, 8, 758
All the simulation are simulated with quadrature phase-shift keying (QPSK) modulation (data bits per
subcarrier is 2).
Figure 7a,b compare the sum throughput between the proposed schemes and the conventional
schemes for the case of two users as a function of SNR. The throughput T is calculated as follows,
T = Nb × (1 − E) L × K ÷ Ts
Nb = Ns × O × Nr × s, (17)
L = Ns × O,
where Nb is the number of transmit data bits and L is the number of data bits in one OFDM symbol. E
is the BER for each user and Ns is the number of data subcarriers. O is the number of data bits per
subcarrier and s is the number of symbols in one superposed signal. Ts is the OFDM symbol duration.
In the simulation, proposed scheme and conventional BD are the MIMO system that BS and each user
have multiple antennas. On the other hand, the conventional NOMA has one antenna at the BS and
each user since multiple access is accomplished by the power domain. The proposed scheme and
BD scheme are simulated in the case of Nt = 4, Nr = 2, K = 2 (4, 2, 2) and NOMA scheme is Nt = 1,
Nr = 1 K = 2 (1, 1, 2). And in the proposed scheme, the number of symbols at each superposed signal
is two ( N = 2). Therefore, the total number of transmit symbols to each user is 8. The power of each
symbol is allocated at a ratio of 8:2 from the total power Pt = 1. In MIMO system, the ML MIMO
detection is applied before conducting SIC. The summary of the simulation parameters is shown in the
Table 1.
Scheme Ns O Nr s K
Conventional BD 108 2 2 1 2
Conventional NOMA 108 2 1 1 2
Proposed Scheme 108 2 2 2 2
In the simulations, there are two types of simulation results: with and without SIC error.
Figure 7a,b show the simulation result with SIC error and Figure 7c,d show the simulation result
without SIC error. Both cases of the proposed scheme outperform the conventional schemes in terms
of maximum throughput since the proposed scheme exploits both the spatial and power dimensions.
The conventional BD shows better performance than the proposed scheme in terms of BER. However,
the proposed scheme has higher throughput since the proposed scheme transmits more symbols.
In Figure 7b, the BER performance of the proposed scheme is better than that of NOMA since the
effect of MIMO detection and SIC decoding is improved at high SNR. Figure 7c,d show the impact of
SIC error on the proposed scheme. Without SIC error, throughput and BER performance are better
than when there is SIC error. If the strong symbol is wrongly decoded at low SNR, the weak signal is
also wrongly decoded. As a result, the error propagation occurs. Therefore, the system needs to be
designed to avoid error propagation and reduce SIC error. As the proposed scheme minimizes the
impact of SIC error, the performance of the proposed scheme approaches the case where there is no
SIC error.
132
Electronics 2019, 8, 758
(a) (b)
(c) (d)
Figure 7. Throughput and BER performance of conventional and proposed schemes: (a) Throughput
performance with SIC error; (b) BER performance with SIC error; (c) Throughput performance without
SIC error; (d) BER performance without SIC error.
Figure 8a, b show the performance difference of the proposed scheme according to the number
of users and antennas ( Nt , Nr , K ). The cases of (6, 3, 2) and (6, 2, 3) have the higher throughput
performance than (4, 2, 2) case since (6, 3, 2) and (6, 2, 3) transmit more symbols by exploiting both
space and power dimension. (6, 2, 3) case has lower throughput performance in low SNR than (6, 3, 2)
case since (6, 3, 2) case allocates more symbols to each user. On the other hand, in case of (6, 2, 3), one
more user can be serviced. In terms of BER, (6, 3, 2) has better BER performance than (6, 2, 3) since
(6, 3, 2) transmits more symbols to each user than (6, 2, 3). (4, 2, 2) has better BER performance than
(6, 2, 3) since more power is allocated to each symbol. However, (6, 2, 3) has higher throughput than
(4, 2, 2) since (6, 2, 3) transmits more symbols.
133
Electronics 2019, 8, 758
(a) (b)
Figure 8. Throughput and BER performance of proposed scheme according to the number of users and
antennas: (a) Throughput performance; (b) BER performance.
Figure 9a, b show the difference of the performance between the cases of linear MIMO detection
and non-linear MIMO detection. The method with ML detection outperforms the method with ZF
detection. The ML technique has better detection performance than the ZF technique before performing
SIC on each antenna. Additionally, the ML detection technique mitigates the error propagation
compared to the ZF detection scheme.
As a result, the overall simulation results show better performance compared to conventional
schemes by using two dimensions simultaneously. In the proposed scheme, by exploiting both power
and space domains at the same time, the transmitted symbol is increased. The results show that the
superiority of the proposed scheme, and the proposed scheme uses the dimensions appropriately.
(a) (b)
Figure 9. Throughput and BER performance of ML detection and ZF detection: (a) Throughput
performance; (b) BER performance.
134
Electronics 2019, 8, 758
5. Implementation Issue
This section presents some additional considerations for system application and limitation under
practical constraints. In addition, this section gives some ideas for additional performance gain and
presents some methods to reduce some negative effects in the proposed scheme.
5.1. Complexity
In the proposed scheme, the additional implementation complexity in typical NOMA is not
needed since SIC is performed for the own user’s data. However, even though the superposed signals
are the user’s own data, the symbols with small allocation power in superposed signal have poor
BER performance. To solve this problem, a detection scheme with better performance should be
used. However, a non-linear algorithm has high complexity. For this problem, a complexity-reduced
detection algorithm can be considered. The main consideration is achieving higher throughput with
lower complexity.
6. Conclusions
This paper suggests multi-dimensionality and a methodology to improve the throughput in an
MU-MIMO system. This paper also presents the transceiver structure of the proposed scheme. As a
result, the multiple dimensions (space and power) are exploited at the same time, and the overasll
system spectral efficiency is improved. If more dimensions are used without degradation or with a
little tradeoff in performance, the system throughput can be increased. Also, various system models
can be implemented by using additional dimensions.
Author Contributions: J.-G.H. proposed throughput enchancement scheme for MU-MIMO downlink channel
and processed the simulation; J.-H.R. analyzed the simulation results and made the figure; H.-K.S. reviewed
135
Electronics 2019, 8, 758
the algorithm and provided the experimental materials for better computational simulations and revised critical
errors of the manuscript
Funding: This research was supported by Institute for Information & communications Technology
Promotion(IITP) grant funded by the Korea government(MSIT) (No.2017-0-00217, Development of
Immersive Signage Based on Variable Transparency and Multiple Layers) and was supported by the
MSIT(Ministry of Science and ICT), Korea, under the ITRC(Information Technology Research Center) support
program(IITP-2018-2018-0-01423) supervised by the IITP(Institute for Information & communications Technology
Promotion).
Conflicts of Interest: The authors declare no conflict of interest.
References
1. Gupta, A.; Jha, R.K. A survey of 5G network: Architecture and emerging technologies. IEEE Access 2015, 3,
1206–1232. [CrossRef]
2. Goldsmith, A. Capacity Limits of MIMO Channels. IEEE JSAC 2003, 5, 684–702. [CrossRef]
3. Weingarten, H.; Steinberg, Y.; Shamai, S. The Capacity Region of the Gaussian MIMO Broadcast Channel.
IEEE Trans. Inf. Theory 2006, 52, 3936–3964. [CrossRef]
4. Caire, G.; Shamai, S. On the achievable throughput of a multi-antenna Gaussian broadcast channel. IEEE
Trans. Inf. Theory 2003, 43, 1691–1706. [CrossRef]
5. Angelis, C.T.; Chronopoulos, S.K. System Performance of an LTE MIMO Downlink in Various Fading
Environments. In Proceedings of the Ambient Media and Systems, Berlin, Germany, 27–28 January 2011;
pp. 36–43.
6. Choi, L.U.; Murch, R.D. A transmit preprocessing technique for multiuser MIMO systems using a
decomposition approach. IEEE Trans. Wirel. Commun. 2004, 3, 20–24. [CrossRef]
7. Spencer, Q.H.; Swindlehurst, A.L.; Haardt, M. Zero-forcing methods for downlink spatial multiplexing in
multiuser MIMO channels. IEEE Trans. Signal Process. 2004, 2, 461–471. [CrossRef]
8. Viswanathan, H.; Venkatesan, S.; Huang, H. Downlink capacity evaluation of cellular networks with
known-interference cancellation. IEEE J. Sel. Areas Commun. 2003, 21, 802–811. [CrossRef]
9. Votis, C.; Christofilakis, V.; Raptis, V.; Tatsis, G.; Chronopoulos, S.K.; Kostarakis, P. Design and analysis of
a multiple-output transmitter based on DDS architecture for modern wireless communications. AIP Conf.
Proc. 2010 1203, 421–426.
10. Jindal, N.; Goldsmith, A. Dirty-paper coding versus TDMA for MIMO broadcast channels. IEEE Trans. Inf.
Theory 2005, 51, 1783–1794. [CrossRef]
11. Rim, M. Multi-user downlink beamforming with multiple transmit and receive antennas. Electron. Lett.
2002, 38, 1725–1726. [CrossRef]
12. Chen, R.; Andrews, J.G.; Heath, R.W. Multiuser Space-Time Block Coded MIMO System with Downlink
Precoding. In Proceedings of the 2004 IEEE International Conference on Communications, Paris, France,
20–24 June 2004
13. Shen, Z.; Chen, R.; Andrews, J.G.; Heath, R.W.; Evans, B.L. Low complexity user selection algorithms
for multiuser MIMO systems with block diagonalization. IEEE Trans. Signal Process. 2006, 54, 3658–3663.
[CrossRef]
14. Saito, Y.; Kishiyama, Y.; Benjebbour, A.; Nakamura, T.; Li, A.; Higuchi, K. Non-orthogonal multiple access
(NOMA) for cellular future radio access. In Proceedings of the 2013 IEEE 77th Vehicular Technology
Conference, Dresden, Germany, 2–5 June 2013
15. Ali, M.S.; Hossain, E.; Kim, D. Non-orthogonal multiple access (NOMA) for downlink multiuser MIMO
systems: User clustering, beamforming, and power allocation. IEEE Access 2017, 5, 565–577. [CrossRef]
16. Islam, S.M.R.; Avazov, N.; Dobre, O.A.; Kwak, K.-S. Power-domain non-orthogonal multiple access (NOMA)
in 5G systems: Potentials and challenges. IEEE Commun. Surv. Tutor. 2016, 19, 721–742. [CrossRef]
17. Zeng, M.; Yadav, A.; Dobre, O.A.; Tsiropoulos, G.I.; Poor, H.V. On the sum rate of MIMO-NOMA and
MIMO-OMA systems. IEEE Wirel. Commun. Lett 2017, 6, 534–537. [CrossRef]
18. Ding, Z.; Adachi, F.; Poor, H.V. The application of MIMO to nonorthogonal multiple access. IEEE Trans.
Wirel. Commun. 2016, 15, 537–552. [CrossRef]
19. Ding, Z.; Schober, R.; Poor, H.V. A general MIMO framework for NOMA downlink and uplink transmission
based on signal alignment. IEEE Trans. Wirel. Commun. 2016, 15, 4438–4454. [CrossRef]
136
Electronics 2019, 8, 758
20. Choi, Y.I.; Lee, J.; Rim, M.; Kang, C.G. On the performance of beam division non orthogonal multiple access
for FDD-based large-scale multi-user MIMO systems. IEEE Trans. Wirel. Commun. 2017, 15, 5077–5089.
[CrossRef]
21. Yang, Q.; Wang, H.-M.; Ng, D.W.K.; Lee, M.H. NOMA in downlink SDMA with limited feddback:
Performance analysis and optimization. IEEE J. Sel. Areas Commun. 2017, 35, 2281–2294. [CrossRef]
22. Van Nee, R.; Prasad, R. OFDM for Wireless Multimedia Communications; Artech House, Inc.: Norwood, MA,
USA, 2000
23. Chronopoulos, S.K.; Votis, C.; Raptis, V.; Tatsis, G.; Kostarakis, P. In depth analysis of noise effects in
orthogonal frequency division multiplexing systems, utilising a large number of subcarriers. AIP Conf. Proc.
2010, 1203, 967–972.
c 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access
article distributed under the terms and conditions of the Creative Commons Attribution
(CC BY) license (http://creativecommons.org/licenses/by/4.0/).
137
electronics
Article
LOS-Based Equal Gain Transmission and Combining
in General Frequency-Selective Ricean Massive
MIMO Channels
Qiuna Yan 1,2 , Yu Sun 1 and Dian-Wu Yue 1, *
1 School of Information Science and Technology, Dalian Maritime University, Dalian 116026, China;
[email protected] (Q.Y.); [email protected] (Y.S.)
2 State Key Laboratory of Integrated Services Networks, Xidian University, Xi’an 710071, China
* Correspondence: [email protected]; Tel.: +86-411-8472-4261
Keywords: massive MIMO; beamforming; line-of-sight; Ricean fading; frequency-selective; power scaling
1. Introduction
Recently, massive multiple-input multiple-output (MIMO) has attracted great interest in both
academia and industry, and has been a promising solution to meet the demanding spectral efficiency
requirement of 5G systems [1]. Its promising benefits includes significant increase of both spectral
and energy efficiencies [2–4]. Interestingly, the two benefits of massive MIMO can be achieved
by maximum-ration transmission/maximum-ration combining (MRT/MRC) or zero-forcing (ZF)
precoding/detection [3,5].
With MRT/MRC and ZF linear processing, many scholars have given various asymptotic
performance analyses. In particular, the power scaling law in the limit of the large number of antennas
has been widely studied in order to quantify the power savings. For Rayleigh and Ricean fading
environments, with MRC and ZF detectors, authors in [3,6] analyzed uplink massive MIMO system
performance. If perfect channel state information (CSI) is available, they showed that, when the
number of base station (BS) antennas grows large and the transmit power of each user is scaled down
proportionally to it, the ergodic achievable rate can asymptotically be equal to a positive constant.
In order to obtain the needed CSI, channel estimation must be obviously carried on [7]. However,
the channel estimation will result in not only heavy overhead but also pilot contamination in multi-cells,
which will become a serious problem [2,8]. For a point to point massive MIMO system in Ricean
fading, to reduce the heavy overhead to estimate the CSI, we investigated a scheme with equal gain
transmission /equal gain combining (EGT/EGC), which is only based on the line-of-sight (LOS)
component (or say specular component) and has low hardware complexity [9]. It was showed that,
with this scheme, the ergodic achievable rate can converge to that of the corresponding MRT/MRC
based on the perfect CSI as the two numbers of antennas at the transmitter and receiver go to infinity.
After that, we further considered the novel linear processing scheme for a downlink or uplink multiuser
massive MIMO system [10,11] and showed that each user in the downlink or uplink system can
have asymptotically the same rate as in the single-user case when the number of BS antennas goes
without bound.
It should be pointed out that the above-mentioned results with the novel scheme have only
considered uncorrelated Ricean frequency-flat fading channels without a relay [12]. Recently, we tried
to develop our analysis to frequency-selective Ricean fading channels [13], but only for a very simple
and special scenario [14]. The EGT/EGC linear transmission scheme is very attractive for massive
MIMO systems since it enables low-complexity and inexpensive hardware [15–18]. Motivated by
these facts, in this paper, we make use of a comparatively complicated and general frequency-selective
Ricean fading channel model [14,19,20] to investigate further the LOS-based EGT/EGC scheme.
We firstly derive several interesting power scaling properties for broadband massive MIMO systems
with and without spatial correlation. In particular, it is shown that the ergodic achievable rate of
LOS-based EGT/EGC scheme can have the same asymptotic value as the ergodic achievable rate of
the whole CSI-based MRT/MRC scheme if the two numbers of transmit and receive antennas go
without bound and with a fixed ratio. Then, we extend our asymptotical performance analysis to the
cooperative relaying scenarios with decode-and-forward (DF) protocol and with amplify-and-forward
(AF) protocol, respectively, and obtain two novel power scaling laws for the two scenarios. In particular,
it is also shown that the ergodic achievable rate of LOS-based scheme can have the same asymptotic
value as the ergodic achievable rate of the CSI-based scheme if the number of source antennas and
the two numbers of transmit and receive relay antennas go without bound and with two fixed
ratios, respectively.
The manuscript is organized as follows: in Section 2, the system model is introduced. In Section 3,
the proposed LOS-based transmission scheme is presented and its power scaling law without
correlation and with correlation is derived, respectively. Extension of our analysis to a cooperative
relaying system is given in Section 4. In Section 5, the analysis results are verified by simulation.
Finally, in Section 6, some concluding remarks are given.
Notation: boldface lower and upper case letters denote column vectors and matrices, respectively.
The superscripts (·)† and (·)T stand for conjugate-transpose and transpose operations, respectively.
The expectation operator is denoted by E{·}. α ∼ CN(0, δ2 ) stands for a circularly symmetric complex
Gaussian variable α which has zero mean and variance δ2 .
2. System Model
Since a set of parallel independent frequency flat MIMO channels can be used to describe a
frequency selective MIMO channel, we start with introducing the frequency flat channels [14,19].
For a point-to-point MIMO system over frequency-flat channels, we assume that it has N transmit
antennas and M receive antennas. Then, we can represent a M × 1 received signal vector as
√
y= pH0 x + z, (1)
where z denotes the additive white Gaussian noise (AWGN) vector that has zero-mean and covariance
matrix σ2 I M with I M being the M × M identity matrix, x denotes the transmitted signal vector,
M,N
H0 = [ hmn ]m,n =1 stands for the M × N channel matrix whose element hmn denotes the channel gain
between the m-th antenna at the receiver and the n-th antenna at the transmitter, and p is the average
transmitted power. The channel matrix H0 under Ricean fading consists of a LOS matrix and a scattered
matrix, i.e.,
140
Electronics 2019, 8, 79
√ √
H0 = κ̄0 H0 + 6 0,
κ̃0 H (2)
Here, r0 denotes the specular array response at the receiver and can be expressed as
where θ is the angle of arrival of the LOS component and dr is the antenna spacing normalized by
wavelength at the receiver. Similarly, t0 denotes the specular array response at the transmitter and can
be given by
t0 = [1, e j2πdt sin(φ) , . . . , e j2π ( N −1)dt sin(φ) ]T (5)
where φ is the angle of departure of the LOS component and dt is the antenna spacing normalized by
6 0 , [H
wavelength at the transmitter. The entries in the scattering matrix H 6 0 ]mn ∼ CN(0, 1), i.e., they are
circular complex Gaussian random variables with zero mean and unit variance. Furthermore,
we assume that they are independent and identically distributed (i.i.d).
Now, we are concerned with a broadband orthogonal frequency-division multiplexing
(OFDM)-MIMO system with K subcarriers, where ideal OFDM transmission with proper cyclic prefix
extension is assumed. For the k-th subcarrier, the input–output relationship is expressed as
√
y= pHx + z, (6)
where x is just the normalized signal vector, z is the AWGN vector, and H is the channel matrix.
The channel matrix can be given by as
L −1
k
H= ∑ ρ H exp(− j2π K ), (7)
=0
L −1 2
where L represents the channel delay spread, {ρ2 } is the power delay profile satisfying ∑= 0 ρ = 1,
and H stands for the channel matrix at time delay . Furthermore, H , = 0, 1, · · · , L − 1 are mutually
uncorrelated, Ricean distributed, and can be expressed as in Label (2)
√ -
H = κ̄ H + 6 .
κ̃ H (8)
L −1 -
6 =
H ∑ ρ 6 exp(− j2π k ).
κ̃ H (9)
=0
K
141
Electronics 2019, 8, 79
L −1 √ k
H= ∑ ρ κ̄ H exp(− j2π )
K
(10)
=0
can be available. In what follows, by employing only H, we will present a linear processing scheme
with EGT/EGC and then compare it with the MRT/MRC scheme based on the perfect CSI.
6 is not available, a couple of the normalized weighting vectors wt and wr can be chosen in
Since H
such a way that the effective output signal-to-noise ratio (SNR) can become maximum. The largest
† †
eigenvalue of matrix H H is now denoted by λmax (H H) . Due to the fact that wr† zk ∼ CN(0, σ2 ),
the effective output signal-to-interference-plus-noise ratio (SINR) can be described as
†
(k) p|wr† HwTt † |2 pλmax (H H)
γS = = . (11)
6 T† | 2 + σ 2
p|wr† Hw p | w † 6 T† 2
r Hwt | + σ
2
t
K −1 K −1
1 (k) 1 (k)
RS = E{
K ∑ log2 (1 + γS )} =
K ∑ RS , (12)
k =0 k =0
(k) (k)
where RS = E{log2 (1 + γS )}. We have the following results through the derivation.
(k) L −1 2
†
Lemma 1. Define λmax = λmax (H H) and κ̃S = ∑= 0 ρ κ̃ . Then,
(k) (k)
pλmax (k) pλ
log2 (1 + ) ≤ RS ≤ log2 (1 + max ). (13)
pκ̃S + σ2 σ2
Proof of Lemma 1. Regarding the ergodic achievable rate of the k-th subcarrier, it is easy for us to
derive its following lower bound with the help of the well-known Jensen’s inequality:
(k)
(k) 1 pλmax
RS ≥ log2 (1 + (k)
) = log2 (1 + L −1 2
), (14)
p ∑= 0 ρ E + σ
2
E(1/γS )
(k)
pλmax
k
RC ≥ log2 (1 + ). (15)
pκ̃S + σ2
(k) (k)
(k) pλmax pλmax
RS = E log2 (1 + ) ≤ log2 (1 + ). (16)
† 6
p|wr HwTt † |2 + σ2 σ2
L −1 2
Lemma 2. Let κ̄U = ∑= 0 ρ κ̄ . and κ̄ L = max{ ρ κ̄ , 0 ≤ ≤ L − 1}. Then,
2
(k)
λmax
κ̄ L ≤ lim ≤ κ̄U . (17)
MN →∞ MN
Proof of Lemma 2. For 0 ≤ ≤ L − 1 and 0 ≤ b ≤ L − 1, we define ϕb = 2πdr (sin(θb ) − sin(θ )),
and then have that
142
Electronics 2019, 8, 79
H̄† H̄b
= b Ub , (18)
MN
where
r† rb 1, for l = b;
b = = 1−e jMϕb (19)
M , for = b
M (1−e jϕb )
and
tT † tTb N,N
Ub = = [u an ] a,n =1 (20)
N
with u an = N1 e2πdt (( a−1) sin(θb )−(n−1) sin(θ )) . Now, suppose that M ≥ N. Noting that lim M→∞ b = 0
for = b and tr(U ) = 1, we can obtain that
† L −1 †
λmax (H H) H H
lim
MN →∞ MN
≤ ∑ ρ2 κ̄ MNlim
→∞
λmax (
MN
) = κ̄U . (21)
=0
λmax (H H)
† 6 T† | 2
|wr† Hw
lim ≥ lim t
= κ̄ . (23)
MN →∞ MN MN →∞ MN
Therefore, Lemma 2 holds when M ≥ N. When N ≥ M, we can also similarly prove that Lemma 2
† †
holds, based on the fact that λmax (H H) = λmax (HH ).
Eκ̄ L Eκ̄U
log2 (1 + )≤ lim RS ≤ log2 (1 + ). (24)
σ2 MN →∞ σ2
Remark 1. This proposition gives the lower and upper bounds of the ergodic achievable rate of the LOS-based
scheme. In the following special cases, we can obtain further the exact expressions of the ergodic achievable rate.
Eκ̄U
lim RS = log2 (1 + ). (25)
M→∞ σ2
Eκ̄U
lim RS = log2 (1 + ). (26)
N →∞ σ2
†
λmax (H H)
Proof of Corollary 1. When M = 1 or N = 1, due to the fact that lim MN →∞ MN = κ̄U , it easily
follows that lim MN →∞ RS = log2 (1 + Eσκ̄2U ).
143
Electronics 2019, 8, 79
Eκ̄ L
lim RS = log2 (1 + ). (27)
M,N →∞ σ2
L −1 √ k
H= ∑ ρ κ̄ r tT exp(− j2π ), (28)
=0
K
√ √ √
where ρ0 κ̄0 ≥ ρ1 κ̄1 ≥ · · · ≥ ρ L−1 κ̄ L−1 . We can rewrite H in a matrix form as
H = Ar DATt , (29)
√
where D is a L × L diagonal matrix, [D]ll = ρ MN κ̄ , and Ar and At are defined as follows:
1
A r = √ [ r0 , r1 , . . . , r L −1 ] (30)
M
and
1 k k
At = √ [t0 , t1 exp(− j2π ), . . . , t L−1 exp(− j2π ( L − 1))]. (31)
N K K
Since both {r0 , r1 , . . . , r L−1 } and {t0 , t1 , . . . , t L−1 } are orthogonal vector sets when M → ∞ and
N → ∞ [21], Ar and At are asymptotically unitary matrices. For matrix H, thus we can form a singular
value decomposition (SVD) as follows
T
H = UΣV† = [Ar |Ar⊥ ]Σ[At |A⊥
t ] , (32)
where Σ is a diagonal matrix including all singular values on its diagonal, i.e.,
√
ρ MN κ̄ , for 0 ≤ l ≤ L − 1,
[Σ]ll = (33)
0, for l > L − 1.
Then,
(k)
λmax
lim = ρ20 κ̄0 = κ̄ L . (34)
M,N →∞ MN
Thus, we finally obtain the desired result.
On the other hand, suppose that the perfect CSI is known, i.e., both of the LOS and scattered
components are available at the transmitter and the receiver. Then, the weighting vectors wt and wr
should be chosen in such a way that the exact output SNR is maximized. Thus, the resulting output
SNR can be written as [11]
(k) p
γP = 2 λmax (H† H), (35)
σ
where λmax (H† H) stands for the largest eigenvalue of H† H. For the MRT/MRC scheme based on
(k)
the perfect CSI, let R P represent its ergodic achievable rate, i.e., R P = E{ K1 ∑kK=−01 log2 (1 + γP )}.
Now, we obtain the following power scaling law.
Proposition 2. When M → ∞ and N → ∞, suppose that E = MN p is fixed and N/M → μ . We have that
144
Electronics 2019, 8, 79
6 we can have
Proof of Proposition 2. Due to the fact H = H + H,
1 1 6 )† (H + H6 )]
[H† H] = [(H + H
M M
† †6 6 †H H 6 †H
6
H H H H H
= + + + . (37)
M M M M
If we let
6
†
H H N,N
G= = [ guv ]u,v =1 , (38)
M
it follows that
M
1 † 6 ]kv .
guv =
M ∑ [H ]uk [H (39)
k =1
† 6 ]kv ∼ CN(0, δ2 ) with
With the aid of (9) and (10), we can have that |[H ]uk |2 ≤ κ̄S ≤ 1, and [H
δ2 6
= κ̃S ≤ 1. As [H]kv , 1 ≤ k ≤ M are independent each other, we know that
1
guv ∼ CN(0, σuv
2
), σuv
2
≤ . (40)
M
Thus, it can follow that, when M → ∞, G → Q, where Q denotes a matrix with all zero elements.
Similarly, we can have that, if M → ∞,
6 †H
H
G† = → Q. (41)
M
Now, M is assumed to be large enough. Then, we certainly have that
H† H
†
H H H6 †H
6
λmax ( ) = λmax ( + )
M M M
†
H H 6 †H
H 6
≤ λmax ( ) + λmax ( ). (42)
M M
When M → ∞, suppose that N/M → μ. Then, we easily derive from ([22], Theorem 2.37),
6 ]mn ∼ CN(0, κ̃S )
only noting that [ H
1 6† 6 √
λmax ( H H) → κ̃S (1 + μ)2 . (43)
M
Thus, we further get
1 1 † √
λmax ( H† H) ≤ λmax ( H H) + κ̃S (1 + μ)2 /N. (44)
MN MN
In addition, we can obtain
1 1 †
λmax ( H† H) ≥ λmax ( H H ). (45)
MN MN
When M → ∞ and N → ∞, we can get, by combining (44) with (45),
(k)
1 λmax
λmax ( H† H) → lim . (46)
MN M,N →∞ MN
145
Electronics 2019, 8, 79
K
1 p
RP =
K ∑ E log2 (1 + σ2 λmax (H† (k)H(k)))
k =1
1 K
pMN H† (k )H(k )
=
K ∑ E log2 (1 + σ 2
λmax (
MN
)). (47)
k =1
Remark 2. This proposition implies that, when the two numbers of antennas at the transmitter and the receiver
grow large with a fixed ratio, the ergodic achievable rate of the LOS-based scheme has the same asymptotic value
as the ergodic achievable rate of the whole CSI-based scheme.
H 6 ω [Ψ ]1/2 ,
6 = [Φ ]1/2 H (49)
where H6 ω is an i.i.d. matrix with each entry ∼ CN(0, 1). Since the scattered component of the channel
matrix remains unchanged, the needed weighting vectors wt and wr should also remain unchanged.
The ergodic achievable rate of the scenario with spatial correlation is denoted by
K −1
1 (k) (k) (k)
RC =
K ∑ RC , RC = E{log2 (1 + γC )}. (50)
k =0
Lemma 3.
(k) (k)
pλmax (k) pλ
log2 (1 + ) ≤ RC ≤ log2 (1 + max ), (51)
pκ̃C + σ2 σ2
L −1 2 1 1
T† 2
where κ̃C = ∑= 0 ρ κ̃ wr [ Φ ] [ Ψ ] wt .
† 2 2 2
Remark 3. This proposition implies that the two ergodic achievable rates with and without spatial correlation
have the same asymptotic value when MN goes without bound.
146
Electronics 2019, 8, 79
Proposition 4. When M → ∞ and N1 → ∞, let E1 = MN1 p1 be fixed and N1 /M → μ1 for the source-relay
link. When M → ∞ and N2 → ∞, let E2 = MN2 p2 be fixed and N2 /M → μ2 for the relay-destination
link. Then,
lim RP = lim RS . (53)
M,N1 ,N2 →∞ M,N1 ,N2 →∞
Proof of Proposition 4. From [24], it follows that the ergodic achievable rate with the perfect SCI-based
MRT/MRC is written as
(1) (2)
R P = min{ R P /2, R P /2}, (54)
(1) (2)
where R P and R P are the corresponding ergodic achievable rates of the source-relay and
relay-destination transmission links, respectively. Similarly, we also have that the ergodic achievable
rate with the only LOS-based EGT/EGC can be given by
(1) (2)
RS = min{ RS /2, RS /2}, (55)
(1) (2)
where RS and RS are the corresponding ergodic achievable rates of the source-relay and
relay-destination transmission links, respectively. Under the condition of Proposition 4, we get
by Proposition 2
(1) (1)
lim RP = lim RS (56)
M,N1 →∞ M,N1 →∞
and
(2) (2)
lim RP = lim RS . (57)
M,N2 →∞ M,N2 →∞
Remark 4. This proposition implies that, when the number of source antennas and the two numbers of relay
antennas at the transmitter and the receiver grow large with fixed ratios, the ergodic achievable rate of the
LOS-based scheme also has the same asymptotic value as the ergodic achievable rate of the whole CSI-based scheme.
147
Electronics 2019, 8, 79
Proposition 5. Suppose that N1 = N2 = 1. When M → ∞, let E1 = Mp1 and E2 = MN2 p2 be fixed. Then,
E1 κ̄U1 E2 κ̄U2 E κ̄ E κ̄
lim RS = log2 (1 + ( · )/( 1 2U1 + 2 2U2 + 1)). (58)
M→∞ σ12 σ22 σ1 σ2
(k) (k)
Proof of Proposition 5. We denote by γS1 and γS2 the output instantaneous SNR of the source-relay
and relay-destination links, respectively. From [25], we obtain that
K −1
1 (k)
RS =
K ∑ E{log2 (1 + γS )}
k =0
K −1
1 (k) (k) (k) (k)
=
K ∑ E{log2 (1 + (γS1 · γS2 )/(γS1 + γS2 + 1))}. (59)
k =0
Based on the proof of Lemma 2, we can have the following asymptotical SNR expressions
(k) E1 κ̄U1
lim γS1 = (60)
M→∞ σ12
and
(k) E1 κ̄U2
lim γS2 = (61)
M→∞ σ22
Thus, the power scaling law (58) holds.
5. Simulation Results
For OFDM-MIMO systems in frequency-selective Ricean fading channels, we in this section
provide our analytical results and simulation results. In all simulations, we assume that all of these
spacings between adjacent antennas at the transmitter and the receiver are 0.5. We set the number of
subcarriers K = 256, the channel delay spread L = 3, and the noise variance as σ2 = 1. In addition,
we let ρ20 = ρ21 = ρ22 = 1/3, θ0 = φ0 = π/6, θ1 = φ1 = π/4, and θ2 = φ2 = π/3. In Figures 1 and 2,
the Ricean K-factor κ is fixed and is equal to 5 dB.
In order to verify Propositions 1 and 3, we consider firstly the scenario with spatial correlation
when N = 3 and E = 20 dB. The spatial correlation among antennas is assumed to follow the
exponential model, i.e., the correlation magnitude between antenna p and q can be determined by
c( p, q) = g| p−q| , where g denotes the correlation coefficient [12]. Therefore, we represent the correlation
matrices (i, j)-th of Φ by [Φ ]ij = ( gr )|i− j| and (i, j)-th of Ψ by [Ψ ]ij = ( gt )|i− j| , respectively,
= 0, 1, 2. Moreover, we set gr0 = gr1 = gr2 = gr and gt0 = gt1 = gt2 = gt . For the correlation coefficients
gt = gr = g = 0, 0.3, 0.6, 0.9, as M increases from 6 to 60, Figure 1 provides a curve of the exact average
rate RC and two curves of the upper and lower bounds of RS . It can be observed that the exact ergodic
rate RC increases as the number of receive antenna M grows large, and is always between the two
bounds of RS . As both of the correlation coefficients ( gt , gr ) increase, RC is closer to the upper bound,
and becomes higher and higher than RS . This indicates that, compared to the uncorrelated scenario,
the presence of spatial correlation results in improving the rate performance under the LOS-based
EGT/EGC scheme. Therefore, if the LOS-based scheme can be employed, we can achieve performance
benefits from the spatial correlation, which is obviously different from the traditional point of view.
This implies that it would be practical if a large-scale antenna array is compactly arranged.
148
Electronics 2019, 8, 79
6.5
5.5
Average Rate (Bits/s/Hz)
4.5
4
gt=0 gr=0
3.5
gt=0.3 gr=0.3
3 g =0.6 g =0.6
t r
2.5 gt=0.9 gr=0.9
Upper Bound
2
Lower Bound
1.5
10 20 30 40 50 60
Number of Receive Antennas
Figure 1. The ergodic achievable rate versus the number of receive antennas for comparing the case
with correlation and the case without correlation.
10
E=30dB
Average Rate (Bits/s/Hz)
4
E=20dB
E=10dB
0
CSI -based BF
LOS-based BF
-2
10 15 20 25 30 35 40 45 50 55 60
Number of Receive Antennas
Figure 2. The ergodic achievable rate versus the number of receive antennas for comparing the
LOS-based scheme with the whole CSI-based scheme.
149
Electronics 2019, 8, 79
Next, we consider validating Proposition 2. For that, we need to compare the ergodic achievable
rate of the proposed LOS-based EGT/EGC scheme with that of the perfect CSI-based MRT/MRC
scheme. We set μ = 1/2 when the numbers of antennas at the transmitter and receiver grow large.
For the parameter E = 10, 20, 30 dB, as M increases from 6 to 60, Figure 2 plots the two ergodic
achievable rates, RS and R P . It can be found from Figure 2 that both of the ergodic achievable rates can
tend to the same limit results for the given values of E. However, with an increase of E, the speed of
rate convergence appears to be slower and slower.
Finally, we pay our attention to the classical DF cooperative relay system consisting of the
source-relay and relay-destination links and set the identical parameters mentioned above in the two
links. When N1 = N2 = 6, as M increases from 6 to 60, Figure 3 plots the two average rates RS
and R P for κ = 5, 15 dB. It can be found from Figure 3 that, with an increasing κ, both RS and R P
improve and RS is closer to R P . It should be noticed that R P denotes the average rate for the traditional
linear processing scheme based on the whole CSI as considered in [6]. For obtaining a comprehensive
comparison with the scheme based on the whole CSI in Rayleigh fading discussed in [24], Figure 3 also
includes a rate curve which corresponds to κ = −∞ dB. Interestingly, with κ = 5 dB, the LOS-based
scheme always obviously outperforms the scheme based on the whole CSI in Rayleigh fading.
1.6
1.5
1.4
Average Rate (Bits/s/Hz)
1.3
1.2
1.1
0.9
CSI: =15 dB
0.8 LOS: =15 dB
CSI: =5 dB
0.7 LOS: =5 dB
CSI: =- dB
0.6
10 15 20 25 30 35 40 45 50 55 60
Number of Receive Antennas
Figure 3. The ergodic achievable rate versus the number of relay antennas for comparing the case with
Ricean fading and the case with Rayleigh fading.
6. Conclusions
In this paper, we have developed the transmission scheme of LOS-based EGT/EGC for
point-to-point massive-MIMO systems in frequency-selective Ricean fading channels without and
with spatial correlation. In particular, we have derived expressions of the system achievable rate and
determined several power scaling laws. In addition, we have also generalized our analysis to the
150
Electronics 2019, 8, 79
cooperative relaying scenarios with DF and AF protocols, respectively. It is shown by our simulation
results that the spatial correlation can improve the system performance and thus is an advantage,
which is contrary to the traditional point of view. Compared to the Rayleigh fading environments,
deployment of large scale antenna arrays in Ricean fading environments would be more suitable.
For instance, massive MIMO can be applied in microwave backhaul links [26].
Author Contributions: Formal Analysis, Q.Y.; Validation, Y.S.; Writing—Original Draft Preparation, Q.Y., Y.S.;
Writing—Review and Editing, D.-W.Y.; Supervision, D.-W.Y.
Funding: This research was funded by the open research fund of State Key Laboratory of Integrated Services
Networks, the Fundamental Research Funds for the Central Universities Grant No. 3132016347, and the Natural
Science Foundation of Liaoning Province Grant No. 201602086.
Conflicts of Interest: The authors declare no conflict of interest.
References
1. Shafi, M.; Molisch, A.F.; Smith, P.J.; Haustein, T.; Zhu, P.; Silva, P.D.; Tufvesson, F.; Benjebbour, A.; Wunder, G.
5G: A tutorial overview of standards, trials, challenges, deployment, and practice. IEEE J. Sel. Areas Commun.
2017, 35, 1201–1221. [CrossRef]
2. Marzetta, T.L. Noncooperative cellular wireless with unlimited numbers of base station antennas. IEEE Trans.
Wirel. Commun. 2010, 9, 3590–3600. [CrossRef]
3. Ngo, H.Q.; Larsson, E.G.; Marzetta, T.L. Energy and spectral efficiency of very large multiuser MIMO
systems. IEEE Trans. Commun. 2013, 61, 1436–1449.
4. Liu, W.; Han, S.; Yang, C. Energy efficiency scaling law of massive MIMO systems. IEEE Trans. Commun.
2017, 65, 107–121. [CrossRef]
5. Yang, H.; Marzetta, T.L. Performance of conjugate and zero-forcing beamforming in large-scale antenna
systems. IEEE J. Sel. Areas Commun. 2013, 31, 172–179. [CrossRef]
6. Zhang, Q.; Jin, S.; Wong, K.-K.; Zhu, H.; Matthaiou, M. Power scaling of uplink massive MIMO systems with
arbitrary-rank channel means. IEEE J. Sel. Top. Signal Process. 2014, 62, 966–981. [CrossRef]
7. Rana, M.M.; Kim, J. Fundamentals of Channel Estimations for Mobile Communications-Existing and New
Techniques of a LTE SC-FDMA System; LAMBERT Academic Publishing: Saarbrucken, Germany, 2012;
ISBN 978-3-8454-3137-6.
8. Rusek, F.; Persson, D.; Lau, B.K.; Larsson, E.G.; Marzetta, T.L.; Edfors, O.; Tufvesson, F. Scaling up MIMO:
Opportunities and challenges with very large arrays. IEEE Signal Proc. Mag. 2013, 30, 40–60. [CrossRef]
9. Yue, D.-W.; Zhang, Y.; Jia, Y.-N. Beamforming based on specular component for massive MIMO systems in
Rician fading. IEEE Wirel. Commun. Lett. 2015, 4, 197–200. [CrossRef]
10. Yue, D.-W.; Yan, Q. Beamforming only based on specular component for downlink massive MIMO systems.
Wirel. Pers. Commun. 2017, 95, 1799–1810. [CrossRef]
11. Yue, D.-W.; Meng, Z. LOS component-based equal gain combining for Ricean links in uplink massive MIMO.
In Proceedings of the 2016 Sixth International Conference on Instrumentation & Measurement, Computer,
Communication and Control (IMCCC), Harbin, China, 21–23 July 2016; pp.476–481.
12. Yue, D.-W.; Zhang, Q.T. Generic approach to the performance anaysis of correlated transmit/receive diversity
MIMO systems with/without co-channel interference. IEEE Trans. Inf. Theory 2010, 56, 1147–1157. [CrossRef]
13. Yue, D.-W. Specular component-based beamforming for broadband massive MIMO systems with
doubly-ended correlation. Electron. Lett. 2016, 52, 1082–1084. [CrossRef]
14. McKay, M.R.; Collings, I.B. On the capacity of frenquency-flat and frequency-selective Rician MIMO channels
with single-ended correlation. IEEE Trans. Wirel. Commun. 2006, 5, 2038–2043. [CrossRef]
15. Dinis, R.; Montezuma, P. Iterative receiver based on the EGC for massive MIMO schemes using SC-FDE
modulations. Electron. Lett. 2016, 52, 972–974. [CrossRef]
16. Love, D.J.; Heath, R.W. Equal gain transmission in multiple-input multiple-output wireless systems.
IEEE Trans. Commun. 2003, 51, 1102–1110. [CrossRef]
17. Zheng, X.; Xie, Y.; Li, J.; Stoica, P. MIMO transmit beamforming under uniform elemental power constraint.
IEEE Trans. Signal Process. 2007, 55, 5395–5406. [CrossRef]
151
Electronics 2019, 8, 79
18. Fu, H.; Crussière, M.; Hélard, M. BER analysis for equal gain transmission in downlink multiuser MIMO
systems. IEEE Wirel. Commun. Lett. 2015, 4, 533–536. [CrossRef]
19. Jin, S.; Gao, X.; You, X. On the ergodic capacity of rank-1 Ricean fading MIMO channels. IEEE Trans.
Inf. Theory 2003, 53, 502–517. [CrossRef]
20. Bölcskei, H.; Borgmann, M.; Paulraj, A.J. Impact of the propagation environment on the the performance of
space-frequency coded MIMO-OFDM. IEEE J. Sel. Areas Commun. 2003, 21, 427–439. [CrossRef]
21. Chen, J. When does asymptotic orthogonality exist for very large arrays? In Proceedings of the Global
Communications Conference (GLOBECOM), Atlanta, GA, USA, 9–13 December 2013; pp. 4251–4255.
22. Tulino, A.M.; Verdu, S. Random matrix theory and wireless communications. Found. Trends Commun.
Inf. Theory 2004, 1, 1–182. [CrossRef]
23. Wang, C.-X.; Ge, X.; Cheng, X.; Zhang, G.; Thompson, J. Cooperative MIMO channel models: A survey.
IEEE Commun. Mag. 2010, 48, 80–87. [CrossRef]
24. Ngo, H.Q.; Suraweera, H.A.; Matthaiou, M.; Larsson, E.G. Multipair full-duplex relaying with massive
arrays and linear processing. IEEE J. Sel. Areas Commun. 2014, 32, 1721–1736. [CrossRef]
25. Suraweera, H.A.; Krikidis, I.; Yuen, C. Antenna selection in the full-duplex multi-antenna relay channel.
In Proceedings of the 2013 IEEE International Conference on Communications (ICC), Budapest, Hungary,
9–13 June 2013; pp. 4823–4828.
26. Durisi, G.; Tarable, A.; Camarda, C.; Devassy, R.; Montorsi, G. Capacity bounds for MIMO microwave
backhaul links affected by phase noise. IEEE Trans. Commun. 2014, 62, 920–929. [CrossRef]
c 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access
article distributed under the terms and conditions of the Creative Commons Attribution
(CC BY) license (http://creativecommons.org/licenses/by/4.0/).
152
electronics
Article
Hybrid Beamforming for Millimeter-Wave
Heterogeneous Networks
Mostafa Hefnawi
Department of Electrical and Computer Engineering, Royal Military College of Canada,
Kingston, ON K7K 7B4, Canada; [email protected]
1. Introduction
Recently, heterogeneous networks (HetNets) that use massive multiple-input multiple-output
(MIMO) and millimeter-wave (mmWave) technologies has emerged as a promising solution to
enhance the network capacity and coverage of next-generation 5G cellular networks [1–6]. Small cell
deployment in HetNets can achieve high signal to interference plus noise ratio (SINR) and dense
spectrum reuse, mmWave can address the current challenge of bandwidth shortage, and the large
number of antenna arrays [7–10] are essential for mmWaves to compensate for channel attenuation.
In Reference [11] we applied the concept of massive multiuser (MU)-MIMO to enhance both the access
and the backhaul links in HetNets, and it was shown that such a concept could significantly improve
the system performance in terms of link reliability, spectral efficiency, and energy efficiency. Traditional
MIMO-beamforming systems require a dedicated radio frequency (RF) chain for each antenna element,
which becomes impractical with massive MIMO systems due to either cost or power consumption.
To reduce the number of RF chains, hybrid beamforming (HBF), which combines RF analog and
baseband digital beamformers, has been proposed as a promising solution [12–17]. Figure 1 shows
a general hybrid configuration that connects Na antenna elements to Nd RF chains, where Nd < Na ,
using an analog RF beamforming matrix built from only phase-shifters. Two widely-used analog
beamformer architectures for hybrid beamforming are shown in Figure 2. The fully-connected hybrid
beamforming structure of Figure 2a provides a full beamforming gain per transceiver—but with
high complexity—by connecting each RF chain to all antennas through a network of Nd × Na phase
shifters [12–15]. Figure 2b, on the other hand, shows a partially-connected structure, where each RF
chain is connected to Na /Nd number of sub-arrays. Such a structure has a lower hardware complexity
at the price of reduced beamforming gain.
(a) (b)
Figure 2. The architecture of analog beamformers: (a) Fully-connected structure; (b) partially-
connected structure.
Previous studies on massive hybrid MIMO mainly focused on single-user systems [12–14]. On the
other hand, MU-MIMO cases were studied in References [15–17]. In Reference [15] a scheme called Joint
Spatial Division Multiplexing (JSDM) was proposed to create multiple virtual sectors which reduce the
overhead and computational complexity of downlink training and uplink feedback. In References [16,17]
it was shown that the required number of RF chains only needs to be twice the number of data
streams to achieve the same performance of any fully-digital beamforming scheme. These studies,
however, did not consider HBF in the context of HetNets and focused primarily on macro-cellular
systems. In this paper, we consider a HetNet where both the macro-cell base stations (MBSs) and
small-cell base stations (SBSs) are equipped with fully-connected massive hybrid antenna arrays,
while all mobile users have a single antenna. We propose a low-complexity HBF that is fully-based on
eigenbeamforming. The MBSs and the SBSs select the best-fixed multi-beams by eigendecomposition
of the access and backhaul channels. The selected beams are then used by the digital beamformers,
which are based on the maximization of the receive SINR of the effective channels consisting of the
cascade of the analog beamforming weights and the actual channel [18,19].
2. System Model
We consider the access and backhaul uplinks in the HetNet of Figure 3, where K cognitive small
cells and their Ls small-cell users (SUs) are concurrently sharing the same frequency band with one
MBS and their L p macro-cell primary users (PUs). It is assumed that both the MBS and SBSs are
equipped with massive hybrid antenna arrays while the SUs and PUs are equipped with a single
154
Electronics 2019, 8, 133
antenna. The SBSs are acting as smart relays between the SUs and the MBS with Na − element
transmitting/receiving antenna arrays and Nd RF chains. The MBS is equipped with Ma − element
antenna arrays and Md RF chains. It is also assumed that both the SBS and the MBS perform
an OFDM-based transmission and that the analog beamformers are identical for all subcarriers while
adapting digital beamformers
! in each
" subcarrier. ! "
p p p
Let xs [ f n ] = x1s , x2s , · · · , x sLs and x p [ f n ] = x1 , x2 , · · · , x L p denote, respectively, the set of Ls
SUs signals and L p PUs signals transmitted on each subcarrier, and f n , n = 1, · · · , Nc , where Nc
denotes the number of subcarriers per OFDM symbol in the system. The analysis is done separately
on each subcarrier. For brevity therefore, we drop the frequency index f n .
Figure 3. System model: hybrid beamforming-based HetNet with one macro-cell and K small-cells.
where Gk,SU ∈ C Na × Ls is the channel matrix between the kth SBS and its Ls users, Gk,PU ∈ C Na × L p is
the channel matrix between the kth SBS and the L p PUs, xs ∈ C Ls ×1 is the transmitted signal vector of
Ls users in the kth small-cell, x p ∈ C L p ×1 is the transmitted signal vector of L p users in the HetNet,
and nk,SBS ∈ C Na ×1 is the received complex additive white Gaussian noise (AWGN) vector at the
kth SBS.
It should be noted that in Equation (1), the interference between small-cells was neglected.
This was justified by the fact that small-cell base stations are using a large number of antennas,
which enables sharp beamforming towards their users without harming neighboring small-cells.
The kth SBS received signal, yk,SBS , is first applied to an Na × Nd receive analog beamforming
weight matrix, ASBS R,k,ls , whose output is directed to an Nd × Nd receive digital beamforming weight
vector DSBS
R,k,ls . If we denote the combined digital-analog receive beamformer for the lsth user as
wR,k,ls = AR,k,ls DR,k,ls , then the detection of the lsth user signal by its kth SBS can be expressed as
SBS SBS
rk,ls = wH
R,k,ls yk,SBS = w R,k,ls Gk,SU xs + w R,k,ls Gk,PU x p + w R,k,ls nk,SBS
H H H
Ls (2)
= wR,k,l
H g x s + wR,k,l
s k,ls ls
H
s
∑ gk,ixs + wR,k,l
H
s
+ Gk,PU x p + wR,k,l
H n
s k,SBS
i
i =1,i =ls
155
Electronics 2019, 8, 133
where gk,ls is the lsth column of Gk,SU that represents the channel between the kth SBS and its lsth user.
H
If we denote H AL,k,ls = ASBSR,k,ls gk,ls as the effective access channel between the kth SBS and its
H
lsth user, then for a set of selected beams, i.e. known ASBS R,k,ls , the SINR can be expressed in terms of
the digital beamformer, DSBS
R,k,ls , as
H H
DSBS
R,k,ls H AL,k,ls xlss xlss H H SBS
AL,k,ls D R,k,ls
k,ls =
γSBS H , (3)
DSBS
R,k,ls B AL D SBS
R,k,ls
Ls H
H
B AL = ∑ (ASBS s s H H SBS
R,k,i ) gk,i xi ( xi ) gk,i A R,k,i + (A R,k,ls ) Gk,PU x p x p Gk,PU A R,k,ls + σn (A R,k,ls )A R,k,ls , (4)
2
SBS
34
H H SBS
5
2 SBS SBS
i =1,i =ls
2 34 5 Interference from L p PUs
Interference from Ls −1 SUs
beamforming matrix ASBST,k . If we denote the combined digital-analog transmit beamformer for the k
th
SBS as wT,k = AT,k DT,k , then the array output of the MBS can be written as
SBS SBS
K
∑ Hk,MBS wT,k sk + HPU,MBS x p + n MBS ,
p p
yMBS = (5)
k =1
p
where yMBS is the Ma × 1 vector containing the outputs of the Ma − element antenna array of the
MBS, Hk,MBS is the Na × Ma channel matrix representing the transfer functions from the Na − element
antenna array of the kth SBS to the Ma − element antenna array of the MBS, HPU,MBS is the Ma × Lp
channel matrix from the L p PUs to the MBS’s Ma − element antenna array, and n MBS is the received
Ma × 1 complex AWGN vector at the MBS.
p
The MBS detects the kth SBS signal by applying the output of the array yMBS to the Ma × Md receive
analog weight matrix AR,k followed by the Md × Md receive digital beamforming weight vector Dk,R
MBS MBS .
If we denote the combined digital-analog receive beamformer for the k SBS as ck = Ak,R Dk,R ,
th MBS MBS
then the detection of the kth SBS signal by the MBS can be expressed as
p p
x̂k = ckH yMBS = Sk + S Ik + S I p + ckH n MBS , (6)
p p p
where Sk = ckH Hk,MBS wT,k sk is the kth SBS signal, S Ik = ckH ∑iK=1,i =k Hk,MBS wT,k sk is the interference
from K − 1 other SBSs, and SIp = ckH HPU,MBS x p is the interference from L p PUs.
p p
Assuming that sk are complex-valued random variables with unit power, i.e., E sk 2 = 1,
H H
and denoting HBL,k = AR,kMBS Hk,MBS ASBS
T,k as the effective channel between the kth SBS and the
MBS, the SINR at the MBS for the kth SBS can be expressed as
H H
MBS
DR,k HBL,k DSBS DSBS H MBS
T,k T,k H BL,k D R,k
γkMBS = , (7)
ckH BBL ck
156
Electronics 2019, 8, 133
and the expression for the array output of the MBS can be written as
K
y MBS = HPU,MBS x p + yk,MBS + ∑ yi,MBS + n MBS , (9)
i =1, i =k
When the MBS applies the output of the array, y MBS , to the hybrid weight, ckH , the detection of
the lsth
user signal of the kth SBS by the MBS can be expressed as
x̂k,ls = ckH yMBS = ckH Sk,ls + SSBSs
I + SSU
k,I + S I + N MBS
PU
, (11)
where
The end-to-end SINR at the MBS for the lsth user of the kth SBS can be expressed as
H
ckH Hk,MBS wT,k wR,k,l
H g x s xlss gk,l
H w H H
s k,ls ls s R,k,ls wT,k H k,MBS ck
MBS
γk,l = , (12)
s
ckH B AL− BL ck
H H H
where B AL− BL = SSU SU
k,I Sk,I + SSBS
I SSBS
I + SPU
I SPU
I + N MBS (N MBS ) H is the covariance
matrix of the interference-plus-noise for the lsth user end-to-end link.
The ergodic channel capacity for each user, ls , is given by [19]
H
#
ckH Hk,MBS wT,k wR,k,l
H g xs x Hs gk,l
s k,ls
H w H
R,k,ls wT,k H k,MBS ck
C = E log2 1 + s
, (13)
ckH B AL− BL ck
157
Electronics 2019, 8, 133
αil e− j2πi f n /Nc are the complex gains of the jth ray in the ith scattering cluster and 6
where αil, f n = 6 αil are
assumed i.i.d CN(0, σ6α2,i ). With σ6α2,i representing the average power of the ith cluster, ∅ri,l and ∅i,l t are
the azimuth angles of arrival and departure, respectively, θi,j r and θ t are the elevation angles of arrival
i,j
and departure, respectively, and a MBS, f n ∅ri,l , θi,l r and ak,SBS, f n ∅i,l
t , θt
i,l represent, respectively,
the normalized receive and transmit array response vectors of the MBS and the kth SBS.
For the access link, the channel matrix at subcarrier f n between the kth SBS and its Ls users can be
written as
7
Ls Ma
a∗k,SU, f n ∅i,l
Ncl Nray
Ncl Nray ∑i ∑l =1 il, f n SBS, f n
Gk,SU, f n = α a ∅ri,l , θi,l
r t t
, θi,l , (15)
where ak,SU, f n ∅i,l
t , θt
i,l represents the spatial signature vector of the Ls single antenna users.
The Nray azimuth and elevation angles, ∅r,t r,t
i,l and θi,l , within the cluster i are assumed to be
randomly distributed with a uniformly-random mean cluster angle of ∅r,t r,t
i and θi , respectively,
and a constant angular spread of σ∅r,t and σθ r,t , respectively.
For simplicity, the access links between the MBS and its L p users (HPU,MBS ) and between PUs and
the kth SBS (Gk,PU ) are modeled by convetional i.i.d MIMO channels.
Note that in this per-subcarrrier representation, it is assumed
that foreach subcarrierf n , the carrier
frequency f c is much larger than f c ± f n and that a MBS, f n ∅ri,l , θi,lr and ak,SBS, f n ∅i,lt , θt
i,l can
approximately be considered equal for all subcarriers. Consequently, the channel covariance
matrices are approximately similar with the same set of eigenvectors for all subcarriers and can be
Nc
AL,k,ls H AL,k,ls = Nc ∑n=1 H AL,k,ls , f n H AL,k,ls , f n ,
1
replaced by the average of the covariance matrices, i.e., H H H
N N
BL,k = f n H BL,k, f n , and H PU,MBS H PU,MBS =
H H 1 H H 1 H
HBL,k Nc ∑n=c 1 HBL,k, Nc ∑n=c 1 HPU,MBS, f n H PU,MBS, f n .
158
Electronics 2019, 8, 133
Since the analog beamforming matrix ASBS R,k,ls is realized using phase shifters only, its elements,
2
SBS
AR,k,ls (i, j), satisfy AR,k,ls (i, j) = 1. It should be noted that each SBS is using a different analog matrix
SBS
for each user and that the system model shown in Figure 1 focuses on the detection of the lsth user of the
kth SBS and shows the analog beamformer and the RF chains for one user only. The analog beamformer
can be implemented using the Butler matrix as shown in Figure 4, where four users ( Ls = 4) and
four RF chains per user ( Nd = 4) are assumed. Depending upon which 4 ports are activated, 4 beams
are produced in specified directions. Since we are assuming 4 different channels, we should expect
4 different ports for each user.
Figure 4. Hybrid beamforming based on Butler matrix for the access link.
Once the analog beams are selected, the received optimal digital weights, DSBS R,k,ls , are obtained
based on the maximization of the access link receive SINR, γSBS
k,ls , given by Equation (3):
−1
R,k,ls = B AL,k,ls V AL,k,ls ,
DSBS (17)
where V AL,k,ls denote the eigen vector corresponding to the maximum eigenvalue of the effective access
channel, H H
AL,k,ls H AL,k,ls .
159
Electronics 2019, 8, 133
−1
T,k = D R,k = B BL,k HBL,k V BL ,
DSBS MBS
(19)
4. Simulation Results
In our simulation setups, we considered a HetNet organized into four SBSs (K = 4) and one
macro-cell. The SBSs and the MBS used the same number of antennas, Na = Ma = 64, and the same
number of RF chains, Nd = Md = 2 or 4. Each SBS is serving Ls = 4 users and the macro-cell is
serving 4 users, each transmitting with a single antenna. We assumed QPSK modulation. For the
OFDM configurations, we assumed the 256-OFDM system (Nc = 256), which is widely deployed in
broadband wireless access services.
Figure 5 shows the beampattern of the proposed HBF with four RF chains and the optimal
fully-digital one for the access link. It is noted that the optimal beamformer has about five dominant
beams, three of which are similar to the selected beams of the proposed HBF. This beampattern means
that the data streams can be successfully transmitted through those three beams using the proposed
HBF and that near optimal performance could be achieved if we were to bring the number of RF chains
close to the number of dominant beams of the optimal beamformer. For the backhaul link, Figure 6
shows very similar beampatterns with more dominant beams.
(a) (b)
Figure 5. Beampattern of the access link: (a) Proposed HBF, 4 RF chains; (b) fully-digital
beamforming (optimal).
(a) (b)
Figure 6. Beampattern of the backhaul link: (a) Proposed HBF, 4 RF chains; (b) fully-digital
beamforming (optimal).
160
Electronics 2019, 8, 133
Figure 7, on the other hand, compares the ergodic channel capacity of the proposed HBF and the
optimal fully-digital one. It is observed that for both cases the optimal beamformer is outperforming
the proposed HBF. However, as we increase the number of RF chains, the performance gap between
the two schemes was reduced, and a near-optimal solution was achieved by the proposed HBF using
four RF chains. On the other hand, for the single cell MU-MIMO case presented in References [12–14],
near optimal performance was achieved with only five RF chains, and for the MU-MIMO case in [16,17],
it was shown that the required number RF chains could be reduced to two to achieve fully digital
beamforming performance. However, unlike our case, where we have assumed a HetNet with a macro
cell and multiple small cognitive cells, these studies focused primarily on macro-cellular systems and
did not consider HBF in the context of HetNets.
Figure 7. Ergodic channel capacity of the proposed HBF for different number of RF chains.
5. Conclusions
In this paper, we employed hybrid beamforming at the access and backhaul links of a mmWave
HetNet system. We proposed a low-complexity HBF that was fully-based on MRT/MRC Eigen-
beamforming schemes. The performance evaluation in terms of the beam patterns and the ergodic
channel capacity showed that the proposed HBF scheme achieved near-optimal performance with
only four RF chains and required considerably less computational complexity.
References
1. Siddique, U.; Tabassum, H.; Hossain, E.; Kim, D.I. Wireless backhauling of 5G small cells: Challenges and
solution approaches. IEEE Wirel. Commun. 2015, 22, 22–31. [CrossRef]
2. Gao, Z.; Dai, L.; Mi, D.; Wang, Z.; Imran, M.A.; Shakir, M.Z. MmWave Massive MIMO Based Wireless
Backhaul for 5G Ultra-Dense Network. IEEE Wirel. Commun. 2015, 22, 13–21. [CrossRef]
3. Tabassum, H.; Hamdi, S.A.; Hossai, E. Analysis of massive MIMO-enabled downlink wireless backhauling
for full-duplex small cells. IEEE Trans. Commun. 2016, 64, 2354–2369. [CrossRef]
4. Shariat, M.; Pateromichelakis, E.; Quddus, A.; Tafazolli, R. Joint TDD Backhaul and Access Optimization in
Dense Small-Cell Networks. IEEE Trans. Veh. Technol. 2015, 64, 5288–5299. [CrossRef]
5. ElSawy, H.; Hossain, E.; Kim, D.I. HetNets with cognitive small cells: User offloading and Distributed
channel allocation techniques. IEEE Commun. Mag. 2013, 51, 28–36. [CrossRef]
161
Electronics 2019, 8, 133
6. Yan, Z.; Zhou, W.; Chen, S.; Liu, H. Modeling and Analysis of Two-Tier HetNets with Cognitive Small Cells.
IEEE Access. 2016, 5, 2904–2912. [CrossRef]
7. Marzetta, T.L. Noncooperative cellular wireless with unlimited numbers of base station antennas. IEEE Trans.
Wirel. Commun. 2010, 9, 3590–3600. [CrossRef]
8. Rusek, F.; Persson, D.; Lau, B.; Larsson, E.; Marzetta, T.L.; Edfors, O.; Tufvesson, F. Scaling up MIMO:
Opportunities and challenges with very large arrays. IEEE Signal Process. Mag. 2013, 30, 40–60. [CrossRef]
9. Ngo, H.Q.; Larsson, E.G.; Marzetta, T.L. Energy and spectral efficiency of very large multiuser MIMO
systems. IEEE Trans. Commun. 2013, 61, 1436–1449.
10. Hoydis, J.; ten Brink, S.; Debbah, M. Massive MIMO in the UL/DL of cellular networks: How many antennas
do we need? IEEE J. Sel. Areas Commun. 2013, 31, 160–171. [CrossRef]
11. Hefnawi, M. Capacity-Aware Multi-User Massive MIMO for Heterogeneous Cellular Network. In Proceedings
of the IEEE International Conference on Selected Topics in Mobile and Wireless Networking, Tangier,
Morocco, 20–22 June 2018.
12. Sohrabi, F.; Yu, W. Hybrid beamforming with finite-resolution phase shifters for large-scale MIMO systems.
In Proceedings of the IEEE Workshop Signal Processing Advances in Wireless Communications, Stockholm,
Sweden, 28 June–1 July 2015.
13. El Ayach, O.; Rajagopal, S.; Abu-Surra, S.; Pi, Z.; Heath, R. Spatially sparse precoding in millimeter wave
MIMO systems. IEEE Trans. Wirel. Commun. 2014, 13, 1499–1513. [CrossRef]
14. Alkhateeb, A.; El Ayach, O.; Leus, G.; Heath, R. Channel estimation and hybrid precoding for millimeter
wave cellular systems. IEEE J. Sel. Top. Signal Process. 2014, 8, 831–846. [CrossRef]
15. Sohrabi, F.; Yu, W. Hybrid digital and analog beamforming design for large-scale MIMO systems.
In Proceedings of the IEEE International Conference on Acoustics, Speech, Signal Process (ICASSP), Brisbane,
Australia, 19–24 April 2015.
16. Liang, L.; Dai, Y.; Xu, W.; Dong, X. How to approach zero-forcing under RF chain limitations in large
mmwave multiuser systems? In Proceedings of the IEEE/CIC International Conference on Communications
in China, Shanghai, China, 13–15 October 2014.
17. Liang, L.; Xu, W.; Dong, X. Low-complexity hybrid precoding in massive multiuser MIMO systems.
IEEE Wirel. Commun. 2014. [CrossRef]
18. Kang, M. A comparative study on the performance of MIMO MRC systems with and without cochannel
interference. IEEE Trans. Commun. 2004, 52, 1417–1425. [CrossRef]
19. Sulyman, A.I.; Hefnawi, M. Adaptive MIMO Beamforming Algorithm Based on Gradient Search of the
Channel Capacity in OFDMSDMA System. IEEE Commun. Lett. 2008, 12, 642–644. [CrossRef]
© 2019 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access
article distributed under the terms and conditions of the Creative Commons Attribution
(CC BY) license (http://creativecommons.org/licenses/by/4.0/).
162
electronics
Article
Spatio-Radio Resource Management and Hybrid
Beamforming for Limited Feedback
Massive MIMO Systems
Hedi Khammari 1 , Irfan Ahmed 2, *, Ghulam Bhatti 3 and Masoud Alajmi 1
1 Department of Computer Engineering, College of Computers and Information Technology, Taif University,
Taif 21974, Saudi Arabia; [email protected] (H.K.); [email protected] (M.A.)
2 Department of Electrical Engineering, Higher Colleges of Technology, Ruwais 12389, UAE
3 Department of Computer Science, College of Computers and Information Technology, Taif University,
Taif 21974, Saudi Arabia; [email protected]
* Correspondence: [email protected]
Abstract: In this paper, a joint spatio–radio frequency resource allocation and hybrid beamforming
scheme for the massive multiple-input multiple-output (MIMO) systems is proposed. We consider
limited feedback two-stage hybrid beamformimg for decomposing the precoding matrix at the
base-station. To reduce the channel state information (CSI) feedback of massive MIMO, we utilize the
channel covariance-based RF precoding and beam selection. This beam selection process minimizes
the inter-group interference. The regularized block diagonalization can mitigate the inter-group
interference, but requires substantial overhead feedback. We use channel covariance-based
eigenmodes and discrete Fourier transforms (DFT) to reduce the feedback overhead and design
a simplified analog precoder. The columns of the analog beamforming matrix are selected based on
the users’ grouping performed by the K-mean unsupervised machine learning algorithm. The digital
precoder is designed with joint optimization of intra-group user utility function. It has been shown
that more than 50% feedback overhead is reduced by the eigenmodes-based analog precoder design.
The joint beams, users scheduling and limited feedbacK-based hybrid precoding increases the
sum-rate by 27.6% compared to the sum-rate of one-group case, and reduce the feedback overhead
by 62.5% compared to the full CSI feedback.
1. Introduction
The scarcity of available frequency band for wireless communications has led to the inclusion
of millimeter Wave (mmWave) frequencies in cellular communications. This has opened the doors
for massive multiple-input multiple-output (MIMO) systems. Due to high transmission frequencies,
fabrication of large number of antennas with small form factor has become possible. MmWave band
has inherent hindrances, like, high path-loss and absorption-loss. It has been known that MIMO
systems advantages (spatial multiplexing or diversity gain) are scaled-up with the number of antennas.
In summary, one can enjoy the benefits of the large bandwidth available at mmWave frequencies by
combating high path and absorption losses with massive MIMO directional beamforming. Future
mmWave massive MIMO-based cellular networks will be as shown in Figure 1. Due to the high
pathloss on one hand and high directional gain on the other hand, the inter-cell interference and cell
boundaries will become meaningless. The fixed area size cell boundaries of traditional cellular will
probably no longer exist in the future mmWave massive MIMO systems. Narrow beams can serve
distant user equipment (UE) without interfering other UEs provided that there is no obstacle between
BS and intended UE, whereas a closely located UE may deprive of connection due to the obstacles.
ĂƐĞ^ƚĂƚŝŽŶ
ĂƐĞ^ƚĂƚŝŽŶ ĂƐĞ^ƚĂƚŝŽŶ
ĂƐĞ^ƚĂƚŝŽŶ
The cost of massive MIMO is in terms of excessive feedback overhead for channel estimation
along with the hardware complexity of RF chains (increased number of radio-frequency (RF) chains).
The feedback overhead has been tackled separately for frequency division duplex (FDD) and time
division duplex (TDD) systems. In FDD systems, the uplink channel estimation consists of fewer
overheads compared to the downlink channel estimation, because generally, the number of transmit
antennas Nt is larger than the number of users K, and the number of receive antennas per user nk
(Nt K and Nt nk ). The most common technique to reduce the downlink channel estimation
overhead is joint spatial division multiplexing (JSDM) [1]. The JSDM uses two-stage precoding:
second order channel statistics (covariance)-based user grouping and the traditional MU-MIMO
linear precoding (zero-forcing) for the inter-user interference mitigation based on the low-dimensional
effective channel. In TDD, only uplink channel estimation is done and the downlink channel estimates
are obtained by the transpose of the uplink channel using the channel reciprocity principle. The TDD
massive MIMO systems suffer from pilot contamination when the BS receives non-orthogonal pilot
signals from the neighboring cells. This pilot contamination degrades the channel estimation and
hence, affects both uplink combining and downlink precoding.
In traditional MIMO systems, a separate RF chain (analog-to-digital converter/digital-to-analog
converter, serial-to-parallel/parralel-to-serial converter, up/down converter etc) is required for each
antenna, but the high power consumption makes it infeasible for the case of massive MIMO systems.
Hybrid beamforming technique resolves this problem by dividing the precoding/combining into
baseband digital processing and RF analog processing. The hybrid precoding and combining
offer extra degrees of freedom in space domain with a large number of antennas and analog
beamforming [2]. The hybrid beamforming can be realized by using MU-MIMO precoding as
baseband digital precoding and the statistical channel state information-based pre-beamforming as RF
analog precoding. This limited feedback (due to average CSI) configuration is particularly suited for
massive MIMO mmWave systems with a large number of antennas but relatively small number of RF
chains [3]. It has been shown [4], that the covariance-based limited feedback works well for mmWave
massive MIMO systems, where the number of users is small with respect to the number of BS antennas
and the channels are formed by a few multi path components (MPCs) with small angular spread.
164
Electronics 2019, 8, 1061
Limited work has been done on the joint multiuser massive MIMO resource allocation and
hybrid beamforming design. Although mmWave massive MIMO system has a potential of tremendous
increase of spectral efficiency. However, the cost and power consumption of power-hungry radio
frequency chains (analog-to-digital converter (ADC)/digital-to-analog converter (DAC), parallel to
serial converter, serial to parallel converter, up converter/down converter) make it impractical to build
a complete RF chain for each antenna. A promising solution to this problem is hybrid beamforming,
where the precoder at the transmitter is divided into two parts: analog precoder and digital precoder.
The analog precoder (usually a network of phase shifters) at the RF stage reduces the number of
RF chains required for the digital precoder. In order to configure these precoders, the transmitter
requires channel state information in the form of uplink feedback from users, but in the presence
of massive antennas, this feedback becomes a huge load on the wireless uplink, especially in FDD
mode. JSDM [4] is a technique used to reduce the feedback overhead. It uses slowly varying average
channel statistics to implement the analog precoder; then, the digital precoder is realized by using a
low-dimensional effective channel. Till now, different variants of the JSDM have been proposed. Li et
al. [5] generalize the JSDM scheme to support non-orthogonal virtual sectorization and with multiple
RF chains at both link ends. It uses the Kronecker channel model to decouple the transmit and
receive beamforming. Under this channel, the analog beamformer is obtained by stacking strongest
eigenbeams of the channel covariance matrix and then the digital beamformer is based on a weighted
minimum mean squared error (MMSE) with effective channel. However, the Kronecker model does
not characterize the mmWave channel where transmitter and receiver have coupling effects due to
highly directional transmission. In [6], the authors apply JSDM using a geometrical channel model and
find hybrid precoder and combiner at transmitter and receivers, respectively. Hybrid beamforming
with switches (HBwS) has been introduced in [7], where, L × Nt analog beamformer is controlled by
NRF × L instantaneous CSI based switches. Nt is the number of transmit antennas, NRF is the number
of RF chains, and NRF < L < Nt . Another switch-based analog beamforming is proposed in [8] but it
requires instantaneous CSI for both switching network and the phase shifter network. Also it contains
L = Nt . The JSDM implementation also requires the training in the downlink to estimate the channel
covariance matrix. Most of the work assumes that the CSI is known at both ends. In [9], authors
consider the joint optimization of the training resource allocation and channel-statistics-based analog
beamformer design by using user centric virtual sectorization. There are different structures for the
phase shifter-based analog beamformer, namely, fully connected, sub-connected, and dynamically
connected [10]. Park [11] investigate JSDM with these analog beamformer architectures. The dynamic
architecture gives better result at the cost of added complexity. In [12], authors propose a hybrid
beamforming method with unified analog beamformer by Subspace Construction (SC) based on
partial CSI in massive MIMO OFDM system. In [13], statistical CSI based analog beamformer uses
regularized block diagonalization to mitigate the inter-group interference and instantaneous CSI based
digital beamformer utilizes the weighted MMSE to suppress intra-group interference. Jiang et al. [14]
jointly optimize the user selection and beam selection during analog beamforming design. They use
Lyapunov-drift optimization framework to obtain the optimal solution. Their work only focuses on the
design of statistical CSI based analog precoder and user/beam selection. Our previous work [15] on
resource allocation for transmit beamforming develops digital and analog precoders which maximize
the sum rate with total power and desired number of RF chains constraints. The provided solutions
require full instantaneous CSI at the transmitter and receiver, which, in case of the massive MIMO,
consists of large number of pilot transmission in downlink and channel information feedback in
the uplink. In this work we exploit the channel similarities by grouping (K-Mean machine learning)
the users based on the location information. Low complexity DFT matrix based analog precoder is
derived using statistical CSI. This greatly reduces the feedback overhead for the design of zero-forcing
digital precoder.
Machine learning (ML) applications for the physical layer of wireless communication systems
have been widely reported in [16]. Most of the conventional transmitter and receiver blocks can be
165
Electronics 2019, 8, 1061
replaced by an ML-based auto encoder as suggested by the authors. The large number of antennas
in massive MIMO leads to the challenging issue of channel estimation in mmWave communications.
A common practice in TDD massive MIMO systems is to utilize the channel reciprocity to get the
downlink CSI from uplink channel information estimates. However, in FDD, the channel reciprocity is
not applicable and the downlink CSI estimation is very difficult. The downlink channel estimation is
known to be hampered by the pilot contamination effect (user to base-station). The quality of channel
estimates is deteriorated by the mutual interference caused by the non-orthogonal pilots in a cell. In
[17], a supervised learning-based pilot decontamination scheme for massive MIMO uplink is reported.
In the proposed ML-based solution, the users’ locations in all cells and the pilot assignments stand
for the input features and output labels, respectively. In [18], a deep learning network CsiNet is
used to learn the CSI-to-codeword transformation (codebook approach is usually adopted to reduce
the feedback overhead) at users’ terminals and inverse CsiNet at base-station. The authors of [19],
suggest a learning-based antenna selection for massive MIMO systems. It uses a multiclass K-NN and
support vector machine (SVM) for data-driven optimal antenna selection. Wang et al. [20] employs
K-nearest neighbor (K − NN ) supervised learning for the N beams allocation among K users. In [21],
a reinforcement learning based framework for radio resource management in radio access networks
has been proposed. In our previous work [22], we used neural networks to reduce the execution time
of the computationally intensive resource allocation part of the joint resource allocation and hybrid
beamforming design in [15]. However, in this work, we use K-mean based unsupervised machine
learning scheme to group the users based on their spatial locations. To the best of our knowledge,
there is no research work that jointly consider the spatio–radio resource management and the hybrid
beamforming in massive MIMO systems.
In this work, we use spatial channel covariance matrices for the analog beamforming design.
We also consider the users to RF beam mapping. This mapping requires channel state information
and a search over all possible beam combinations at the base-station. This search is exponential in the
number of users [23]. Due to this exponential increase in complexity, we use DFT-based eigenmode
beams with RF switches.
Contribution: In this paper, we develop joint spatio–radio resource and hybrid precoding
algorithms for limited feedback wideband massive MIMO systems. The contributions of this paper are
summarized as follow.
• First, we consider the problem of joint hybrid precoder design with limited feedback and
user-beam selection to maximize the sum proportional rate under the total power constraint.
The formulated mixed integer programming problem is then transformed to the relaxed-convex
optimization problem.
• Second, a low complexity suboptimal solution is provided for the optimization problem.
The algorithm generates the analog beamforming matrix, digital beamforming matrix, and the set
of users in each group. The DFT/eigenmodes-based analog beamforming is formed using limited
statistical CSI feedback from the users. Then, the digital precoder design with users selection is
done iteratively.
• Finally, we develop a K-Mean algorithm based unsupervised machine learning scheme for users
grouping. These users groups are used to form the limited feedback (statistical channel state
information) based analog beamforming matrix. The proposed machine learning based analog
beamforming along with the zero-forcing digital precoding and user scheduling gives better
performance than the DFT/eigenmodes-based solution.
The rest of the paper is organized as follows. System, signal, and channel model along with the
problem formulation are described in Section 2. Section 3 introduce the relaxed-convex transformation
of the formulated mixed integer optimization problem. Suboptimal solution to the joint resource
allocation and hybrid beamforming based on eigenmodes and discrete Fourier transform is given
in Section 4. Section 5 proposes machine learning based users grouping and beam selection for
166
Electronics 2019, 8, 1061
joint optimization problem. Simulation results are given in Section 6, followed by the conclusions
in Section 7.
Notations: Bold upper and lower case letters denote vectors and matrices, respectively.
The notations X−1 , X† , X T , X H , and tr (X) denote the inverse, pseudo-inverse, transpose, Hermitian
transpose, and trace of a matrix X. vec{·} is a vector operator, diag{ x1 , ..., xn } is diagonal matrix,
and ⊗ is the Kronecker product. · F denotes the Frobenius norm. The n × n identity matrix is
denoted by In . E{·} represents the expectation with respect to the random variable within the brackets.
2. System Model
Consider a FDD MU-MIMO downlink system where a base station (BS) with Nt antennas is
located at the cell center and transmits to K single antenna users as shown in Figure 2. There are G
groups of users such that the group g ∈ G = {1, ..., G }. Each group contains K g users.
Őсϭ
ŐсϮ
Őс'
Assume that the BS and users have the knowledge of the channel. We consider multi-carrier
OFDM transmission with narrow-band blocK-fading channel. The BS is equipped with Nt antennas in
linear antenna array (ULA) configuration. The information signal block S ∈ CK × N f at the input of the
BS transceiver for the user k is given as
s k = [ s1 , s2 , . . . , s N f ], ∀k, (1)
where N f and Ns are the number of subchannels and the number of symbols per subchannel,
respectively. In a subchannel n, the information symbol vector is s ∈ C Ns ×1 . We assume Ns = K,
such that the transmit signal per subchannel n satisfying E{sn snH } = PKn IK , where Pn = PT /N f is the
transmit power per subchannel and PT is the total transmit power of the BS. The transmit signal vector
X is obtained from F B S, where F B ∈ C Nt × Ns is the precoding matrix. The hybrid beamforming divides
the precoding matrix into baseband digital precoding matrix F DB ∈ C NRF × Ns and RF analog precoding
167
Electronics 2019, 8, 1061
matrix F AB ∈ C Nt × NRF , where NRF is the number of RF chains as shown in Figure 3. The transmit
signal X ∈ C Nt × N f is given by
X = FB S
= F AB F DB S (3)
E{tr (XX H )} ≤ PT
H DB H AB H
E{tr (F AB DB
F SS F F )} ≤ PT
H H
tr (F AB F DB E{SS H }F DB F AB ) ≤ PT (4)
PT
since E{SS H } = N f K IKN f , therefore,
H H
tr (F AB F DB F DB F AB ) ≤ N f K (5)
yn = HnH xn + wn
= HnH F AB FnDB sn + wn (6)
where Hn [h1,n , ..., hK,n ] ∈ C Nt ×K is the channel matrix with hk,n = [ h1,k , ..., h Nt ,k ] H being the channel
vector from BS to user k in subchannel n, xn = FnDB F AB sn , and wn ∼ CN (0, σ2 IK ) be the additive white
Gaussian noise (AWGN) in subchannel n at the users. The RF beamforming F RF is performed in time
domain and the same beamforming is applied on all subchannels, whereas, the digital beamforming
FnDB is performed in frequency domain on the per subchannel basis [11]. In the nth subchannel,
the jth UE receives the sum of all transmitted signals for K UEs over its MIMO channel H j,i as
K
y j,n = ∑ h j,n
H
xk,n + w j,n (7)
k =1
where h j,n is the Nt × 1 channel vector. We denote the rank of the channel matrix H j,n by r j,n , where
0 ≤ r j,n ≤ min(K, Nt ), ∀n. In matrix form, the above equation is given as
y j,n = h j,n
H
xn + w j,n (8)
yk = [hk,1
H AB DB H
F F1 s1 , ..., hk,N f N f s N f ] + wk ,
F AB F DB (9)
Combining the signals for all UEs in a K dimensional received signal vector y = [y1 , ..., yK ] H ,
we get the system equation as
Y = H H F AB F DB S + W, (10)
where Y, W ∈ CK × N f .
168
Electronics 2019, 8, 1061
DŝdžĞƌ
DŝdžĞƌ
п ^ͬW &&d LJϭ
R H = E{vec{H}vec{H H }}
= Rtx ⊗ Rrx (11)
under the above assumptions, the channel model H is simplified to Kronecker model,
1/2
H = R1/2
rx KRtx (12)
where K ∼ CN (0, 1) is an i.i.d. unit variance MIMO channel matrix, Rtx and Rrx are the transmit
and receive corrrelation matrices, respectively. The transmit and receive correlation matrices are
given as [24],
Rtx = E{H H H}, Rrx = E{HH H } (13)
The physical models explicitly model wave propagation parameters like the complex amplitude,
DoD, DoA, and delay of an MPC [24,25]. MmWave propagation leads to limited spatial scattering
due to the high free-space pathloss. In addition, the large tightly packed antenna arrays lead to high
levels of antenna correlation. The sparse scattering and antennas spatial correlation makes many of the
commonly used statistical fading distributions inaccurate for mmWave channel modeling. Therefore,
we use extended Saleh-Valenzuela model, which accurately describes the mathematical structure
present in mmWave channels [26,27]. For simplicity, we assume that each scattering cluster around the
transmitter and receiver contributes a single propagation path [28].
In general, the mmWave MIMO channel matrix between the BS with Nt transmit antennas and
a user k with nr receive antennas in subchannel n, can be modeled as double directional channel,
7
Nt nr L
ρk,n L l∑
Hk,n = αk,n,l a(φk,n,l )b H (θk,n,l ), (14)
=0
where L is the total number of multipaths, αk,n,l is the complex gain of the l th path with i.i.d. CN (0, 1),
and ρk,n is the distance dependent pathloss between the BS and user k [29]. The LOS path is included
with l = 0. Moreover, a and b are the receive and transmit steering vectors, respectively. The variables
φk,n,l ∈ [0, 2π ) and θk,n,l ∈ [0, 2π ) are the l th path’s azimuth angles (boresight angles in the receive
array and transmit array) of arrival and departure, respectively. The steering vectors are given by
1
a(φk,n,l ) = √ [ a1 (φl ), ..., anr (φk,n,l )] T (15)
Nr
169
Electronics 2019, 8, 1061
1
b(θk,n,l ) = √ [b1 (θl ), ..., b Nt (θk,n,l )] T (16)
Nt
The elements of transmit and receive steering vectors are given by
dt
bi (θk,n,l ) = e− jωτi,l = e− j2π (i−1) λ sin(θk,n,l )
, i = 1, 2, ..., Nt (17)
dr
ai (φk,n,l ) = e− jωτi,l = e− j2π (i−1) λ sin(φk,n,l )
, i = 1, 2, ..., nr (18)
where ν = NρL t nr
and, Ak,n and Bk,n consist of stacked steering vectors of AoA and AoD, respectively,
i.e., Ak,n = [a(φk,n,1 ), a(φk,n,2 ), . . . , a(φk,n,L )] and Bk,n = [b(θk,n,1 ), b(θk,n,2 ), . . . , b(θk,n,L )]. The matrix
Dk,n is a diagonal matrix, given as Dk,n = diag{αk,n,1 , αk,n,2 , . . . , αk,n,L }. The small scale fading at user
k in subchannel n in multipath component (MPC) is given by αk,n,l with zero mean and variance σk,n,l 2 .
αk,n,l
where Σk,n = diag{σk,n,1
2 , σk,n,2
2 . . . , σk,n,L
2 } and D̄k,n = diag{ᾱk,n,1 , ᾱk,n,2 . . . , ᾱk,n,L } with ᾱk,n,l = σk,n,l
such that E{ᾱk,n,l } = 0 and E{ᾱ2k,n,l } = 1.
Substituting (20) in (13) and averaging over small scale fading, we get the transmit and receive
correlation matrices for user k in the subchannel n as
For mmWave massive MIMO systems with large number of antennas, the steering vectors are
asymptotically orthogonal to each other [6]:
a H (φk,n )a(φk,n ) ≈ 0,
Moreover, in mmWave massive MIMO, acquisition of the instantaneous full CSI is not practical.
Instead, an average CSI in terms of [Ak,n ], [Bk,n ], and [Σk,n ] is a practical solution for the beamforming
design because the coherence time of the channel statistics based CSI is of the order of few seconds or
more as compared to the small scale of the order of milli-second [6].
170
Electronics 2019, 8, 1061
experience the same azimuth center angle (θ g ) and angular spread (Δ g ). In this case, Rrx = I in (12),
therefore, the channel covariance matrix of each user in group g is given by [30]
R g = U g Λ g U gH (25)
h gk = U g Λ1/2
g z g = U g h̃ gk (27)
where z g ∈ C Nt ×1 ∼ CN (0, Ir g ) and h̃ gk is beam domain channel. For large Nt , U g tends to discrete
Fourier transform (DFT) matrix Δ Nt ∈ C Nt × Nt [31]. Each column of U g represents one direction of
angle-of-departure (AoD), i.e., a beam.
Alternatively, for the case, when dominant eigenvalues r̂ g ≤ r g , then, the channel matrix can be
written as ([13], Equation (5))
h gk = Nt1/2 R1/2
g z gk (28)
given by
F AB = [F1AB , ..., FG
AB
], (29)
F DB = diag[F1DB , ..., FG
DB
] (30)
171
Electronics 2019, 8, 1061
⎡ ⎤
H1H F1AB H1H F2AB · · · H1H FGAB
⎢ H AB
⎢ H2 F1 H2H F2AB · · · H2H FGAB ⎥⎥
HeHf f = H H F AB =⎢
⎢ .. .. .. .. ⎥
⎥ (32)
⎣ . . . . ⎦
HG F1AB
H HG F2AB
H · · · HG FG
H AB
The excessive pilot transmission in downlink and feedback in uplink of FDD system can be
reduced by only sending the group-wise average CSI based channel estimates in uplink. This is
accomplished by using the diagonal elements H gH F gAB as feedback information with the size of
K g × NRF,g for g = 1, ...G. The analog pre-beamforming is designed in such a way that the other
elements of matrix (32) H gH F gAB ≈ 0 for all g = g. This group-wise division creates virtual sectors,
each group corresponds to a virtual sector [30].
The second order channel statistics-based RF beamformer F AB remains the same across multiple
coherence blocks which gives the effective instantaneous channel between BS and user k as
H
hn,g k ,e f f
hn,g
H
F AB ,
k g
(33)
with hn,gk ,e f f ∈ C NRF,g ×1 . Therefore, channel statistics-based CSI sufficiently reduces the feedback
overhead on each user, otherwise, for instantaneous CSI, each user have to send the Nt × 1 size
of channel estimate on the uplink channel. The covariance of effective channel hn,g H is given by
k ,e f f
using (13) as,
H
E{hn,gk ,e f f hn,g
H
k ,e f f
} = E{F gAB hn,gk hn,g
H
F AB }
k g
H
= νF gAB Bk,n Σ2 Bk,n
H AB
Fg (34)
The analog beamformer consists of columns of the DFT matrix, which can be easily implemented
AB can be obtained by eigenvalue decomposition of channel
by phase shifter network. Therefore, Fn,g
covariance matrix. With the group-wise hybrid beamforming, the received signal y g,n for group g in
subchannel n becomes
y gk ,n = h gHk ,n F gAB f DB
gk ,n s gk ,n + h gk ,n
H
∑ F gAB f DB
g ,n s gk ,n
k
k =k
2 34 5
Inter-user interference
+ h gHk ,n ∑ ∑ FgAB fDB
g j ,n s g j ,n + w gk ,n (36)
g =g j
2 34 5
Inter-group interference
The received signal to interference and noise ratio (SINR) at the user k in group g and subchannel
n is given by
gk ,n |
|h gHk ,n F gAB f DB 2
SI NR gk ,n = (37)
|h gHk ,n |2 ∑k =k |F gAB f DB
gk ,n | + ∑ g = g ∑ j | F g f g ,n |
2 AB DB 2 + σ2
j
R gk ,n = Ψ gk ,n log2 (1 + SI NR gk ,n ), (38)
172
Electronics 2019, 8, 1061
where Ψ gk ,n is the binary variable such that it is equal to 1 if user k is selected in group g in the
subchannel n. In order to achieve balance tradeoff between throughput and fairness [32], we use
proportional fairness (PF) based throughput maximization. We define per user proportional fairness
metric as
R gk ,n (t)
U (F gAB , f DB
gk ,n ) = , ∀n, gk , (39)
R̄ gk ,n (t)
where R̄ gk ,n (t) is average throughput (moving average) over a past window of length Tw = 1/α [33], as
The large number of antennas in massive MIMO systems enable the use of the eigenmodes of the
channel covariance matrix, i.e., Bk,n comprises of the columns of the DFT matrix [6]. DFT-based beams
with Nt = 16 and Nt = 64 are shown in Figure 4a,b, respectively.
(a) (b)
Figure 4. DFT-based beams in a 120 sector. (a) DFT-based beams in a 120 sector with Nt = 16;
(b) DFT-based beams in a 120 sector with Nt = 64.
The beam steering matrix Bk,n consists of selected columns of Nt × Nt DFT matrix Δ Nt such that
where Δ Nt = [b1 , b2 , ..., b Nt ] consisting of all eigenmodes and Υn is an Nt × r R binary beam selection
matrix, with r R is the rank of the channel covariance matrix. The selection matrix Υn ∈ C Nt ×rR
with only a single one on each row and column such that ∑i [Υn ]i,j = 1 ∀ j. Now we formulate our
optimization problem for joint spatio–radio resource allocation and precoders design with the objective
to maximize the utility function as
G Nf Kg
max
F gAB ,f DB
∑ ∑ ∑U F gAB , f DB
gk ,n (42)
g ,n
k g =1 n =1 k =1
subject to
H H
C1 : tr (FnDB F AB F AB FnDB ) ≤ Pn , ∀n
C2 : rank (F AB F DB ) ≤ NRF ,
Kg
C3 : ∑ Ψk,n ≤ Kg , ∀ g, n,
k =1
C4 : tr (Υn ) ≤ G, ∀n,
C5 : [Ψk,n ], [Υn ] ∈ {0, 1}.
173
Electronics 2019, 8, 1061
The above optimization problem is a mixed integer programming (MIP) problem with coupling
between the digital and RF precoders in the power constraint. This MIP problem is NP-hard [14].
3. Relaxed-Convex Transformation
Though the above MIP optimization problem is NP-hard, it can be transformed to a relaxed
convex optimization problem by (i) relaxing the binary integer constraints to real number between 0
and 1 [14], and (ii) decoupling the digital and analog precoders. For decoupling purpose, we make
H
use of change of variables FnDB = (F AB F AB )− 2 F̃nDB , where F̃nDB is the equivalent digital precoder [34].
1
G Nf Kg
max
F gAB ,f̃ DB
∑ ∑ ∑ Ũgk ,n F gAB , f DB
gk ,n (43)
g ,n
k g =1 n =1 k =1
subject to
DB H DB
C1 : tr (F̃n F̃n ) ≤ Pn , ∀n
C2 : rank (F̃ DB
) ≤ NRF ,
Kg
C3 : ∑ Ψk,n ≤ Kg , ∀ g, n,
k =1
C4 : tr (Υn ) ≤ G, ∀n,
C5 : 0 ≤ [Ψk,n ] ≤ 1, 0 ≤ [Υn ] ≤ 1,
For a given RF precoder F AB and the knowledge of perfect CSI at the base-station, the digital
precoder can be obtained by conventional MU-MIMO techniques, e.g., the zero-forcing and block
diagonalization [15].
For the digital precoder, we adopt the ZF precoder for no multiuser interference among the
users in each groups. The beamforming vector of user k is chosen to be orthogonal to the effective
channel vectors of all the other users in the group. Zero-forcing is a suboptimal but low complexity
approach within the linear precoders’ class. ZF precoder is asymptotically optimal among all downlink
beamforming techniques in high SNR region. It guarantees high spectral efficiency for large-scale
antennas with low-complexity linear processing [35]. For Nt Nr , it has shown that zero-forcing
beamforming can achieve up to 98% of the non-linear dirty paper coding (DPC) capacity [36].
In order to make this paper self-contained, we describe the block diagonalization briefly. Since digital
precoder is used to mitigate the multiuser interference within a groups and all groups are independent,
we omit the subscript g. First we consider the downlink transmission over one subchannel n with
the general case of BS with Nt antennas and Kn users with nk antennas each, such that ∑kK=1 nk = Nr .
The downlink channel on the subchannel n is expressed as Nr × Nt matrix,
Hn,e f f = [H1,n,e
T
f f , ..., HKn ,n,e f f ]
T T
(44)
Hk,n,e f f = [H1,n,e
T
f f , ..., HK −1,n,e f f , Hk +1,n,e f f , ..., HKn ,n,e f f ]
T T T T
(45)
Let the rank of Hk,n,e f f be denoted by rk,n , then the nullspace of Hk,n,e f f has dimension
Nt − rk,n ≥ nk . Performing the SVD of each user’s channel matrix in subchannel n leads to the following
(1) (0)
Hk,n,e f f = Uk,n Σk,n Vk,n
H
= Uk,n Σk,n [Vk,n Vk,n ] H , (46)
where Uk,n and Vk,n are the unitary matrices. The columns of Uk,n are the left singular vectors of
Hk,n,e f f , the columns of Vk,n are the right singular vectors of Hk,n,e f f , and Σk,n is a diagonal matrix in
174
Electronics 2019, 8, 1061
(1)
which the diagonal entries are the singular values of Hk,n,e f f . In the last equality of (46), Vk,n holds the
(0)
first rk,n right singular vectors of Hk,n,e f f and Vk,n contains the Nt − rk,n singular vectors of Hk,n,e f f
(0)
which are in the nullspace of Hk,n,e f f . The columns of Vk,n are best suited for user k beamforming
DB , because they will provide zero interference at other UEs. Usually V (0)
matrix Fk,n k,n contains more
(0)
number of columns than the nk , therefore we use some linear combinations of the columns of Vk,n to
make at most nk columns.
(0) Σk,n 0 (1)
(0) H
Hk,n,e f f Vk,n = Uk,n Vk,n Vk,n , (47)
0 0
(0) (0)
where Hk,n,e f f Vk,n gives the matrix with columns as the linear combinations of the columns of Vk,n .
(0)
The right hand side of the equation is the SVD of Hk,n,e f f Vk,n , where Σk,n is rk,n × rk,n diagonal
(1) (0)
matrix and represents the rk,n singular vectors with nonzero singular values of Hk,n,e f f Vk,n .
Vk,n
The Equation (47) can also be written as,
(0) (1) H
Hk,n,e f f Vk,n = Uk,n Σk,n Vk,n
(1) H (0) H
Hk,n,e f f = Uk,n Σk,n Vk,n Vk,n (48)
The transmit beamforming matrix that maximizes the user k throughput without any inter-user
interference is obtained as,
(0) (1)
DB
Fk,n = Vk,n Vk,n (49)
FnDB = [F1,n
DB
, ..., FKDB
n ,n
]P1/2
n , (50)
HB = I,
B
where Fk,n Fk,n 1 ≤ k ≤ Kn and Pn is a block diagonal matrix whose elements scale the power
allocated to each interference-free virtual subchannel for all UEs. The receive combining matrix for this
user is Uk,n [37].
In the case of single antenna users, complete diagonalization is achieved entirely at the BS by
channel inversion, i.e., FnDB = (Hn,e
H
f f ) , where ( Hn,e f f ) is the pseudo-inverse of Hn,e f f [38].
† H † H
†
FnDB = β n Hn,eH
ff
†
[f1,n
DB DB
, ..., fK,n ] = β n [h1,n , ..., hK,n ] H F AB (51)
where β n is a normalization factor chosen to satisfy the power constraint and is given by
K
β2n = (52)
F AB FnDB 2F
175
Electronics 2019, 8, 1061
PT,n 2 H AB DB DB H AB H h
K g β | h gk ,n F g F g,n F g,n F g gk ,n |
2
SI NR gk ,n = (54)
|h gHk ,n |2 ∑k =k |F gAB f DB
gk ,n | + ∑ g = g ∑ j | F g f g ,n |
2 AB DB 2 + σ2
j
G Nf Kg
Û = ∑ ∑ ∑ Ũgk ,n . (55)
g =1 n =1 k =1
4. Suboptimal Solution
Joint optimization of analog and digital beamformers is challenging because they use different
channel information for the design of analog and digital beamformers. Hybrid beamforming methods
consider decoupled designs of analog and digital beamforming to reduce the complexity of joint
optimization, but the main challenge remains the use of different channel information. To approximate
the optimal solution to this mixed integer programming problem, we summarize our proposed
algorithm below:
The analog precoder is formed by selecting K g columns of DFT matrix of eigenvectors of channel
covariance matrix R g of users’ group g in (41) to minimize the inter-group interference I g ,
where I g = ∑iG=1 ∑G
j=1 | Hi F j | . To solve the MIP problem, we divide the solution into two parts.
H AB 2
j =i
In the first part, we get the analog precoder using the selected columns of the DFT matrix which
maximize the PF sum-rate. The inherent benefit of the DFT matrix is its constant modulus which enables
the use of analog phase shifters and RF switches to realize the analog beamforming. In the second part,
for the given analog precoder, intra-group users scheduling is performed and a ZF digital precoder is
designed to maximize the sum-rate utility function. The decoupling of the analog and digital precoders
design makes the solution suboptimal but tractable [34]. The joint hybrid beamforming and user
scheduling Algorithm 1 takes K g , N f , NRF , Nt , and K. It generates the analog beamforming matrix F AB ,
digital beamforming matrix F DB , and the set of users in each group. The first part of the algorithm
(line 9 to line 19) forms DFT/eigenmodes-based analog beamforming using limited statistical CSI
feedback from the users. Beam and users pairing within each group is taken place in this part of the
algorithm. The while loop at line 12 executes till all the binary combinations in Nt × Nt are exhausted
with the condition that each column contain exactly one binary 1 and total number of 1s are equal to
the number of streams (or number of RF chains). The second part (starts from line 39) assigns the radio
resources to users to maximize the utility function.
176
Electronics 2019, 8, 1061
14: if U (Υ g ) ≥ U (Υlast
g ) then
15: Υlast
g ← Υg
16: end if
17: end while
18: end while
19: Get F gAB = Δ Nt Υlast
g
22: while k ≤ K g do
28: Kn = Kn ∪ {k ∗ }
29: K g = K g − {k ∗ }
30: U (FnDB )last = U (FnDB )updated
31: k++
32: else
33: break
34: end if
35: end while
36: b++
37: end while
38: n++
39: end while
40: Stack the beamforming matrices F DB = [F1DB , ..., F DB
N ] f
41: Output
42: F AB = [F1AB , ..., FG
AB ]
177
Electronics 2019, 8, 1061
No
while n≤Nf
Initialization
Yes
Rg at users No
while b≤NRF,g
FAB=Bk,n Yes
No
while k≤Kg
No
while g≤G
Yes
Yes
No
U(Υg) If U(FnDB)update
≥U(FnDB)last
No Yes
If U(Υg)≥ U(Υg)last
Кn=КnU{k*’}
Кg=Кg-{k*’}
Yes U(FnDB)last=U(FnDB)update
k++
U(Υg)last ← U(Υg)
b++
FgAB=ΔNtΥg(Last) n++
FDB={F1DB,…,FNfDB}
Output:
FAB={F1AB,…,FGAB}
DB
F ={F1DB,…,FNfDB}
К={К1,…,КNf}
5. Machine Learning: K-Means Based Optimal Users Grouping for Analog Beamforming
In this section, we use machine learning technique to group the users. Then, the DFT based fixed
switched-beams are used to realize the analog beamforming matrix. The joint users scheduling and
hybrid beamforming architecture with ML-based users grouping is shown in Figure 6.
178
Electronics 2019, 8, 1061
User 1
FDB NRF RF FAB Nt
Ns NRF
ZF Chains DFT
Kg Selected
users beams Υg
User K
K-Mean ML
Intra-group G groups of Kg grouping
Users scheduling users per group Location coordinates
Base station
Figure 6. Joint users scheduling and hybrid beamforming architecture with ML-based users grouping.
Machine learning algorithms can broadly be divided into two main categories, namely supervised
learning and unsupervised learning algorithms. The former class of algorithms learn by training on
the input labeled examples, called training dataset, {( x (1) , y(1) ), ( x (2) , y(2) ), ( x (3) , y(3) ), ..., ( x (m) , y(m) )},
where the ith example ( x (i) , y(i) ) consists of the ith instance of feature vector x (i) and the corresponding
label y(i) . Given a labeled training dataset, these algorithms try to find the decision boundary that
separates the positive and negative labeled examples by fitting a hypothesis to the input dataset.
Unsupervised machine learning algorithms, on the other hand, are given an unlabeled input dataset.
These algorithms are used for extracting information or features from the dataset. These features might
be related, but not confined, to the underlying structures or patterns in the input data, relationships in
data items, grouping/clustering of data items, etc. Discovered features are meant to provide a deeper
insight into the input dataset that can subsequently be exploited for achieving specific goals. Clustering
algorithms make an important part of unsupervised learning where the input examples are grouped
into two or more separate clusters based on some features. The K-Means (KM) algorithm, is probably
the most popular clustering algorithm. It is an iterative algorithm that starts with a set of initial
centroids given to it as input. During each iteration, it performs the following two steps.
1. Assign Cluster: For every user, the algorithm computes the distance between the user and every
centroid. The user is then associated to the cluster with the closest centroid. During this step,
a user might change its association from one cluster to another one.
2. Recompute centroids: Once all users have been associated to their respective cluster,
the new position of centroid for every cluster is then calculated.
Figure 7a depicts how the cluster centroids keep moving across iterations until the system
stabilizes for an example network consisting of thirty users being grouped in five clusters. The system
becomes stable in only five iterations and the final cluster layout is shown in Figure 7b.
Let us define the following notations to be used later in this section.
179
Electronics 2019, 8, 1061
It may be pointed out that Equation (57) allows us to compare multiple clustering layouts based
on their cost and select the one with the lowest cost. The above optimization objective function
constitutes a non-convex and NP-hard problem because it has many possible local minima and
integer optimization variable c(i) . The KM algorithm heuristically optimize this function by alternate
minimization method. It iterates between two steps (Assign cluster and Recompute centroids) as
described above.
(a) (b)
Figure 7. Change in position of centroids as K-Means clustering algorithm progresses. (a) shows
the transition of cluster centroids (shown as crosses) up to iteration 5, whereas, (b) shows only
the final stable state after iteration 5. In the figures, cross-sign represents cluster centroids and the
colored-circle-sign represents the user associated with the same group or cluster.
In this section, we use the KM algorithm for optimal clustering of m users competing for resources
in a particular cell. The clustering is performed based on their geographic location, thus our input
dataset {u(1) , u(2) , u(3) , ..., u(m) } has m vectors u(i) , 1 ≤ i ≤ m, consisting of location coordinates,
of ith user. For the sake of simplicity, we assume these users are deployed in a two dimensional
(i ) (i )
area, i.e., a plane and so u(i) = ( x1 , x2 ), i.e., an ordered pair of location coordinates. Our clustering
algorithm is summarized in Algorithm 2.
The proposed algorithm takes the location coordinates of m users as input. It also takes
two numbers mink and maxk as additional input. The algorithm outputs the best number of clusters,
k, such that mink ≤ k ≤ maxk , and corresponding members of each cluster. It starts with k = mink
and randomly selects k user locations as the initial centroids (line 6). It assigns the closest centroid
to each user (line 8) and then computes new centroids by calculating the center/average location of
all nodes in each cluster (line 11). So, in effect, the location of centroids keeps moving in successive
iterations. It repeats the above two steps until the change in centroid positions is zero or negligible. We
repeat the test maxt times with a new set of randomly chosen initial centroids every time. During every
test, the discovered centroids, corresponding centroid assignment to users, and the cost are saved
(lines 14–16) for later comparison. After running the loop for maxt times, we select and store the best
k centroids resulting from the test with the lowest cost while discarding the remaining (lines 19–21).
180
Electronics 2019, 8, 1061
The same is repeated for the next value of k, i.e., k = k + 1, until k > maxk . At the end we have
cnt = maxk − maxk vectors μk , one for each value of k, the corresponding assignment vector ak and
cost ck . Finally, we choose the vector μ having the lowest cost and corresponding assignment vector
a among cnt stored cases. That is the best number of clusters and corresponding centroids that the
algorithm found.
3: cnt = cnt + 1
4: for t = 1 : maxt do
5: repeat
After the groups formation, BS sends this information to all users, where users use this information
to form reduced average statistical CSI. For example, a user in a group of 5, needs to send the average
statistical CSI only after 1/5 of regular feedback interval time.
6. Simulation Results
Consider the downlink of a multiuser massive MIMO single cell with three 120 degree sectors.
We neglect inter-sector interference and focus on a single 120 degree sector served by a ULA of Nt = 64
isotropic antennas at BS. The users grouping forms virtual sectors inside 120 physical sector.
181
Electronics 2019, 8, 1061
In simulation, the results are obtained by averaging over 100 drops. In each drop we randomly
generate spatial correlation matrices R g . For each realization of spatial correlation matrix R g ,
we simulate 1000 realizations of instantaneous channel H.
The joint spatio–radio scheduling and hybrid precoder scheme first forms the users groups and
then selects the beams that maximizes the sum-rate through downlink training process. Secondly,
it calculates the ZF based digital precoder using low dimensional effective channel feedback from
the users.
Figure 8 shows the CDF of the non-zero eigenvalues of channel covariance matrix. Notice that
approximately 50% of the non-zero eigenvalues are close to zero. The sum-rate increases as the number
of groups increases at the cost of increased feedback overhead as shown in Figure 9. Using machine
learning technique in Section 5 we can get optimal number of groups from channel covariance feedback.
This results in increased sum-rate with substantial reduced feedback. The optimal G = 3 gives 27.6%
increase in sum-rate compared to when G = 1 and 62.5% decrease in feedback overhead compared
to G = 8. The comparison of performance of ML-based users grouping with previous work cannot
be provided because there is no previous work that uses ML-based technique to reduce the CSI
feedback overhead in massive MIMO systems. Many papers use users grouping in massive MIMO
hybrid beamforming [3,5,39,40], but they do not utilize ML-based users grouping. Therefore, we have
compared our proposed solution with two benchmarks of full-CSI (G = K) and coarse-CSI (G = 1).
Figure 10 shows sum-rate with number of users at 10dB SNR. For a fixed number of groups G = 3,
the increase in number of users, increases number of users per group. Due to the fixed number of
groups, the feedback overhead remains the constant. Sum-rate is increasing with users because we
assumed NRF = Ns = K. If we fix the number of RF chains to some hardware limit, then the sum-rate
will saturate at specific number of users. It can be seen in Figure 10, that increasing number of users
per group decreases the slop of the sum-rate for limited CSI schemes. This decrease is due the increase
in intra-group interference.
Sum-rate also depends on the number of RF chains but this dependence is not linear as shown in
Figure 11. This figure shows sum-rate variation with number of RF chains NRF when Ns = 8, K = 8,
Nt = 64, and SNR = 10 dB. Sum rate increases with number of RF chains because it yields better
conditioned effective channel matrix. It can be seen that the spectral efficiency does not increase
monotonically with NRF and saturates at NRF = Nt where hybrid precoding is turned to the pure
digital precoding. The increase in spectral efficiency with the number of RF chains comes at the cost of
higher dimensional effective channel feedback overhead and power consumption in RF chains.
The spectral efficiency of the proposed scheme also varies with number of transmit antennas as
shown in Figure 12. In the figure, NRF = Ns = K = 8, SNR = 10 dB, and BS has 16, 64, 128 or 256 ULA
antennas. The performance gain increases with the increase in number of transmit antennas because
large antennas array increases the resolution of the transmit beams (also depicts in figure 4) and, hence,
decreases the potential of inter-beams interference.
In general, the spectral efficiency is a function of SNR and for the SNR = 10 dB, our ML-based
users grouping and hybrid beamforming scheme gives 27.6% increased sum-rate at the cost of 33.3%
extra feedback overhead as compared to the coarse-CSI case (G = 1). Our proposed scheme incurs 62.5%
reduced feedback at the cost of 25.2% reduction in sum-rate as compared to the full-CSI case (G = K).
182
Electronics 2019, 8, 1061
(PSLULFDO&')
&')RI(LJHQYDOXHV
(LJHQYDOXHV
*
* .
RSWLPDO*
6XPUDWH ESV+]
615 G%
2SWLPDOJURXSLQJ/LPLWHG&6,
1RJURXSLQJ)XOO&6,
2QHJURXS/LPLWHG&6,
6XPUDWH ESV+]
8VHUV
Figure 10. Sum-rate Vs number of users with different number of groups and CSI, SNR = 10 dB.
183
Electronics 2019, 8, 1061
*
* .
RSWLPDO*
6XPUDWH ESV+]
1XPEHURI5)FKDLQV15)
Figure 11. Sum-rate Vs number of RF chains. Number of users K = 8, number of transmit antennas
Nt = 64, and SNR = 10 dB.
* ZLWK615 G%
* .ZLWK615 G%
6XPUDWH ESV+]
RSWLPDO*ZLWK615 G%
* .ZLWK615 G%
RSWLPDO*ZLWK615 G%
* ZLWK615 G%
1XPEHURIWUDQVPLWDQWHQQDV
Figure 12. Sum-rate Vs number of transmit antennas. The number of RF chains at BS is fixed at 8,
number of users is 8 and the results are obtained for SNR = 10 dB and SNR = 20 dB.
7. Conclusions
This paper studied the limited feedback two-stage hybrid beamformimg for decomposing the
precoding matrix at the base-station. The huge channel state information feedback of massive MIMO
has been reduced by the channel covariance-based RF precoding and beam selection. The well-known
regularized block diagonalization can mitigate the inter-group interference, but requires substantial
feedback. We used K-mean algorithm based unsupervised machine learning technique for users
grouping and channel covariance-based eigenmodes/discrete Fourier transforms to reduce the
feedback overhead and designed a simplified analog precoder. The digital precoder is designed with
joint optimization of intra-group user utility function. It has been shown that more than 50% feedback
overhead is reduced by the eigenmodes-based analog precoder design. The spatio–radio resources
scheduling and limited feedback-based hybrid precoding increases the sum-rate by 27.6% compared to
the sum-rate of one-group case at the cost of 33.3% extra feedback overhead, and reduces the feedback
overhead by 62.5% at the cost of 25.2% reduction in sum-rate, compared to the full CSI feedback.
Author Contributions: H.K. and I.A., contributed the key idea and defined the main system model. G.B. and
M.A. assisted with the system model and the mathematical analysis. I.A. developed the problem formulation.
184
Electronics 2019, 8, 1061
G.B. and H.K. developed the model for users grouping. H.K., I.A., G.B. and M.A. analysed all the results and
added the relevant discussions. All authors contributed to the paper write up.
Funding: This research was supported by King Abdul Aziz City for Science and Technology Project under Grant
PC-37-66.
Conflicts of Interest: The authors declare no conflict of interest.
Abbreviations
The following abbreviations are used in this manuscript:
References
1. Adhikary, A.; Nam, J.; Ahn, J.Y.; Caire, G. Joint spatial division and multiplexing-The large-scale array regime.
IEEE Trans. Inf. Theory 2013, 59, 6441–6463. [CrossRef]
2. Kutty, S.; Member, S.; Sen, D. Beamforming for millimeter wave communications: An inclusive survey. IEEE
Commun. Surv. Tutor. 2016, 18, 949–973. [CrossRef]
3. Nam, J.; Adhikary, A.; Ahn, J.Y.; Caire, G. Joint Spatial Division and Multiplexing: Opportunistic
Beamforming, User Grouping and Simplified Downlink Scheduling. IEEE J. Sel. Top. Signal Process. 2014,
8, 876–890. [CrossRef]
4. Adhikary, A.; Al Safadi, E.; Samimi, M.; Wang, R.; Caire, G.; Rappaport, T.; Molisch, A. Joint Spatial Division
and Multiplexing for Wave Channels. IEEE J. Sel. Areas Commun. 2014. [CrossRef]
5. Li, Z.; Han, S.; Sangodoyin, S.; Wang, R.; Molisch, A.F. Joint Optimization of Hybrid Beamforming for
Multi-User Massive MIMO Downlink. IEEE Trans. Wirel. Commun. 2018, 1276, 1–14. [CrossRef]
185
Electronics 2019, 8, 1061
6. Li, Z.; Han, S.; Molisch, A.F. Optimizing Channel-Statistics-Based Analog Beamforming for Millimeter-Wave
Multi-User Massive MIMO Downlink. IEEE Trans. Wirel. Commun. 2017, 16, 4288–4303. [CrossRef]
7. Ratnam, V.; Molisch, A.; Bursalioglu, O.Y.; Papadopoulos, H.C. Hybrid Beamforming with Selection for
Multi-user Massive MIMO Systems. IEEE Trans. Signal Process. 2018, 66, 4105–4121. [CrossRef]
8. Liu, A.; Lau, V. Phase only RF precoding for massive MIMO systems with limited RF chains. IEEE Trans.
Signal Process. 2014, 62, 4505–4515. [CrossRef]
9. Li, Z.; Han, S.; Molisch, A.F. User-Centric Virtual Sectorization for Millimeter-Wave Massive MIMO
Downlink. IEEE Trans. Wirel. Commun. 2018, 17, 445–460. [CrossRef]
10. Ahmed, I.; Khammari, H.; Shahid, A.; Musa, A.; Kim, K.S.; Poorter, E.D.; Moerman, I. A Survey on Hybrid
Beamforming Techniques in 5G: Architecture and System Model Perspectives. IEEE Commun. Surv. Tutor.
2018, 20, 3060–3097. [CrossRef]
11. Park, S.; Alkhateeb, A.; Heath, R.W. Dynamic Subarrays for Hybrid Precoding in Wideband mmWave
MIMO Systems. IEEE Trans. Wirel. Commun. 2017, 16, 2907–2920. [CrossRef]
12. Zhu, D.; Li, B.; Liang, P. A Novel Hybrid Beamforming Algorithm with Unified Analog Beamforming by
Subspace Construction Based on Partial CSI for Massive MIMO-OFDM Systems. IEEE Trans. Commun. 2017,
65, 594–607. [CrossRef]
13. Jeon, Y.; Song, C.; Lee, S.R.; Maeng, S.; Jung, J.; Lee, I. New Beamforming Designs for Joint Spatial Division
and Multiplexing in Large-Scale MISO Multi-User Systems. IEEE Trans. Wirel. Commun. 2017, 16, 3029–3041.
[CrossRef]
14. Jiang, Z.; Chen, S.; Zhou, S. Joint User Scheduling and Beam Selection Optimization for Beam-Based Massive
MIMO Downlinks. IEEE Trans. Wirel. Commun. 2018, 17, 2190–2204. [CrossRef]
15. Ahmed, I.; Khammari, H.; Shahid, A. Resource allocation for transmit hybrid beamforming in decoupled
millimeter wave multiuser-MIMO downlink. IEEE Access 2017, 5, 170–182. [CrossRef]
16. O’Shea, T.J.; Hoydis, J. An Introduction to Deep Learning for the Physical Layer. IEEE Trans. Cogn. Commun.
Netw. 2017, 3, 563–575. [CrossRef]
17. Kim, K.; Lee, J.; Choi, J. Deep Learning Based Pilot Allocation Scheme (DL-PAS) for 5G Massive
MIMO System. IEEE Commun. Lett. 2018, 22, 828–831. [CrossRef]
18. Wen, C.K.; Shih, W.T.; Jin, S. Deep Learning for Massive MIMO CSI Feedback. IEEE Wirel. Commun. Lett.
2018, 1–4. [CrossRef]
19. Joung, J. Machine Learning-Based Antenna Selection in Wireless Communications. IEEE Commun. Lett.
2016, 20, 2241–2244. [CrossRef]
20. Wang, J.B.; Wang, J.; Wu, Y.; Wang, J.Y.; Zhu, H.; Lin, M.; Wang, J. A Machine Learning Framework for
Resource Allocation Assisted by Cloud Computing. IEEE Netw. 2018, 32, 144–151. [CrossRef]
21. Calabrese, F.D.; Wang, L.; Ghadimi, E.; Peters, G.; Soldati, P. Learning Radio Resource Management in 5G
Networks: Framework, Opportunities and Challenges. arXiv 2016, arXiv:1611.10253.
22. Ahmed, I.; Khammari, H. Joint Machine Learning based Resource Allocation and Hybrid Beamforming
Design for Massive MIMO Systems. In Proceedings of the IEEE Globecom Workshops (GC Wkshps),
Abu Dhabi, UAE, 9–13 December 2018; pp. 1–6.
23. Rahman, M.S.; Josiam, K. Low complexity RF beam search algorithms for millimeter-wave systems.
In Proceedings of the 2014 IEEE Global Communications Conference (GLOBECOM 2014), Austin, TX, USA,
8–12 December 2014; pp. 3815–3820. [CrossRef]
24. Almers, P.; Bonek, E.; Burr, A.; Czink, N.; Debbah, M.; Degli-Esposti, V.; Hofstetter, H.; Kyölsti, P.; Laurenson,
D.; Matz, G.; et al. Survey of channel and radio propagation models for wireless MIMO systems. Eurasip J.
Wirel. Commun. Netw. 2007, 2007, 019070. [CrossRef]
25. Andrews, J.G.; Bai, T.; Kulkarni, M.; Alkhateeb, A.; Gupta, A.; Heath, R.W. Modeling and Analyzing
Millimeter Wave Cellular Systems. IEEE Trans. Commun. 2017, 65, 403–430. [CrossRef]
26. Ayach, O.E.; Rajagopal, S.; Abu-Surra, S.; Pi, Z.; Heath, R.W. Spatially sparse precoding in millimeter wave
MIMO systems. IEEE Trans. Wirel. Commun. 2014, 13, 1499–1513. [CrossRef]
27. Sohrabi, F.; Yu, W. Hybrid digital and analog beamforming design for large-scale MIMO systems.
In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),
Brisbane, Australia, 19–24 April 2015; pp. 2929–2933. [CrossRef]
186
Electronics 2019, 8, 1061
28. Ayach, O.E.; Heath, R.W.; Abu-Surra, S.; Rajagopal, S.; Pi, Z. Low complexity precoding for large millimeter
wave MIMO systems. In Proceedings of the IEEE International Conference on Communications (ICC),
Ottawa, ON, Canada, 10–15 June 2012; pp. 3724–3729. [CrossRef]
29. Rappaport, T.S.; Gutierrez, F.; Ben-Dor, E.; Murdock, J.N.; Qiao, Y.; Tamir, J.I. Broadband millimeter-wave
propagation measurements and models using adaptive-beam antennas for outdoor Urban cellular
communications. IEEE Trans. Antennas Propag. 2013, 61, 1850–1859. [CrossRef]
30. Nam, J.; Ahn, J.Y.; Adhikary, A.; Caire, G. Joint spatial division and multiplexing: Realizing massive MIMO
gains with limited channel state information. In Proceedings of the 2012 46th Annual Conference on
Information Sciences and Systems (CISS 2012), Princeton, NJ, USA, 21–23 March 2012; Volume 1. [CrossRef]
31. Lu, C.; Wang, W.; Zhong, W.; Gao, X. User scheduling and beam allocation for massive MIMO systems with
two-stage precoding. In Proceedings of the IEEE International Symposium on Personal, Indoor and Mobile
Radio Communications (PIMRC), Valencia, Spain, 4–8 September 2016. [CrossRef]
32. Andrews, M. A survey of scheduling theory in wireless data networks. In Wireless Communications; Springer:
New York, NY, USA, 2007; pp. 1–17.
33. Escudero-Garzas, J.J.; Devillers, B.; Garcia-Armada, A. Fairness-Adaptive Goodput-Based Resource
Allocation in OFDMA Downlink with ARQ. IEEE Trans. Veh. Technol. 2014, 63, 1178–1192. [CrossRef]
34. Alkhateeb, A., Jr.; Heath , R.W. Frequency Selective Hybrid Precoding for Limited Feedback Millimeter
Wave Systems. IEEE Trans. Commun. 2016, 64, 1801–1818. [CrossRef]
35. Kang, J.; Lee, J.H.; Choi, W. Dimension-optimized Two-Stage Precoder Design for Massive MIMO Systems
with Limited Feedback. In Proceedings of the IEEE Military Communications Conference MILCOM,
Los Angeles, CA, USA, 29–31 October 2018; pp. 530–535. [CrossRef]
36. Tataria, H.; Smith, P.J.; Greenstein, L.J.; Dmochowski, P.A.; Shafi, M. Performance and analysis of downlink
multiuser MIMO systems with regularized zero-forcing precoding in Ricean fading channels. In Proceedings
of the IEEE International Conference on Communications, Kuala Lumpur, Malaysia, 22–27 May 2016;
pp. 1–7.
37. Ho, W.; Liang, Y. Optimal Resource Allocation for Multiuser MIMO-OFDM Systems With User Rate
Constraints. IEEE Trans. Veh. Technol. 2009, 58, 1190–1203. [CrossRef]
38. Spencer, Q.; Swindlehurst, A.; Haardt, M. Zero-forcing methods for downlink spatial multiplexing in
multiuser MIMO channels. IEEE Trans. Signal Process. 2004, 52, 461–471. [CrossRef]
39. Li, Z.; Han, S.; Molisch, A.F. Hybrid beamforming design for millimeter-wave multi-user massive MIMO
downlink. In Proceedings of the 2016 IEEE International Conference on Communications (ICC 2016), Kuala
Lumpur, Malaysia, 22–27 May 2016. [CrossRef]
40. Kang, J.; Lee, J.H.; Choi, W. Machine Learning-Based Dimension Optimization for Two-Stage Precoder in
Massive MIMO Systems with Limited Feedback. Appl. Sci. 2019, 9, 2894. [CrossRef]
© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access
article distributed under the terms and conditions of the Creative Commons Attribution
(CC BY) license (http://creativecommons.org/licenses/by/4.0/).
187
electronics
Article
Evaluation of Multi-Beam Massive MIMO
Considering MAC Layer Using IEEE802.11ac
and FDD-LTE
Fumiya Muramatsu 1 , Kentaro Nishimori 1, *, Ryotaro Taniguchi 1 , Takefumi Hiraguri 2
and Tsutomu Mitsui 2
1 Graduate School of Science and Technology, Niigata University, Niigata-shi, Niigata 950-2102, Japan;
[email protected] (F.M.); [email protected] (R.T.)
2 Faculty of Fundamental Engineering, Nippon Institute of Technology, 4-1 Gakuendai, Miyashiro-machi,
Minamisaitama-gun, Saitama 345-8501, Japan; [email protected] (T.H.); [email protected] (T.M.)
* Correspondence: [email protected]; Tel.: +81-25-262-7431
Keywords: massive MIMO; analog multi-beam; hybrid beam-forming; PHY layer; MAC layer
1. Introduction
Cellular network data traffic volumes have increased significantly with the advent of smart
devices. This trend will continue to grow as Internet of Things (IoT) equipment and big-data services
become more common. To meet this demand, recent research and development have focused on
achieving a 20-Gbps or more standard for future wireless communication [1,2].
Multiple-input multiple-output (MIMO) systems can be used to improve the transmission rate
within a limited frequency band [3]. To this end, multi-user MIMO (MU-MIMO) has been developed
to enable MIMO transmission to multiple users [4]. MU-MIMO technologies have been standardized
for LTE-advanced and IEEE802.11ac [5,6]. Accordingly, massive MIMO is viewed as a fifth-generation
(5G) technology, and it is expected to play an important role in achieving 5G targets [1,7].
In a MIMO system, a base station (BS) acquires channel state information (CSI) between itself and
a user terminal (UT) [8]. As CSI acquisition under massive MIMO involves numerous UT–BS antenna
pair channels, one of the most important challenges in implementing it is the acquisition of large
amounts of CSI with a small overhead. To achieve this, it is necessary to evaluate not only the physical
(PHY) layer but also the medium access control (MAC) layer. The MAC layer is a communication
protocol of a part of the data link layer corresponding to the second layer of the OSI reference model in
IEEE802, and it is located in a layer that is one layer above the physical layer.
Implicit beamforming (BF) has been proposed as an approach for eliminating CSI feedback in
MU-MIMO systems [9]. However, the communication efficiency of short-packet communication
decreases even if implicit BF is applied in a massive MIMO system [10].
CSI estimation and initial UT tracking, which require a significant amount of overhead,
are necessary in hybrid BF massive MIMO systems [1]. A hybrid BF approach referred to as multi-beam
massive MIMO transmission has been proposed to solve this problem. CSI estimation is unnecessary
in this approach. The effectiveness of multi-beam massive MIMO has been demonstrated through
computer simulation [11–13]. In multi-beam massive MIMO systems, a number of analog multi-beams
are created for initial UT tracking and a subset of these beams with high received power are selected
for use. As multi-beams have narrow beam widths, interference signals can be mitigated and residual
interference can be cancelled by applying a blind array algorithm to digital beam components using
only the information pertaining to a received signal [14–16]. This paper proposes a multi-beam
massive MIMO approach that achieves high transmission efficiency and flexible communication using
asynchronous UT transmission. As the proposed system does not require timing synchronization
among UTs, all UTs can freely transmit signals while avoiding collision between their signals.
The diminished signal transmission time resulting from this configuration reduces the total power
consumption by the UTs.
In our previous studies, we proposed multi-beam massive MIMO beam-selection methods [13]
that utilized received-signal information such as the power difference and amplitude correlation
between beams. This approach was capable of appropriately performing beam-selection at a high
signal-to-interference noise ratio (SINR). Furthermore, this method could be performed through
signal processing using a simple configuration and was highly suitable for hybrid analog-digital
massive MIMO.
Channel quality indicator (CQI) values, which can serve as an index of modulation scheme
determination, cannot be obtained under multi-beam massive MIMO because it does not perform
CSI estimation. In this paper, we propose a simple adaptive modulation scheme for multi-beam
massive MIMO transmission based on amplitude correlation and received power. Under this method,
an appropriate modulation scheme can be simply determined based on the relationship between
amplitude correlation and the SINR. We assessed the proposed method by evaluating the transmission
rates it achieved using the IEEE802.11ac and frequency division duplex (FDD)-LTE standards.
In addition, we evaluated the throughput of the method under the MAC layer of each standard
to consider overhead.
The rest of this paper is organized as follows: The general massive MIMO transmission approach
is described in Section 2. In Section 3, we describe the use of amplitude correlation for modulation and
propose a simple adaptive modulation scheme for multi-beam massive MIMO. In Section 4, we present
our simulation model and describe its application in evaluating simple adaptive modulation schemes.
Then, we describe the results of the performance evaluation of the proposed multi-beam massive MIMO
method through the computer simulation of the IEEE802.11ac and FDD-LTE standard environments.
190
Electronics 2019, 8, 225
Multi-beam massive MIMO eliminates the requirement for CSI estimation by forming
numerous narrow directional beams and spatially separating the signals of each UT. As a result,
the synchronization between a BS and a UT, which is required for CSI estimation, becomes unnecessary.
Under standard MU-MIMO, UTs must transmit signals to a BS simultaneously to employ CSI feedback
prior to communication. As multi-beam massive MIMO does not require CSI feedback, a UT can
communicate with a BS at any time.
Figure 2 shows the configuration of analog multi-beam massive MIMO under uplink
communication, along with its 16 multi-beam pattern. The proposed system generates N orthogonal
multiple beams in the analog component and uses a butler matrix circuit to achieve multi-beam
formation [23,24]. The butler matrix circuit, which was introduced in [24], forms an orthogonal beam
using a Fast Fourier Transform (FFT) on a spatially arranged array. This system is also effective from
the viewpoint of the pilot contamination problem in the multi-cell scenario because of the signals by
multiple UTs can be received without the CSI estimation by the multi-beam circuit.
Figure 1. Uplink configurations of typical analog–digital hybrid (left) and full-digital (right)
beamforming (BF) massive multiple-input multiple-output (MIMO) systems.
Figure 2. Uplink configuration of multi-beam massive MIMO in uplink and 16 multi-beam pattern.
191
Electronics 2019, 8, 225
The proposed method performs beam selection using an amplitude detector and a processor.
Through beam selection, it is possible to spatially separate a specific UT using beam directivity. However,
interference signals from the other UTs, which are also received by side lobes, cannot be rejected
through only the multi-beam forming network. A method that uses only amplitude information, such as
amplitude correlation, must be used to achieve beam selection with low interference power.
The system developed in this study applies robust independent component analysis (ICA) using
a blind adaptive algorithm [16,25]. ICA is a blind signal processing technique that is commonly used
in image processing and medicine. In this technique, an observed random vector is decomposed into
statistically independent variables [16]. ICA does not require CSI because it utilizes only received
signals. The use of ICA allows for a hybrid configuration for efficient transmission in massive MIMO
systems in which multi-beams are applied to analog components and a blind algorithm is applied to
digital components.
N
∑ ( xi − x )(yi − y)
i =1
ρ( x, y) = 7 7 , (1)
N N
∑ ( x i − x )2 ∑ ( y i − y )2
i =1 i =1
192
Electronics 2019, 8, 225
Amplitude correlation
0.8
0.6
0.4
0.2
0
-2 0 2 4 6 8 10 12
SINR [dB]
Figure 3. Relationship between amplitude correlation and signal-to-interference noise ratio (SINR).
4. Computer Simulation
The results of the computer simulation of the proposed system were verified in terms of the
Rayleigh fading and angular spread of quadrature amplitude modulation (QAM) signals [26,27].
The simulation conditions are listed in Table 1. Figure 4 shows the simulation model, which is a
scattering ring model [27] with a specific angular spread. The model applies 101 paths per UT with an
angular spread of 1.0 degree. We assume flat fading in narrowband signals and do not consider delay
spread. Sixty-four elements are arranged in the horizontal direction at intervals of 0.5 wavelengths,
and beam width is approximately 1.60 degrees. As the assumed transmission method involves a
single carrier for narrowband signals, delay waves have no influence on signal separation. The signals
for each sub-carrier are regarded as narrowband signals to apply them as actual broadband signals
through orthogonal frequency division multiplexing. In the UT distribution, the center direction of
the BS is set as 0 degrees and individual UTs are placed at random angles within a range of −60 to
60 degrees. A constant distance is set between the BS and all UTs so that a signal-to-noise ratio (SNR)
of 20 [dB] per UT is received at the BS.
193
Electronics 2019, 8, 225
First, the processing of the uplink was simulated transmitting QPSK signals from all 20 UTs to the
BS, which recorded the amplitude correlation and received power for each beam. On the downlink,
the BS transmitted QPSK 1024-QAM signals to eight users with high reception power. Then, the UTs
received signals from the BS calculated bit error rate (BER). We assumed that a modulation scheme
yielding a BER of less than 10−2 could be applied [28,29]. We evaluated the applicable modulation
schemes in terms of the SINR and received power recorded on the uplink to simulate an adaptive
modulation scheme.
100
0.7
Amplitude correlation
80
0.6
CDF [%]
60
0.5
0.4 Median
40
16 QAM
32 QAM
Standard 20
0.3 deviation
64 QAM
128 QAM
0.2 0
0 2 4 6 8 10 20 22 24 26 28 30 32 34
Modulation order [bits/symbol] Received power [dB]
Figure 5. Relationships between applicable modulation scheme and amplitude correlation (left) and
received power (right).
194
Electronics 2019, 8, 225
4.2. IEEE802.11ac
We evaluated the indicator shown in Table 2 and the downlink effectiveness of multi-beam
massive MIMO transmission based on the IEEE802.11ac procedures [5,30]. The simulation conditions
and model were the same as those used in Section 4. Equation (2) was used as an index of evaluation
based on the BER calculated on the UTs.
M (1 − BER) (BER ≤ 10−2 )
R= [bit/symbol], (2)
0 (otherwise)
where M denotes the number of bits per symbol and R denotes the number of bits per symbol obtained
independently of the coding rate.
The assessment was divided into parts, i.e., the evaluation of the PHY layer without considering
the overhead arising from the control signal and the evaluation of the MAC layer considering
overhead. Table 3 shows the relationship between the MCS index and the transmission rate under the
IEEE802.11ac 20-[MHz] operation [5]. The values of R are obtained by multiplying the modulation
order by the coding rate, and they are equivalent to R, which is the number of bits per symbol.
Accordingly, it was assumed that an MCS index that satisfies R ≥ R could be applied to the R
obtained on the UTs. Based on this, we could evaluate the transmission rate corresponding to the
MCS index.
Table 3. Relationship between MCS index and transmission rate under IEEE802.11ac at 20 MHz.
In the MAC layer, throughput is calculated according to the downlink MU-MIMO procedure
shown in Figure 6 [5]. As seen in the figure, the BS first transmits request signals to all UTs to obtain
channel information. Once the BS has obtained channel feedback from each UT data transmission
signal, data reception processing is performed for all UTs. Throughput is obtained by dividing the
data size of frame aggregation, which was set as 63,000 bytes in this evaluation, by the time required
by the processes. As multi-beam massive MIMO does not require procedures such as CSI estimation
or feedback, we assumed that no time other than that necessary for data transmission was required.
195
Electronics 2019, 8, 225
Propagation channel
estimation request CSI feedback Data transmission Confirm data reception
DIFS
+ Backoff
BS
NDPA NDP BRP BRP DATA BAR BAR
UT#1 BR BA
BR BA
UT#2
⋮ ⋮ ⋮ ⋮
BR BA
UT#NU
NDPA : Null Data Packet Announcement, NDP : Null Data Packet, BR : Beamforming Report,
BRP : BR Poll, BA: Block ACK, BAR: BA Request, DIFS: DCF Inter Frame Space
We evaluated the performance of a typical hybrid BF massive MIMO and compared it with
those of the sub-array and full-digital BF configurations discussed in Section 2. Table 4 shows the
precoding and decoding methods used under each configuration in the downlink. For the hybrid
BF, we assumed that maximum ratio combining (MRC) was performed in the analog component
using a phase shifter [31]. In the digital component, the minimum mean square error (MMSE) weight
calculated in the uplink was used as a precoder [31]. Eight sub-arrays (NK = 8) and eight sub-array
elements (NL = 8) were utilized, resulting in 64 elements. In the full-digital BF, transmission precoding
was performed using a block diagonalization (BD) method with 64 elements [32].
Figure 7 shows the cumulative distribution function (CDF) characteristics of transmission rate
and throughput. The adaptive modulation results show the characteristics obtained for each trial
when the highest modulation order is selected in the modulation scheme in which the BER is 10−2
or less. The proposed modulation results are obtained using the characteristics based on Table 2.
At the assumed transmission rate, the full-digital BF configuration achieves 86.7 [Mbps], which is
the maximum transmission rate obtainable under IEEE802.11ac, because it scans a sharp beam width
using 64 elements. The median value of multi-beam massive MIMO is approximately 70 [Mbps], and it
is asymptotic to the ideal characteristic obtained by utilizing the proposed simple adaptive modulation.
The hybrid BF configuration characteristic achieves a low modulation order because beam width
is widened when there are eight elements per sub-array. The full-digital configuration can achieve
a throughput of only approximately 40 [Mbps] because of the overhead of channel estimation and
feedback obtained when there are 64 antenna elements. In hybrid BF, overhead occurs owing to eight
elements, which is the number of sub-arrays. As shown in Figure 6, throughput decreases when the
transmission rate of even one UT is slow because IEEE802.11ac requires the synchronization of all UTs,
as shown in Figure 6. This reduces the throughput characteristic of the hybrid BF. The multi-beam
massive MIMO characteristic is the best among the three configurations because it does not require
synchronization and overhead and can effectively utilize the transmission rate obtained in the
PHY layer.
196
Electronics 2019, 8, 225
100 100
Multi-beam
( adaptive modulation)
80 80
Multi-beam
CDF [%]
CDF [%]
( proposed modulation)
60 60
Hybrid BF
( adaptive modulation)
40 40
Full-digital BF
( adaptive modulation)
20 20
0 0
0 20 40 60 80 100 0 20 40 60 80 100
Transmission rate [Mbps] Throughput [Mbps]
4.3. FDD-LTE
We evaluated the proposed method under FDD-LTE (Release 15) [6].We assumed a bandwidth
of 20 [MHz] per UT and 100 resource blocks (RBs). Table 5 shows the relationship between the MCS
index and the transmission rate in LTE when there is one antenna. The transport block size (TBS) is
based on [6], and the transmission rate is calculated based on the efficiency per bandwidth. The TBS
corresponds to an actual data component divided into RBs and transmitted, while user data indicate
the resource size for a given control signal. As in the IEEE802.11ac simulation, we compared the
values of R and R produced by the respective approaches and evaluated the results in terms of the
corresponding transmission rates.
Table 5. Relationship between MCS index and transmission rate under frequency division duplex
(FDD)-LTE (100 RBs, 20 MHz).
I MCS Modulation TBS User Data Coding Rate R [bits/symbol] Transmission Rate [Mbps]
0 QPSK 2792 30000 0.094 0.188 7.22
1 QPSK 4548 30000 0.154 0.308 9.61
2 QPSK 7224 30000 0.242 0.484 21.7
.. .. .. .. .. ..
. . . . . .
27 256QAM 84760 120000 0.707 5.656 127.0
The throughput in LTE, which is denoted here by T, can be calculated as follows: [6,33].
T = TBS × Nstr
Nsubc × Nslot × Nsym × M × Nstr × CR [bps],
where Nsubc is the number of sub-carriers, Nslot is the number of slots per second, Nsym is the number
of symbols per slot, Nstr is the number of streams, and CR is the coding rate.
We apply Equation (3) to consider control signals because Equation (3) cannot express the number
of control signals that can be eliminated under multi-beam massive MIMO.
Userdata
T = TBS × × Nsubf × Nstr [bps], (3)
Userdata
where Nsubf is the number of sub-frames per second. The value of Userdata is obtained by increasing or
decreasing the number of control signals from secured Userdata, which is defined in [6]. For example,
when I MCS = 27 in Table 1, Userdata is obtained as 120,000 [bits]. This corresponds to a Userdata of
128,000 [bits] in the multi-beam massive MIMO, which is obtained by removing the control signal from
Userdata. As there is no complete definition of a control signal in the case of 64 elements, we assumed
that one antenna port would be added to handle the additional control signal required each time the
number of antennas was doubled. Therefore, Userdata was 83,200 [bits] for 64 antennas under the
full-digital configuration.
197
Electronics 2019, 8, 225
Figure 8 shows the relationship between the transmission rate and throughput based on
Equation (3). We used this figure to evaluate throughput based on overhead for each configuration.
100
Overhead-less
Throughput [Mbps]
80 NT = 1
NT = 8
NT = 64
60
40
20
0
0 20 40 60 80 100 120
Transmission rate [Mbps]
Figure 8. Transmission rate versus throughput from Equation (3).
Figure 9 shows the CDF characteristics of the transmission rate and throughput under FDD-LTE.
The modulation scheme for each characteristic is determined in the same manner as that used in
the evaluation under IEEE802.11ac. The transmission rate follows the same trend as that under
IEEE802.11ac. However, unlike IEEE 802.11ac, a high transmission rate can be achieved even with
a low modulation order. This suggests that the performance of the hybrid BF approaches that of
multi-beam massive MIMO. An evaluation of throughput based on Figure 8 reveals that it increases
linearly with transmission rate under all configurations. Given that multi-beam massive MIMO has
the lowest number of control signals of all configurations, it achieves the highest throughput at a given
transmission rate. This verifies the effectiveness of multi-beam massive MIMO, which can effectively
utilize the transmission rate in the PHY layer.
100 100
Multi-beam
( adaptive modulation)
80 80
Multi-beam
CDF [%]
CDF [%]
( proposed modulation)
60 60
Hybrid BF
( adaptive modulation)
40 40
Full-digital BF
( adaptive modulation)
20 20
0 0
0 20 40 60 80 100 120 0 20 40 60 80 100
Transmission rate [Mbps] Throughput [Mbps]
5. Conclusions
In this paper, we proposed a simple adaptive modulation method for multi-beam massive
MIMO and evaluated its performance when implemented in a MAC layer. The effectiveness of the
proposed transmission method was validated through computer simulation. The performance of the
method asymptotically approached the ideal characteristic because of the implementation of simple
adaptive modulation using amplitude correlation and received power. The proposed method was also
shown to achieve higher throughput characteristics compared to a hybrid approach with sub-arrays
and a full-digital beam-forming configuration under the IEEE802.11ac and FDD-LTE environments.
Future works include the further theoretical analysis and experimental evaluation using the analog
multi-beam by Butler matrix circuit.
Author Contributions: This study was led by F.M. while K.N., R.T., T.H. and T.M. assisted with the
computer simulations.
198
Electronics 2019, 8, 225
Funding: This work was partially supported by the SCOPE #165004002, #185004002 and KAKENHI, Grant-in-Aid
for Scientific Research (B) (17H01738, 17H03262).
Conflicts of Interest: The authors declare no conflict of interest.
References
1. Papadopoulos, H.; Wang, C.; Bursalioglu, O.; Hou, X.; Kishiyama, Y. Massive MIMO technologies and
challenges towards 5G. IEICE Trans. Commun. 2015, E99-B, 602–621. [CrossRef]
2. Obara, T.; Suyama, S.; Shen, J.; Okumura, Y. Joint Processing of Analog Fixed Beamforming and
CSI-Based Precoding for Super High Bit Rate Massive MIMO Transmission Using Higher Frequency Bands.
IEICE Trans. Commun. 2015, E98-B, 1474–1481. [CrossRef]
3. Foschini, G.J.; Gans, M.J. On limits of wireless communications in a fading environment when using multiple
antennas. Wirel. Pers. Commun. 1998, 6, 331–335. [CrossRef]
4. Gesbert, D.; Kountouris, M.; Heath, R.W., Jr.; Chae, C.B. ; Salzer, T. Shifting the MIMO paradigm. IEEE Signal
Process. Mag. 2007, 24, 36–46. [CrossRef]
5. IEEE P802.11ac./D5.0, Part 11: Wireless LAN Medium Access Control (MAC and Physical Layer (PHY)
Specifications; IEEE Standard for Information Technology; IEEE: New York, NY, USA, 2013.
6. 3rd Generation Partnership Project (3GPP). TS 36.213 Physical Layer Procedures; V10.2.0; 3GPP:
Sophia Antipolis, France, 2018.
7. Lu, L.; Li, G.Y.; Swindlehurst, A.L.; Ashikhmin, A.; Zhang, R. An overview of massive MIMO: Benefits and
challenges. IEEE J Sel. Top. Signal Process. 2014, 8, 742–758. [CrossRef]
8. Spencer, Q.H.; Peel, C.B.; Swindlehurst, A.L.; Haardt, M. An introduction to the multi-user MIMO downlink.
IEEE Commun. Mag. 2004, 42, 60–67. [CrossRef]
9. Murakami, T.; Fukuzono, H.; Takatori, Y.; Mizoguchi, M. Multiuser MIMO with implicit channel feedback in
massive antenna systems. IEICE Commun. Express 2013, 2, 336–342. [CrossRef]
10. Hiraguri, T.; Nishimori, K. Survey of transmission methods and efficiency using MIMO technologies for
wireless LAN systems. IEICE Trans. Commun. 2015, E98-B, 1250–1267. [CrossRef]
11. Kameyama, K.; Nishimori, K.; Hiraguri, T.; Yamada, H.; Makino, H. Analog multi-beam massive MIMO
transmission eliminating CSI estimation. IEICE Trans. Commun. 2016, J99-B, 743–752. (In Japanese)
12. Taniguchi, R.; Nishimori, K.; Makino, H. Multi-beam massive MIMO using constant modulus algorithm for
QAM signals employing amplitude and phase offset compensation. IEICE Trans. Commun. 2017, E100-B,
262–268. [CrossRef]
13. Muramatsu, F.; Nishimori, K.; Taniguchi, R.; Hiraguri, T. Multi-beam massive MIMO with beam-selection
using only amplitude information in uplink channel. IEICE Trans. Commun. 2018, E101-B, 1544–1551.
[CrossRef]
14. Treichler, J.R.; Agee, B.G. A new approach to multipath correction of constant modulus signals. IEEE Trans.
Acoust. Speech Signal Process. 1983, 31, 459–472. [CrossRef]
15. Agee, B. The least-Square CMA: A new technique for rapid correction of constant modulus signals.
Proc. IEEE ICASSP 1986, 953–956.
16. Hyvarinen, A. Fast and robust fixed-point algorithms for independent component analysis. IEEE Trans.
Neural Netw. 1999, 10, 626–634. [CrossRef]
17. Sohrabi, F.; Yu, W. Hybrid digital and analog beamforming design for large-scale antenna arrays. IEEE J. Sel.
Top. Signal Process. 2016, 10, 501–513. [CrossRef]
18. Bogale, T.E.; Le, L.B. Beamforming for multiuser massive MIMO systems: Digital versus hybrid
analog-digital. In Proceedings of the 2014 IEEE Global Communications Conference, Austin, TX, USA,
8–12 December 2014; pp. 4066–4071.
19. Molisch, A.F.; Ratnam, V.; Han, S.; Li, Z.; Nguyen, L.H.; Li, L.; Haneda, K. Hybrid beamforming for massive
MIMO: A survey. IEEE Commun. Mag. 2017, 55, 134–141. [CrossRef]
20. Ngo, H.Q.; Larsson, E.G.; Marzetta, T.L. Energy and spectral efficiency of very large multiuser MIMO
systems. IEEE Trans. Commun. 2013, 61, 1436–1449.
21. Rusek, F.; Persson, D.; Lau, B.K.; Larsson, E.G.; Marzetta, T.L.; Edfors, O.; Tufvesson, F. Scaling up MIMO:
Opportunities and challenges with very large arrays. IEEE Signal Process. Mag. 2013, 30, 40–60. [CrossRef]
199
Electronics 2019, 8, 225
22. Hoydis, J.; Brink, S.T.; Debbah, M. Massive MIMO in the UL/DL of cellular networks: How many antennas
do we need? IEEE J. Sel. Areas Commun. 2013, 31, 160–171. [CrossRef]
23. Butler, J.; Lowe, R. Beamforming matrix simplifies design of electronically scanned antennas. Electron. Des.
1961, 9, 170–173.
24. Yamamoto, S.; Hirokawa, J.; Ando, M. A single-layer hollow-waveguide 8-way Butler matrix with modified
phase shifters. In Proceedings of the ISAP2015, Hobart, Australia, 9–12 November 2015; pp. 937–940.
25. Zarzoso, V.; Comon, P. Robust independent component analysis by iterative maximization of the kurtosis
contrast with algebraic optimal step size. IEEE Trans. Neural Netw. 2010, 21, 248–261. [CrossRef]
26. Gesbert, D.; Bolcskei, H.; Gore, D.A.; Paulraj, A.J. Outdoor MIMO wireless channels: Models and
performance prediction. IEEE Trans. Commun. 2002, 50, 1926–1934. [CrossRef]
27. Oestges, C.; Erceg, V.; Paulraj, A.J. A physical scattering model for MIMO macrocellular broadband wireless
channels. IEEE J. Sel. Areas Commun. 2003, 21, 721–729. [CrossRef]
28. Chung, S.T.; Goldsmith, A.J. Degrees of freedom in adaptive modulation: A unified view. IEEE Trans. Commun.
2001, 49, 1561–1571. [CrossRef]
29. Nishimori, K.; Yomo, H.; Popovski, P. Distributed interference cancellation for cognitive radios using periodic
signals of the primary system. IEEE Trans. Wirel. Commun. 2011, 10, 2791–2981. [CrossRef]
30. Kikuma, N.; Nishimori, K.; Hiraguri, T. Effect of user antenna selection on block beamforming algorithms
for suppressing inter-user interference in multiuser MIMO system. IEICE Trans. Commun. 2018, E101-B,
1523–1535. [CrossRef]
31. Tanbourgi, R.; Dhillon, H.S.; Andrews, J.G.; Jondral F.K. Effect of spatial interference correlation on the
performance of maximum ratio combining. IEEE Trans. Wirel. Commun. 2014, 13, 3307–3316. [CrossRef]
32. Stankovic, V.; Haardt, M. Generalized design of multi-user MIMO precoding matrices. IEEE Trans.
Wirel. Commun. 2008, 7, 953–961. [CrossRef]
33. Rathi, S.; Malik, N.; Chahal, N.; Malik, S. Throughput for TDD and FDD 4G LTE systems. Int. J. Innov.
Technol. Explor. Eng. (IJITEE) 2014, 3, 73–77.
c 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access
article distributed under the terms and conditions of the Creative Commons Attribution
(CC BY) license (http://creativecommons.org/licenses/by/4.0/).
200
electronics
Article
Computationally Efficient Channel Estimation in 5G
Massive Multiple-Input Multiple-output Systems
Imran Khan 1 , Mohammad Haseeb Zafar 1 , Majid Ashraf 1 and Sunghwan Kim 2, *
1 Department of Electrical Engineering, University of Engineering & Technology, Peshawar P.O.B. 814,
Pakistan; [email protected] (I.K.); [email protected] (M.H.Z.);
[email protected] (M.A.)
2 School of Electrical Engineering, University of Ulsan, Ulsan 44610, Korea
* Correspondence: [email protected]; Tel.: +82-052-259-1401
Abstract: Traditional channel estimation algorithms such as minimum mean square error (MMSE)
are widely used in massive multiple-input multiple-output (MIMO) systems, but require a matrix
inversion operation and an enormous amount of computations, which result in high computational
complexity and make them impractical to implement. To overcome the matrix inversion problem,
we propose a computationally efficient hybrid steepest descent Gauss–Seidel (SDGS) joint detection,
which directly estimates the user’s transmitted symbol vector, and can quickly converge to obtain
an ideal estimation value with a few simple iterations. Moreover, signal detection performance was
further improved by utilizing the bit log-likelihood ratio (LLR) for soft channel decoding. Simulation
results showed that the proposed algorithm had better channel estimation performance, which
improved the signal detection by 31.68% while the complexity was reduced by 45.72%, compared
with the existing algorithms.
Keywords: 5G; massive MIMO; computational efficiency; precoding algorithms; channel estimation
1. Introduction
Multiple-input multiple-output (MIMO) technology is becoming more and more mature,
especially when combined with orthogonal frequency division multiplexing (OFDM) [1–5], which has
been successfully applied in multiple wireless communications fields such as Long-Term Evolution
(LTE) and LTE-Advanced. However, traditional MIMO technology can only achieve a 4 × 4 or
8 × 8 scale system [6], which makes it difficult to meet the explosive growth in mobile data services.
Therefore, in recent years, massive MIMO has been proposed based on traditional MIMO technology [7].
Massive MIMO systems configure up to hundreds of antenna arrays at the base station to serve multiple
single-antenna end-users simultaneously [8], which can improve spectrum utilization and power
utilization in wireless communications systems by two to three orders of magnitude [9–11]. This has
become one of the most promising enabling technologies and one of the hottest research directions in
5G [12]. The maximum likelihood (ML) algorithm is the optimal algorithm in massive MIMO detection
algorithms, but its computational complexity increases exponentially with the number of system
antennas and the modulation order of baseband signals. It is difficult for it to be fast, effective, and
realized in practical applications. Linear detection methods, such as the zero-forcing (ZF) algorithm
and minimum mean square error (MMSE) algorithm, can achieve near-optimal detection performance
in massive MIMO systems. The complexity in this kind of detection algorithm is greatly reduced,
compared with the complexity of the ML algorithm, but introduces a complex high-dimensional
matrix inversion operation, so a low-cost and efficient engineering implementation is still a problem
to be solved. Aimed at this problem, many simplified algorithms based on the MMSE detection
scheme have been proposed in recent years, and can be roughly divided into three types: The series
expansion class-approximation method [13,14], the iterative class-approximation method [15,16] and a
gradient-based search for an approximate solution [17–20]. The authors in [13] proposed a method of
using Neumann series expansion to approximate the inverse MMSE filter matrix, but when the number
of expansion stages was gradually increased (i > 2), the computational complexity was still high,
even equal to or exceeding the exact MMSE. The complexity of the filter matrix inversion algorithm
also loses a large degree of detection performance. The authors in [14] applied the Newton algorithm
derived from the first-order Taylor series expansion (similar to Neumann series expansion) to massive
MIMO signal detection, and used the iterative method to improve the estimation accuracy of the
MMSE filter matrix inversion. However, from the aspects of detection performance and computational
complexity, the algorithm based on the Newton iteration was not dominant. Different from the two
series expansion-based algorithms above, it is necessary to estimate the signal vector sent by the user
by inverting the approximate matrix. Some iterative algorithms based on solving linear equations, such
as the Richardson iterative (RI) algorithm [15] and the successive over-relaxation (SOR) algorithm [16],
use the special properties of the MMSE filter’s symmetric positive definite matrix. Through the method
of solving linear equations, they directly estimate the transmission vector, thus avoiding the inversion
of high-dimensional matrices.
The RI and SOR algorithms mentioned above have lower computational complexity at a fixed
number of iterations, but RI convergence is slower and requires a higher number of iterations to
achieve certain detection performance requirements. In SOR, although the detection performance
is close to excellent, its internal iterative structure means the algorithm cannot be implemented in
parallel in practical applications. The third type of algorithm is mainly designed and implemented
based on the idea of a matrix gradient, including the conjugate-gradient (CG) method [17] and the
steepest descent (SD) method [18]. This type of algorithm uses the matrix gradient search method and
avoids the high-dimensional matrix inversion problem. However, compared to the method of series
expansion, the CG and SD algorithms bring about great improvement in detection performance, but
calculation of the matrix gradient after each iteration also causes higher complexity.
In this paper, a low-complexity joint detection algorithm was proposed. The SD algorithm had a
good convergence direction at the beginning of the iteration, and the Gauss–Seidel (GS) algorithm with
low complexity mentioned in [19] was combined with the SD method (called SDGS), which provided
an effective search direction for GS iterations, speeding up convergence and improving the detection
performance. Furthermore, applying it to soft output detection gave an approximate calculation
method for the bit log-likelihood ratio (LLR) of the channel decoder input. A good compromise
between detection performance and computational complexity was achieved.
The rest of this paper is organized as follows: Section 2 discusses the system model and analytical
derivations, Section 3 explains signal detection, while Section 4 explains the mixed iterative algorithm
and the proposed algorithm. Section 5 provides the simulation results, while Section 6 concludes
the paper.
2. System Model
The research object considered in this paper was the uplink for a massive MIMO system consisting
of a base station equipped with N antennas and K single-antenna users where N K, as shown
in Figure 1. Let sc = [s1 , s2 , . . . , sK ]T denote the K × 1 dimensional symbol vector sent by all users
simultaneously, where sk ∈ ε was the transmitted symbol from the kth user, and ε was the modulation
symbol set.
Let Hc ∈ CN×K represent the Rayleigh fading channel matrix; then, the N × 1 dimensional signal
vector received by the base station could be recorded as:
yc = Hc sc + nc (1)
202
Electronics 2018, 7, 382
where nc represented an additive white Gaussian noise (AWGN) vector with an N × 1 dimensional
mean of 0 and a covariance matrix of σ 2 IN . Converting the complex model of Equation (1) into an
equivalent real model gave:
y = Hs + n (2)
Among these terms, s ∈ R2K , H ∈ R2N×2K , y ∈ R2N , and n ∈ R2N , which were:
(Hc ) −(Hc ) (yc )
H= , y=
(Hc ) (Hc ) (yc )
(3)
(sc ) (nc )
s= , n=
(sc ) (nc )
Among those, (·) and (·) indicated the real part and imaginary part, respectively.
8(
8(
.
.
.
8(
%6L
8(.
&HOOM
8(
.
.
.
8(
%6L
8(.
&HOOL
8(
.
.
.
%6L
8(.
&HOOO
where ŷ = H H y. The filter matrix W of the MMSE detector could be expressed as:
W = G + σ2 I2K (5)
203
Electronics 2018, 7, 382
where G = H H H was the Gram matrix. In massive MIMO systems, the computational complexity of
W −1 is O K3 , which makes the implementation of the MMSE algorithm very complex.
where Υi = μ2i /v2i represented the signal-to-interference plus noise ratio (SINR), and Ob0 and Ob1
represented the modulation symbol set with the bth bit being 0 and 1, respectively.
lim ( I − WX )n = 0 (8)
n→∞
204
Electronics 2018, 7, 382
n
when lim − D −1 E = 0, the progression of Equation (10) converges. If we only expand the first i
n→∞
term of the Neumann series, we can get:
n
Wi−1 = Σin−=10 − D −1 E D −1 (11)
when the value of i is small, the Neumann series expansion can approximate W −1 with lower
complexity. For example, when i = 2, W2−1 = D −1 − D −1 ED −1 , which is computationally complex,
and the complexity is O K2 .
where i = 1, 2, . . . represents the number of iterations of the GS algorithm. In a massive MIMO system,
as the number of base station antennas increases substantially, when it is much larger than the number
of single-antenna users ( N K ), the individual column vectors of the uplink channel matrix H are
progressively orthogonal [20], and W = G + σ2 I2K is a symmetric positive definite matrix. Similarly,
W can be decomposed into:
W = D + L + LH (13)
Among those terms, D, L, and L H , respectively, is the diagonal element matrix of W, the strict
lower triangular element matrix, and the strict upper triangular matrix. The GS algorithm can be
used to avoid inverting the high-dimensional matrix, which directly estimates the transmitted signal
vector ŝ:
ŝ(i) = ( D + L)−1 ŷ − L H ŝ(i−1) (14)
where ŝ(0) represents the initial solution and is usually set to a zero vector.
4. Proposed Algorithm
• Step 1: For the diagonal approximation’s initial value setting, Equation (4) can be converted to
W ŝ = ŷ; W is a symmetric positive definite matrix and a diagonally dominant matrix, so W −1 is
also a diagonally dominant matrix.
• Step 2: Determine the initial solution using D −1 instead of W −1 :
205
Electronics 2018, 7, 382
s(0) = D −1 ŷ (15)
Since D is a diagonal matrix, it is obvious that calculating D −1 requires only low complexity, and
the initial value, s(0) , is set to the initial value of the SD algorithm according to Equation (15).
• Step 3: The iterative results of the first two GS algorithms are represented by the SD algorithm,
and the second GS iteration result can be expressed as:
s (2) = ( D + L ) −1 ŷ − L H s(1) = ( D + L)−1 (( D + L) − W )s(1) + ŷ
(16)
= s(1) + ( D + L)−1 ŷ − Ws(1) = s(1) + ( D + L)−1 r (1)
H
r (0) r (0)
where r (1) = ŷ − W s(0) + ur (0) = ŷ − Ws(0) − uWr (0) = r (0) − up(0) ; u = H (0)
(17)
p (0) r
• Step 4: Combine single SD and GS iterations into one hybrid iteration by substituting
Equation (17) and
s(1) = s(0) + ur (0) → s(2) = s(0) + ur (0) + ( D + L)−1 r (0) − up(0) (18)
This represents the first two GS iterations as Equation (18); update the mixed iteration value
ŝ(1) = s(2) , and then perform the next GS iteration.
• Step 5: Using the (i − 1)th GS iteration using Equation (14), ideal estimated value ŝ(i) of the
transmitted signal vector s can be obtained by setting the appropriate number of iterations, i:
ŝ(i) = ( D + L)−1 ŷ − L H ŝ(i−1) (19)
Then, ŝ(i) is related to the complex domain for the next soft decision, so the hybrid iterative
algorithm can converge very quickly after a small number of iteration.
μ 6 ii
6i = U (20)
K 2
62i =
v ∑ U6 ji 6ii σ 2
+E (21)
j =i
−1 −1 −1 −1
6 ≈W
where U 6 c Gc = Dc−1 Gc , and E
6≈W
6 c Gc W
6c 6W
=U 6 c = UD
6 c−1 , so we can calculate Yi = μ v2i .
62i /6
206
Electronics 2018, 7, 382
4.3.2. GS iteration
Equation (19) can be expressed as (D + L)ŝ(i) = ŷ − LH ŝ(i−1) = c. After i iterations, the calculation
of ŝ(i) mainly comes from the following two steps: First, c is a 2K × 2K strictly lower triangular element
matrix; 2K × 2K and the 2K × 1 vector ŝ(i−1) are multiplied, and c must be multiplied 2K2 − K times.
(i)
Second, in Equation (19), the mth element, ŝm , can be expressed as:
⎧
⎨ c1
L11 , m=1
(i )
sm = cm −∑m −1 ( i ) (22)
⎩ k=1 sk Lmk
, m = 2, . . . , 2K
Lmm
where cm represents the mth element of c, and Lmk represents the mth row and kth column element of
(i)
the lower triangular matrix (D + L). When m = 1, it is obvious that ŝ1 requires 2K multiplications,
(i)
and all ŝm (m = 2, . . . , 2K) require 2K2 − K multiplications, so a total of 2K2 multiplications are
required for each iteration.
Table 1. Complexity comparison of four kinds of detection algorithms for hard decision calculation.
5. Simulation Results
We deployed Matlab (R2017a, Mathworks, Natick, MA, USA) for performing analysis and
experimentation. In order to verify the soft and hard detection performance of the SDGS algorithm
207
Electronics 2018, 7, 382
proposed in this paper, this section presents Monte Carlo simulation results based on Matlab. The main
simulation parameters configured are in Table 2.
Parameter Value
Number of transmitter antennas ( N ) 64–128
Number of receiver antennas (K ) 16
Baseband modulation mode 16 QAM
Signal-to-noise ratio (SNR) 10–12 dB
Code rate 1/2
Channel characteristics Uncorrelated Rayleigh fading
Number of iterations (i ) 1000
Figure 2 compares the bit error rate (BER) based on Neumann series (NS) expansion, the conjugate
gradient (CG) detection algorithm, the Gauss–Seidel iterative detection algorithm, the MMSE exact
inversion detection algorithm, and the proposed SDGS joint algorithm under different antenna
configurations. The decision mode is a hard decision; that is, estimated signal vector ŝ is directly judged.
The simulation results showed that the detection performance of the various algorithm increased
with the number of iterations or the number of items expanded by the Neumann series. For example,
when the number of iterations i = 2, the BER performance of the SDGS algorithm was much better
than the BER when the number of items expanded by the Neumann series was 2. By comparing the
performance in Figure 2a,b, it can be seen that with the increase in the ratio of the number of base
station antennas to the number of users ( N/K ), the BER performance of the various algorithms was
greatly improved. For example, if the BER was to reach 10−3 , the MMSE algorithm and the proposed
algorithm require an SNR of about 13 dB when the antenna configuration is 64 × 16, and only 8 dB
when the configuration is 128 × 16.
(a)
(b)
Figure 2. Hard decision bit error rate (BER) performance comparison. (a) Analysis at 64 × 16 antenna
configuration; (b) Analysis at 128 × 16 antenna configuration.
208
Electronics 2018, 7, 382
Figure 3 shows the soft decision simulation results for the two antenna configurations. With BER
based on the NS, CG, and GS iterative detection algorithms, the MMSE exact inversion detection
algorithm and the SDGS joint algorithm were compared. We set the system’s convolutional code rate
to 1/2, and the LLR calculation used the approximate calculation method described in this paper.
Simulation results showed that no matter what kind of MMSE receiver was used, the soft decision
was checked. The measured performance was much better than the hard decision. For example, when
the BER reached 10−4 , when the antenna was configured, the MMSE algorithm and the proposed
algorithm required an SNR of 10 dB for hard decisions and only 5 dB for soft decisions. In addition, for
the same number of iterations, the BER performance of the SDGS algorithm proposed in this paper was
better than the other three simplified algorithms, and after a few iterations, the detection performance
could quickly approach the detection performance of the ideal MMSE filter matrix inversion.
(a)
(b)
Figure 3. Soft decision bit error rate (BER) performance comparison. (a) Analysis at 64 × 16 antenna
configuration; (b) Analysis at 128 × 16 antenna configuration.
Figure 4 shows the hard decision BER comparison of the proposed SDGS algorithm with NS, CG,
GS and MMSE under a high fading scenario with 128 × 16 antenna configuration. As can be seen from
Figure 4 the BER of the proposed SDGS algorithm was better and followed the MMSE performance
with increasing SNR and number of iterations. Moreover, due to high fading impact on the SNR, there
was a gap between the proposed SDGS algorithm and MMSE algorithm at a high SNR level.
Figure 5 shows the BER comparison of the proposed SDGS algorithm with NS, CG, GS and MMSE
under a low fading level and 128 × 16 antenna configuration. It can be seen from Figure 5 that all
the algorithms showed lower BER and better performance as compared with the hard decision BER
performance in Figures 2 and 4. Therefore, to keep the system performance in a suitable level, the
209
Electronics 2018, 7, 382
fading level and number of iterations should be considered, which has an obvious impact on the
system’s overall performance. Furthermore, the proposed SDGS algorithm in Figure 5 had a close BER
performance with MMSE which indicated that the SDGS algorithm showed better performance in the
low fading level.
Figure 4. Hard decision bit error rate (BER) performance comparison with high fading level at 128 × 16
antenna configuration.
Figure 5. Soft decision bit error rate (BER) performance comparison with slow fading level at 128 × 16
antenna configuration.
6. Conclusions
Signal detection methods based on MMSE filtering in massive MIMO systems are widely used,
but matrix inversion with higher complexity makes it more difficult to implement them in practical
applications. Some methods of approximate inversion, such as Neumann series expansion, has reduced
the detection complexity, but due to a large degree of detection performance loss; others avoid the
complex matrix inversion and directly estimate the signal vector. Although, computational complexity
is reduced by orders of magnitude, detection performance needs to be improved. Based on the MMSE
criterion, this paper proposes a low-complexity, hybrid, iterative SDGS joint detection algorithm,
which directly estimates the user’s transmitted symbol vector and can quickly converge to obtain an
ideal estimation value with a few simple iterations. The matrix inversion operation is avoided, and
210
Electronics 2018, 7, 382
algorithm complexity is kept at O K2 . In addition, in order to make full use of soft information, the
algorithm is applied to the soft decision, and an approximate calculation method of the LLR for channel
decoding is given, which further improves the signal detection performance. Theoretical derivation
and simulation results show that the SDGS algorithm can be used as one of the most effective solutions
for signal detection in massive MIMO systems.
Author Contributions: I.K. conceived and designed the presented idea and developed the theory, performed
the simulations, and wrote the paper. M.H.Z. analyzed the research and performed experimentations, and M.A.
provided extensive technical support throughout the research. S.K. provided extensive support in the theoretical
analysis and provided funding support.
Funding: This research was supported by the Basic Science Research Program through the National Research
Foundation of Korea (NRF-2016R1D1A1B03934653).
Conflicts of Interest: The authors declare no conflicts of interest.
References
1. Chang, Y.K.; Ueng, F.B.; Jhang, Y.W. Turbo MIMO-OFDM Receiver in Time-Varying Channels. KSII Trans.
Internet Inf. Syst. 2018, 12, 3704–3724.
2. Tseng, S.M.; Chen, Y.F. Average PSNR Optimized Cross Layer User Grouping and Resource Allocation for
Uplink MU-MIMO OFDMA Video Communications. IEEE Access 2018, 6, 50559–50571. [CrossRef]
3. Ghosh, A.; Ratasuk, R.; Mondal, B.; Mangalvedhe, N.; Thomas, T. LTE-advanced: Next-generation wireless
broadband technology. IEEE Wirel. Commun. 2010, 17, 10–22. [CrossRef]
4. Khan, I.; Zafar, M.H.; Jan, M.T.; Lloret, J.; Basheri, M.; Singh, D. Spectral and Energy Efficient Low-Overhead
Uplink and Downlink Channel Estimation for 5G Massive MIMO Systems. Entropy 2018, 20, 92. [CrossRef]
5. Rusek, F.; Persson, D.; Lau, B.K.; Larsson, E.G.; Marzetta, T.L.; Edfors, O.; Tufvesson, F. Scaling up MIMO:
Opportunities and challenges with very large arrays. IEEE Signal Process. Mag. 2012, 30, 40–60. [CrossRef]
6. Larsson, E.G.; Edfors, O.; Tufvesson, F.; Marzetta, T.L. Massive MIMO for next-generation wireless systems.
IEEE Commun. Mag. 2014, 52, 186–195. [CrossRef]
7. Marzetta, T.L. Noncooperative cellular wireless with unlimited numbers of base station antennas. IEEE Trans.
Wirel. Commun. 2010, 9, 3590–3600. [CrossRef]
8. Khan, I.; Singh, D. Efficient compressive sensing based sparse channel estimation for 5G massive MIMO
systems. AEU-Int. J. Electron. Commun. 2018, 89, 181–190. [CrossRef]
9. Ngo, H.Q.; Larsson, E.G.; Marzetta, T.L. Energy and spectral efficiency of very large multiuser MIMO
systems. IEEE Trans. Commun. 2011, 61, 1436–1449.
10. Dai, L.; Wang, Z.; Yang, Z. Spectrally efficient time-frequency training OFDM for mobile large-scale MIMO
systems. IEEE J. Sel. Areas Commun. 2013, 31, 251–263. [CrossRef]
11. Benmimoune, M.; Driouch, E.; Ajib, W. Joint antenna selection in grouping in Massive MIMO Systems.
In Proceedings of the 10th International Symposium on Communication Systems, Networks and Digital
Signal Processing (CSNDSP), Prague, Czech Republic, 20–22 July 2016; pp. 1–6.
12. Khan, I. A Robust Signal Detection Schemefor 5G Massive Multiuser MIMO Systems. IEEE TVT. 2018, 67,
9567–9604.
13. Wu, M.; Yin, B.; Wang, G.; Dick, C.; Cavallaro, J.R.; Studer, C. Large-scale MIMO detection for 3GPP LTE:
Algorithms and FPGA implementations. IEEE J. Sel. Top. Signal Process. 2014, 85, 916–929. [CrossRef]
14. Tang, C.; Liu, C.; Yuan, L.; Xing, Z. High precision low complexity matrix inversion based on Newton
iteration for data detection in the massive MIMO. IEEE Commun. Lett. 2016, 20, 490–493. [CrossRef]
15. Gao, X.; Dai, L.; Yuen, C.; Zhang, Y. Low-complexity MMSE signal detection based on Richardson method
for large-scale MIMO systems. In Proceedings of the Vehicular Technology Conference, Vancouver, BC,
Canada, 14–17 September 2014; pp. 1–5.
16. Gao, X.; Dai, L.; Hu, Y.; Wang, Z.; Wang, Z. Matrix inversion-less signal detection using SOR method for
uplink large-scale MIMO systems. In Proceedings of the IEEE Global Communications Conference, Qingdao,
China, 8–12 December 2014; pp. 3291–3295.
211
Electronics 2018, 7, 382
17. Hu, Y.; Wang, Z.; Gaol, X.; Ning, J. Low-complexity signal detection using the CG method for uplink
large-scale MIMO systems. In Proceedings of the IEEE International Conference on Communications, Macau,
China, 19–21 November 2014; pp. 477–481.
18. Hageman, L.A. The Iterative Solution for Large Linear Systems; Academic Press: Manhattan, NY, USA, 1971;
pp. 9–22.
19. Dai, L.; Gao, X.; Su, X.; Han, S.; Chih-Lin, I.; Wang, Z. Low-complexity soft-output signal detection based on
a Gauss-Seidel method for uplink multiuser large-scale MIMO systems. IEEE Trans. Veh. Technol. 2014, 64,
4839–4845. [CrossRef]
20. Wu, M.; Yin, B.; Vosoughi, A.; Studer, C.; Cavallaro, J.R.; Dick, C. Approximate matrix inversion for
high-throughput data detection in the large-scale MIMO uplink. In Proceedings of the International
Symposium on Circuits & Systems, Beijing, China, 19–23 May 2013; pp. 12155–12158.
© 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access
article distributed under the terms and conditions of the Creative Commons Attribution
(CC BY) license (http://creativecommons.org/licenses/by/4.0/).
212
electronics
Article
A Novel Iterative Discrete Estimation Algorithm for
Low-Complexity Signal Detection in Uplink Massive
MIMO Systems
Hui Feng 1 , Xiaoqing Zhao 2 , Zhengquan Li 2, * and Song Xing 3
1 Department of Electronic Science and Technology, Tongji University, Shanghai 201804, China
2 Jiangsu Provincial Engineering Laboratory of Pattern Recognition and Computational Intelligence,
Jiangnan University, Wuxi 214122, China
3 Department of Information Systems, California State University, Los Angeles, CA 90032, USA
* Correspondence: [email protected]; Tel.: +86-0510-8591-0633
Abstract: In this paper, a novel iterative discrete estimation (IDE) algorithm, which is called the
modified IDE (MIDE), is proposed to reduce the computational complexity in MIMO detection in
uplink massive MIMO systems. MIDE is a revision of the alternating direction method of multipliers
(ADMM)-based algorithm, in which a self-updating method is designed with the damping factor
estimated and updated at each iteration based on the Euclidean distance between the iterative
solutions of the IDE-based algorithm in order to accelerate the algorithm’s convergence. Compared to
the existing ADMM-based detection algorithm, the overall computational complexity of the proposed
MIDE algorithm is reduced from O Nt3 + O Nr Nt2 to O Nt2 + O ( Nr Nt ) in terms of the number
of complex-valued multiplications, where Ntand Nr are the number of users and the number of
receiving antennas at the base station (BS), respectively. Simulation results show that the proposed
MIDE algorithm performs better in terms of the bit error rate (BER) than some recently-proposed
approximation algorithms in MIMO detection of uplink massive MIMO systems.
Keywords: massive MIMO systems; MIDE algorithm; low computational complexity; BER
1. Introduction
With the development of the mobile Internet and the Internet of Things, much high data rate
communication is required in the new generation of cellular networks like 5G [1]. By equipping
hundreds of antennas at the base station (BS) serving tens of users, the massive multiple-input
multiple-output (MIMO) is deemed one key technology for meeting the 5G requirements due to its
improvements in data throughput, link reliability, higher spectral efficiency, and better communication
quality compared with the traditional MIMO usage [2–4].
However, when applying the massive MIMO, a major computational challenge is the data
detection in uplink MIMO systems due to the large increase in the system dimensions [5].
The maximum likelihood (ML) is the optimal detection approach on data detection, but its
computational complexity grows exponentially with the number of user equipment (UE) and
the modulation order [6,7]. Some suboptimal detection alternatives are proposed to reduce the
computational complexity while obtaining a good bit error rate (BER) performance. For example,
the linear minimum mean squared error (LMMSE) algorithm is one of the widely-used suboptimal
detection algorithms with near-optimal BER performance and reduced computational complexity [8].
However, the LMMSE algorithm still involves the computation of the Gram matrix, as well as matrix
inversion, where their corresponding computational complexity is O Nr Nt2 and O Nt3 , respectively,
with Nt denoting the number of single-antenna UE and Nr denoting the number of antennas at the BS.
It is worth noting that some approaches for approximating the matrix inversion have been
proposed to reduce the computational complexity [9–19], among which, for example, the Neumann
series (NS) approximation is used to approximate the matrix inversion by a series of truncated
NS expansions [9,10]. However, only marginal reduction in the computational complexity can
be reached with the increased terms of the NS expansion. Hence, various classical iterative
algorithms have been provided to approximate the inverse matrix in LMMSE detection to achieve
low computational complexity, which include the Richardson (RI) algorithm [11], the Jacobi
algorithm [12–15], the successive over relaxation (SOR) algorithm [16], the symmetric successive
over relaxation (SSOR) algorithm [17], and the Gauss–Seidel (GS) algorithm [18,19]. The computational
complexity of the matrix inverse is reduced by these approximation-based detection algorithms from
O Nt3 to O Nt2 . It is noted that these algorithms involving the inverse-matrix approximation in
LMMSE detection (also known as the approximated LMMSE) achieve a near-LMMSE performance,
but with lower computational complexity.
It is well known that the approximated LMMSE detection algorithms achieve a substantial
performance loss when Nr > Nt in multiuser massive MIMO systems. Various algorithms have been
proposed to obtain a better BER performance than that of LMMSE detection in multiuser massive
MIMO systems, which include the non-convex and the convex optimization algorithms [20–22].
For example, the alternating minimization (AltMin) algorithm is one of the non-convex
optimization algorithms, which is applied to the data detection in a multiuser massive MIMO
system [20]. Specifically, the AltMin algorithm converts the ML detection problem into a sum of convex
functions by decomposing the received vector into multiple sub-vectors. Hence, the non-convex
problem is transformed into the convex problem in the AltMin algorithm. The AltMin algorithm
shows better BER performance than that of the LMMSE detector in overloaded network scenarios
with relatively low computational complexity. However, it shows near-LMMSE performance with
even higher computational complexity when the ratio of the number of BS antennas to the number of
single-antenna users is larger.
Similarly, some convex optimization algorithms are used to solve the non-convex optimization
problems, which include, for example, the alternating direction method of multipliers (ADMM)
detection algorithm [21,22]. In the multiuser massive MIMO system, ADMM demonstrates better BER
performance than the LMMSE detection algorithm with the relatively low computational complexity
of the iterative procedure. However, the computational complexity of the preprocessing in the ADMM
algorithm includes the calculation complexity of the Gram matrix and LDLdecomposition [23,24],
which results in very high computational complexity for massive MIMO systems.
To make a tradeoff between the performance and the computational complexity with different
antenna configurations, the iterative discrete estimation (IDE) algorithm is integrated into the ADMM
algorithm [25], which presents low computational complexity due to the avoidance of the calculation
of the Gram matrix and LDL decomposition. Motivated by the aforementioned algorithms, we propose
a modified IDE (MIDE) algorithm to present a better BER performance and lower computational
complexity than the ADMM algorithms. To summarize, the main contributions of this work are listed
as follows.
• A novel iterative data detection algorithm for uplink multiuser massive MIMO systems is
designed by exploiting the IDE-based algorithmic framework. The proposed MIDE algorithm refactors
the detection algorithm as a series of simpler subproblems with closed-form solutions.
• A heuristic damping factor is defined based on the Euclidean distance instead of a fixed factor.
Compared with the fixed damping factor, this self-updated damping factor contributes to a faster
convergence in the proposed MIDE algorithm.
• The computational complexity analysis indicates that the proposed algorithm has a lower
computational complexity than the traditional approximated detection approximation algorithms
(LMMSE, AltMin, and ADMM), under the same BER performance. Specifically, the complexity of the
novel MIDE detection algorithm is only O Nt2 + O ( Nr Nt ).
214
Electronics 2018, 8, 980
• Simulation results reveal that for the typical independent and identically distributed (i.i.d.)
frequency flat Rayleigh fading channel in massive MIMO systems, the proposed MIDE detection
algorithm performs better than the ADMM and AltMin-based detection algorithms and the LMMSE
detection algorithm in terms of BER performance with various system configurations.
The rest of the paper is organized as follows. In Section 2, we briefly introduce the system
model. Section 3 specifies the proposed low-complexity signal detection based on the IDE algorithm
and performs the computational complexity analysis of the algorithms. In Section 4, the numerical
simulation results of the BER performance are presented. Finally, Section 5 concludes the paper.
Notation: Throughout the paper, the lowercase and uppercase boldface type is used for vectors
(e.g., a) and matrices (e.g., A). The superscripts (·)−1 and (·) H denote the matrix inversion and the
conjugate transpose, respectively. The L2 norms of the vectors are represented by ·2 . and denote
the real part and the imaginary part of the complex-valued signal, respectively.
The typical uplink massive MIMO system is considered in this work, as shown in Figure 1,
where there are Nt single-transmitting antenna UE devices and Nr receiving antennas at the BS [26].
In general, Nr is larger than Nt for an uplink massive MIMO communication system [27].
0RGXODWRU
V [ \
,QIRUPDWLRQ 'HWHFWHG
ELWV ELWV
...
...
6LJQDO
63
...
'HWHFWRU
1W 1U
0RGXODWRU
V 1W [ 1W \ 1U
2. System Model
At the transmitter side, the source information s = [s1 , · · · , si , · · · s Nt ] T where each symbol s
is mapped to constellation symbols by taking symbols from a set of the constellation alphabet Ω.
The transmitted signal x = [x1 , · · · , xi , · · · x Nt ] T is constructed by the modulated symbol s, where xi
denotes a signal transmitted by the ith UE device. The vector y = [y1 , · · · , yi , · · · y Nr ] T represents the
receiving signal at the BS, where yi denotes a signal received by the ith receive antenna, and:
y = Hx + w (1)
where w is the Nr -by-one additive white Gaussian noise (AWGN) vector following CN 0, σ2 with σ2
representing the average power of the noise. In Equation (1), the matrix H denotes the Nr -by-Nt flat
fading channel gain, and H can be expressed as:
⎡ ⎤
h11 h12 · · · h1Nt
⎢ ⎥
⎢ h21 h22 · · · h2Nt ⎥
H=⎢ ⎢ .. .. .. .. ⎥
⎥ (2)
⎣ . . . . ⎦
h Nr 1 h Nr 2 · · · h Nr Nt
where the element hi,j , i = {1, 2, · · · , Nr },j = {1, 2, · · · , Nt } denotes the channel impulse response
between the receiving antenna j and the user i. It is noted that hij follows an i.i.d. Gaussian distribution
with zero mean and unit variance. In addition, the channel matrix H is assumed to be known perfectly
at the BS [28,29].
215
Electronics 2018, 8, 980
It is noted that in Equation (3), the finite-alphabet constraint x ∈ Ω Nt can be converted into the
indicator function IΩ (x̂), which is given by:
0, ifx̂ ∈ Ω Nt
IΩ (x̂) = (4)
∞, otherwise
By combining Equation (3) and (4), the signal detection problem can be converted into the
constrained optimization problem, which is given by:
where z is the least-squares solution of the least-squares and x̂ is the estimated transmitted symbol.
Since the optimization problem is defined over complex-valued variables, the Lagrangian objective
function for the optimization problem in Equation (5) can be remodeled as:
where γ > 0 is the penalty parameter and u is the dual vector. In order to solve this problem efficiently,
we decompose it into three sub-problems. First, we solve z while holding x̂ and u fixed; then, we solve
x̂ while holding z and u fixed; finally, we solve u while holding x̂ and z fixed. Specifically, the following
procedure is repeated with iterations.
zk = arg min Lγ z, x̂k−1 , uk−1 (7)
z
x̂k = arg min Lγ zk , x̂k , uk−1 (8)
x̂
uk = uk−1 + γ zk − x̂k (9)
Note that the z-minimization procedure is convex, but the x̂-update is projected onto a convex set
Ω Nt . To make the iterative procedure converge, the IDE-based algorithm is applied to remove the dual
vector u at each iterative and turn the z-update and x̂-update to reach a consensus. After manipulation,
the x̂-update involves solving a linearly-constrained minimum Euclidean-norm problem, and the
z-update in IDE is given by:
216
Electronics 2018, 8, 980
−1
zk = x̂k−1 + diag H H H H H y − Hx̂k−1 (10)
where ΠΩ (·) is the projection onto Ω Nt , i.e., the elements of zk+1 can be implemented through simple
rounding of each component to the closest element in Ω.
Additionally, to make the iteration converge, a fixed damping factor α is employed in the
IDE-based algorithm to update the iterative solution x̂, i.e.,
x̂dk+1 = 1 − αk x̂kd + αk x̂k+1 (12)
where x̂dk+1 denotes the solution after updating. By applying x̂d to the z-update, the IDE-based
algorithm for detection can be expressed as:
−1
zk = x̂kd−1 + diag H H H H H y − Hx̂dk−1 (13)
x̂k = ΠΩ zk (14)
x̂kd = 1 − αk x̂dk−1 + αk x̂k (15)
Since there is no prior information of the final result x̂d , the initial value of x̂0d can be set as a
zero vector. Hence, the expression for x̂d in Equation (16) is composed of the solution of x̂ with
different values of the damping factor. Based on the expression in Equation (16), the convergence of the
iterations can be measured by the difference between x̂kd and x̂k+1 . Specifically, the Euclidean distance,
which is one of the widely-used approaches for measuring the distance between two vectors [30],
is defined as:
8
9N
9 t
= : ∑ x̂kd (i, 1) − x̂k+1 (i, 1)
2
dk x̂kd , x̂k+1 (17)
i =1
217
Electronics 2018, 8, 980
We can notice from Equation (17) that the smaller the distance d, the closer the two vectors,
implying the convergence of the IDE iterations. Based on this discussion, the following heuristic
damping factor at the kth iteration is defined as:
dk
αk = (18)
dk +q
where q is a positive constant. Obviously, a higher value of d leads to a higher α, and vice versa.
In other words, when dk → 0, αk → 0, and when dk → 1, αk → 1. We utilize the first iteration result d1
to obtain the value of p. According to Equation (16), d1 can be calculated as:
8
9N
9 t
d = : ∑ x̂0d (i, 1) − x̂1 (i, 1)
1 2
i =1
8
9N
9 t
= : ∑ (x̂1 (i, 1))
2
(19)
i =1
8
9N
9 t
= : ∑ ( (x̂1 (i, 1))) + ( (x̂1 (i, 1)))
2 2
i =1
It is noticed that the value of d1 is varying and decided based on several factors such as the
modulation method, the noise in the system, etc. For the ease of calculation, the expectation of d1
is computed instead of the direct calculation of d1 . Since the real part and the imaginary part of the
vector x̂1 have the same uniform distribution, the expectation for d1 is obtained as:
⎛8
9N ⎞
9 t
E d1 = 2E ⎝: ∑ ( (x̂1 (i, 1))) ⎠
2
(20)
i =1
The expectation of d1 is based on the constellation points of the modulation scheme. With 16-QAM,
for example, x̂1 (i, 1) can be {−3, −1, +1, +3}, and the probability p of each possible value of the
point is the same without the prior information, i.e., p = 0.25 in this case. Hence, Equation (19) can be
rewritten as:
⎛8
9N ⎞ 7
9 t ∑ cand2
E d1 = 2E ⎝: ∑ ( (x̂1 (i, 1))) ⎠ = 2Nt
2
√ (21)
i =1 M
where cand represents the candidate value set of x̂1 (i, 1) and M represents the modulation
cardinality. For the ease of description, we still take the 16-QAM as an example. Then, cand is denoted
as the candidate values {−3, −1, +1, +3} and ∑ cand2 = (−3)2 + (−1)2 + 12 + 32 . M represents the
modulation cardinality. Based on experience, the value of α1 = 0.8 is set as 0.8. Then, substituting
Equation (22) into Equation (18) yields the estimation of q as:
7
E d1 Nt ∑ cand2
q= = √ (22)
4 2 M
The simulation results in Section 4 show that the proposed MIDE algorithm can improve the BER
performance significantly compared to the IDE-based algorithm, which employs the fixed damping
factor. The procedure of the proposed MIDE detection algorithm is illustrated in Algorithm 1.
218
Electronics 2018, 8, 980
2: W ← DH H ;
.
cand2
3: q ← N2t ∑√ ;
M
6 Initialization
4: x̂0d = 0;
Iterative process (repeat K times)
5: for k = 1–K (iteration
times) do
6: zk ← x̂kd−1 + W y − Hx̂dk−1
7: x̂k ←ΠΩ zk
8: rk ← x̂kd−1 − x̂k
7
Nt
2 2
9: d ←
k
∑ rk (i, 1) + rk (i, 1)
i =1
k
10: αk ← d
;
d +q
k
(1) Preprocess: The first part comes from the related computation before the iterative process.
The main factors affecting the computational complexity of the preprocess are the computation
of D and the multiplication of the Nt × Nt diagonal matrix and the Nt × Nr matrix. Let hi be
the ith column of the complex-valued channel matrix H. Then, the diagonal calculation can be
presented as:
1
hiH h j
, i=j
Dij = (23)
0, i=j
219
Electronics 2018, 8, 980
and two scalar multiplications with Nt × 1 vectors. Then, the complexity in this part is counted
as 3Nt .
Therefore, the overall required number of complex multiplications by the MIDE algorithm is
Nt Nr + Nt2 + K (2Nr Nt + 3Nt ). For comparison, the similar calculations of the complexity of the typical
detection algorithms (e.g., LMMSE with full matrix inversion, AltMin, ADMM) are given, as well as
the one of the proposed MIDE in Table 1.
Algorithm Complexity
LMMSE [8] 5 3
6 Nt + + 43 Nt + Nr Nt2 + Nr Nt
3 2
4 Nt
AltMin [18] (4K + 2) Nr Nt
2 Nt + Nt + Nr Nt + Nr Nt + K Nt + Nr Nt
1 3 2 2 2
ADMM [20]
MIDE Nt Nr + Nt2 + K (2Nr Nt + 3Nt )
Note that all these algorithms utilize the approximation approaches to solve the ML problem.
It is obvious that the proposed MIDE algorithm and AltMin algorithm have lower computational
complexity at each iteration among the compared iterative approaches. The numerical analysis will
be provided for further analysis of the computational complexity, which depends on the number of
iterations K and is presented in Section 4.
4. Simulation Results
220
Electronics 2018, 8, 980
%(5
0,'([
0,'([
0,'([
7LWHUDWLRQ
Figure 3 depicts the BER performance comparison of the IDE-based detection algorithm with the
fixed damping factor α = 0.05 and the proposed MIDE detection with the self-updated damping factor.
It is clear that the proposed MIDE-based detection with the self-updated damping factor showed better
BER performance than the conventional IDE-based detection algorithm with all antenna configurations.
%(5
,'([
0,'([
,'([
0,'([
,'([
0,'([
615 G%
Figure 3. BER performance of the conventional damping factor and the proposed damping factor.
Moreover, Figure 4 compares the BER performances of the proposed MIDE algorithm and the
conventional ADMM algorithm. From the figure, it is clear that the BER performance of both algorithms
degraded when the number of users increased. However, it is observed that the proposed MIDE-based
detection performed better than the conventional ADMM detection in terms of BER performance with
all antenna configurations. Furthermore, we can observe from Figure 4 that when the target of BER
was set as 10−3 , the SNR required by the proposed algorithm was at least 0.5 dB less than the one of
the conventional ADMM algorithm.
221
Electronics 2018, 8, 980
%(5
$'00>@[
0,'([
$'00>@[
0,'([
$'00>@[
0,'([
615 G%
Figure 4. BER performance comparison of the conventional ADMM and the proposed algorithm.
Finally, we show the BER performance comparison of the conventional AltMin algorithm,
the LMMSE algorithm, and the proposed MIDE algorithm in Figure 5.
%(5
0,'([
$OW0LQ>@[
/006(>@[
0,'([
$OW0LQ>@[
/006(>@[
0,'([
$OW0LQ>@[
/006(>@[
615 G%
Figure 5. BER performance comparison of the AltMin algorithm, the LMMSE algorithm, and the
proposed algorithm.
It is clear that with any ratio of Nr /Nt , the proposed MIDE algorithm performed better than all
the compared algorithms in terms of BER performance.
222
Electronics 2018, 8, 980
MIDE algorithm, respectively, based on the convergence simulations. Figure 6 illustrates the total
number of multiplications vs. the number of transmitting antennas, which was based on the analysis
of the computational complexity of the algorithms in Section 3.3.
1XPEHURIFRPSOH[YDOXHGPXOWLSOLFDWLRQRSUDWLRQV /006(>@
$OW0LQ>@
$'00>@
0,'(
1XPEHURIXVHUV
From Figure 6, we can see that the computational complexity increased with the number of
users in all compared algorithms. However, the proposed MIDE algorithm achieved the lowest
computational complexity among all compared algorithms under various antenna configurations.
Specifically, the MIDE algorithm showed a relatively lower computational complexity than the AltMin
algorithm, which was proven to have a low complexity detection for uplink massive systems in [20].
In addition, from Figure 6, the proposed MIDE algorithm achieved much lower computational
complexity than the LMMSE and ADMM detection algorithms when the dimension of the MIMO
system became larger. As a consequence, the proposed MIDE detection is much more applicable for
massive MIMO systems with its low computational complexity.
5. Conclusions
In this paper, we proposed a low-complexity, IDE-based detection algorithm in uplink massive
MIMO systems. The proposed MIDE algorithm avoided the calculation of the Gram matrix,
the matrix inversion, and LDL decomposition to reach a low computational complexity. In addition,
a self-updating damping method was provided with the damping factor estimated and updated at
each iteration based on the Euclidean distance between the latest two detection solutions, which
accelerated the convergence of the IDE-based detection algorithms. Simulation results showed that
the proposed MIDE algorithm performed better than the conventional LMMSE, AltMin, and ADMM
detection algorithms in terms of the BER performance and the computational complexity.
Author Contributions: H.F. and X.Z. contributed the idea generation; S.X. and X.Z. conducted the study design
and paper writing; Z.L. helped the analysis of the simulation results.
Funding: This work is supported in part by the National Natural Science Foundation of China (No. 61571108,
No. 61701197, and No. 61801193), the open research fund of National Mobile Communications Research Laboratory
of millimeter wave, Southeast University (No. 2018D15), the open research fund of the National Key Laboratory
of millimeter wave, Southeast University (No.K201918), the Open Foundation of Key Laboratory of Wireless
Communication, Nanjing University of Posts and Telecommunication (No. 2017WICOM01), and the Wuxi Science
and Technology Development Fund (No. H20191001).
Conflicts of Interest: The authors declare no conflict of interest.
223
Electronics 2018, 8, 980
References
1. Xiang, W.; Zheng, K.; Shen, X.S. 5G Mobile Communications; Springer International Publishing: Basel,
Switzerland, 2017.
2. Lu, L.; Li, G.Y.A.; Swindlehurst, L.; Ashikhmin, A.; Zhang, R. An overview of massive MIMO: Benefits and
challenges. IEEE J. Sel. Topics Signal Process. 2014, 8, 742–758. [CrossRef]
3. Wu, M.; Yin, B.; Wang, G.; Dick, C.; Cavallaro, J.R.; Studer, C. Large-scale MIMO detection for 3GPP LTE:
algorithms and FPGA implementations. IEEE J. Sel. Topics Signal Process. 2014, 8, 916–929. [CrossRef]
4. Ngo, H.; Larsson, E.; Marzetta, T. Energy and spectral efficiency of very large multiuser MIMO systems.
IEEE Trans. Commun. 2012, 61, 1436–1449.
5. Rusek, F.; Persson, D.; Lau, B.K.; Larsson, E.G.; Marzetta, T. L.; Edfors, O.; Tufvesson, F. Scaling up MIMO:
Opportunities and challenges with very large arrays. IEEE Signal Process. Mag. 2013, 30, 40–60. [CrossRef]
6. Chockalingam, A.; Rajan, B.S. Large MIMO Systems; Cambridge University Press: Cambridge, UK, 2014.
7. Kuchi, K.; Ayyar, A.B. Performance analysis of ML detection in MIMO systems with co-channel interference.
IEEE Commun. Lett. 2011, 15, 786–788. [CrossRef]
8. Mandloi, M.; Bhatia, V. Low-complexity near-optimal iterative sequential detection for uplink massive
MIMO systems. IEEE Commun. Lett. 2017, 21, 568–571. [CrossRef]
9. Gustafsson, O.; Bertilsson, E.; Klasson, J.; Ingemarsson, C. Approximate Neumann Series or exact matrix
inversion for massive MIMO? In Proceedings of the 2017 IEEE 24th Symposium on Computer Arithmetic
(ARITH), London, UK, 24–26 July 2017.
10. Fang, L.; Huang, D. Neumann Series Expansion Based LMMSE Channel Estimation for OFDM Systems.
IEEE Commun. Lett. 2016, 20, 748–751. [CrossRef]
11. Gao, X.; Dai, L.; Ma, Y.; Wang, Z. Low-complexity near-optimalsignal detection for uplink large-scale MIMO
systems. Electron. Lett. 2014, 50, 1326–1328. [CrossRef]
12. Qin, X.; Yan, Z.; He, G. A near-optimal detection scheme based on joint steepest descent and Jacobi method
for uplink massive MIMO systems. IEEE Commun. Lett. 2016, 20, 276–279. [CrossRef]
13. Lee, Y. Decision-aided Jacobi iteration for signal detection in massive MIMO systems. Electron. Lett. 2017, 53,
1552–1554. [CrossRef]
14. Kong, B.Y.; Park, I. Low-complexity symbol detection for massive MIMO uplink based on Jacobi method.
In Proceedings of the IEEE 27th Annual International Symposium on Personal, Indoor, and Mobile Radio
Communications (PIMRC), Valencia, Spain, 1–5 September 2016.
15. Minango, J.; de Almeida, C.; Daniel Altamirano, C. Low-complexity MMSE detector for massive MIMO systems
based on Damped Jacobi method. In Proceedings of the 2017 IEEE 28th Annual International Symposium on
Personal, Indoor, and Mobile Radio Communications (PIMRC), Montreal, QC, Canada, 1–5 October 2017.
16. Deng, Q.; Guo, L.; Dong, C.; Lin, J.; Meng, D.; Chen, X. High-throughput signal detection based on fast
matrix inversion updates for uplink massive multiuser multiple-input multi-output systems. IET Commun.
2017, 11, 2228–2235. [CrossRef]
17. Sun, Y.; Li, Z.; Zhang, C.; Zhang, R.; Yan, F.; Shen, L. Low complexity signal detector based on SSOR
iteration for large-scale MIMO systems. In Proceedings of the 9th International Conference on Wireless
Communications and Signal Processing (WCSP), Nanjing, China, 1–6 October 2017.
18. Dai, L.; Gao, X.; Su, X.; Han, S.; Chih-Lin, I.; Wang, Z. Low-complexity soft-output signal detection based on
Gauss–Seidel method for uplink multiuser large-scale MIMO systems. IEEE Trans. Veh. Technol. 2015, 64,
4839–4845. [CrossRef]
19. Zeng, J.; Lin, J.; Wang, Z. An improved Gauss–Seidel algorithm and its efficient architecture for massive
MIMO systems. IEEE Trans. Circuits Syst. II-Express Briefs. 2018, 65, 1194–1198. [CrossRef]
20. Elgabli, A.; Elghariani, A.; Aggarwal, V.; Bell, M.R. A low-complexity detection algorithm for uplink massive
MIMO systems based on Alternating Minimization. IEEE Wirel. Commun. Lett. 2019, 8, 917–920. [CrossRef]
21. Souto, N.; Dinis, R. MIMO detection and equalization for single-carrier systems using the Alternating
Direction Method of Multipliers. IEEE Signal Process. Lett. 2016, 23 , 1751–1755. [CrossRef]
22. Shahabuddin, S.; Juntti, M.; Studer, C. Admm-based infinity normdetection for large MU-MIMO: Algorithm
and vlsi architecture. In Proceedings of the IEEE International Symposium on Circuits and Systems (ISCAS),
Baltimore, MD, USA,1–4 May 2017.
224
Electronics 2018, 8, 980
23. Zhang, K.; Zhang, Y. Recursive algorithm for calculating eigenvalues of real symmetric matrix based on
LDLT decomposition. Computer Engineering and Applications. 2008, 44, 78–80.
24. Chou, C.; Wu, J. Low-complexity MIMO precoder design with LDL H channel decomposition. IEEE Trans.
Veh. Technol. 2011, 60, 2368–2372. [CrossRef]
25. Wang, C.; Wen, C.; Jin, S.; Tsai, S. Finite-Alphabet precoding for massive MU-MIMO with low-resolution
DACs. IEEE Trans. Wirel. Commun. 2018, 17, 4706–4720. [CrossRef]
26. Chen, J. A low complexity data detection algorithm for uplink multiuser massive MIMO systems. IEEE J. Sel.
Areas Commun. 2017, 35, 1701–1714. [CrossRef]
27. Jin, F.; Liu, Q.; Liu, H.; Wu, P. A low complexity signal detection scheme based on improved Newton iteration
for massive MIMO systems. IEEE Commun. Lett. 2019, 23, 748–751. [CrossRef]
28. Wu, Z.; Ge, L.; You, X.; Zhang, C. Efficient near-MMSE detector for large-scale MIMO systems. In Proceedings
of the 2017 IEEE International Workshop on Signal Processing Systems (SiPS), Lorient, France, 1–6 October 2017.
29. Wu, C.; Huang, W.; Chung, W. Robust update algorithms for Zero-Forcing detection in uplink large-scale
MIMO systems. IEEE Commun. Lett. 2018, 22, 424–427. [CrossRef]
30. Monteiro, F.A.; Wassell, I.J. Euclidean distances in quantized spaces with pre-stored components for MIMO
detection. In Proceedings of the European Conference on Wireless Technologies, Munich, Germany, 8–10
October 2007.
31. Hunger, R. Floating Point Operations in Matrix-Vector Calculus; Technical Report; Munich University of
Technology: Munich, Germany, 2007.
© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access
article distributed under the terms and conditions of the Creative Commons Attribution
(CC BY) license (http://creativecommons.org/licenses/by/4.0/).
225
electronics
Article
Energy Efficiency Optimization for Massive MIMO
Non-Orthogonal Unicast and Multicast Transmission
with Statistical CSI
Wenjin Wang *, Yufei Huang, Li You, Jiayuan Xiong, Jiamin Li and Xiqi Gao
National Mobile Communications Research Laboratory, Southeast University, Nanjing 210096, China
* Correspondence: [email protected]; Tel.: +86-025-8379-0506
Abstract: We study the energy efficiency (EE) optimization problem in non-orthogonal unicast and
multicast transmission for massive multiple-input multiple-output (MIMO) systems with statistical
channel state information of all receivers available at the transmitter. Firstly, we formulate the
EE maximization problem. We reduce the number of variables to be solved and simplify this
large-dimensional-matrix-valued problem into a real-vector-valued problem. Next, we lower the
computational complexity significantly by replacing the objective with its deterministic equivalent
to avoid the high-complex expectation operation. With guaranteed convergence, we propose an
iterative algorithm on beam domain power allocation using the minorize maximize algorithm and
Dinkelbach’s transform and derive the locally optimal power allocation strategy to achieve the optimal
EE. Finally, we illustrate the significant EE performance gain of our EE maximization algorithm
compared with the conventional approach through conducting numerical simulations.
Keywords: energy efficiency; non-orthogonal unicast and multicast transmission; statistical channel
state information; massive MIMO; beam domain
1. Introduction
As mobile data expands rapidly, it is expected that global wireless data traffic will surpass
100 exabytes per month by 2023 [1]. A considerable proportion of the data traffic, such as massive
software updating and sports broadcasting, is of common interest, which stimulates the demand for
services that can deliver the same data to a group of user terminals (UTs) efficiently. Since physical
layer multicasting can provide efficient point-to-multipoint wireless transmission, it has great potential
for future mobile communication systems [2–4].
Recently, non-orthogonal unicast and multicast (NOUM) transmission has been gaining increasing
interest [5–7]. At the transmitter, the unicast and multicast signals are precoded and then sent out to the
receivers simultaneously, sharing the same time-frequency resources. Compared with the conventional
orthogonal unicast and multicast (OUM) transmission, NOUM transmission is more spectrum-efficient,
and more suitable for scenarios where both multicast and unicast signals are needed by a UT. Massive
multiple-input multiple-output (MIMO) has become one of the core technologies of the fifth generation
wireless system for its significant performance in energy efficiency (EE) and spectral efficiency [8,9].
Therefore, there has been considerable research on the combination of multicast transmission and
massive MIMO systems [6,10,11]. Please note that mutual coupling is a major concern in massive
MIMO because it can weaken the system performance [12–16]. We assume perfect isolation between
the antennas without loss of generality.
EE has become a significant design criterion for wireless communication systems [17–19].
The broad-scale antenna arrays equipped at the base station (BS) cause the power consumption
to increase in massive MIMO systems, and the energy consumed by wireless communications is
responsible for greenhouse gas emissions [20], which motivates the need to design energy-efficient
systems [8,21,22]. EE of a massive MIMO system was considered in [8]. However, it ignores the power
consumed by the BS circuit, while in [21], research on maximizing the EE and power transfer efficiency
for wireless-powered systems was analyzed, taking the circuit power consumption into account.
In [22], how the system parameters (number of antennas, transmitted power and number of UTs) affect
the EE of a multi-user MIMO system was investigated.
There are also previous works that studied energy-efficient NOUM transmission in massive
MIMO systems [23–25]. In [23], energy-efficient NOUM beamforming in multi-cell multi-user MIMO
scenario was studied. An optimization beamforming algorithm was proposed in [24] to optimize the EE
in the multi-cell multicast system. The extension of the problem was investigated in [25], which takes
antenna selection into consideration.
Please note that most of the previous works made the assumption that the UTs’ instantaneous
channel state information (CSI) is available at the BS. However, in realistic systems, obtaining good
estimates of instantaneous CSI is a challenging job [26–28]. Compared with obtaining instantaneous
CSI, the acquisition of statistical CSI is easier and more precise. In [11], rate maximization problem
for NOUM massive MIMO transmission was considered, and the EE maximization problem for
physical-layer multicast transmission was investigated in [29], both assumed that the BS only has
access to the UTs’ statistical CSI.
To our knowledge, the research on EE optimization of NOUM transmission for massive MIMO
systems with statistical CSI at the transmitter has not been studied yet. We investigate this problem in
our work, and the major contributions we provide in this paper are listed as follows:
• With statistical CSI, we formulate the EE maximization problem for NOUM transmission in the
massive MIMO scenario.
• We determine the optimal transmit directions of the multicast and unicast transmission in
closed-form, respectively, and then simplify the large-scale complex-matrix-valued precoding
design problem into a real-vector-valued power allocation problem in the beam domain.
• We reduce the computational complexity of the EE optimization problem significantly by replacing
the objective function with its deterministic equivalent (DE).
• With guaranteed convergence, we propose an algorithm on beam domain power allocation
using the minorize maximize (MM) algorithm and Dinkelbach’s transform. We deal with the EE
optimization problem by iteratively solving a series of related convex optimization problems.
The remainder of the paper is constructed as follows. The channel model is introduced in Section 2.
The EE maximization problem is formulated and investigated in Section 3. Numerical simulations are
conducted in Section 4. Section 5 summarizes the paper.
Column vectors and matrices are represented by lower and upper case boldface letters,
respectively, whereas italic letters stand for scalars, and the following are other notations used in
this paper.
228
Electronics 2019, 8, 857
2. System Model
Consider a single cell massive MIMO system with an M-antenna BS, jointly serves K UTs.
Denote by K {1, 2, . . . , K } the UT set, where the kth UT is equipped with Nk antennas. The multicast
and unicast services are carried out with the same time-frequency resources. The BS sends a multicast
signal that is of common interest to all the UTs in the cell while delivering unique messages to UTs
according to each UT’s demand during the downlink transmission, as shown in Figure 1.
where xm ∈ C M×1 represents the multicast signal and xuk ∈ C M×1 denotes the unicast signal sent to the
kth UT. Assume that xm and xuk are mutually uncorrelated, zero-mean, and their covariance matrices
+ ,
are Qm and Quk , respectively. Define tr {Qm } as the multicast transmission power and tr Quk as the
unicast transmission power. At the kth UT, the received signal is denoted by
yk = Hk x + nk ∈ C Nk ×1 , (2)
where Hk is the downlink channel matrix of size Nk × M, and nk ∼ CN 0, σ2 I Nk represents the
additive circularly symmetric complex-valued Gaussian noise with the variance being σ2 .
We adopt Weichselberger’s channel model [30,31] in our work because the correlation properties
between the transmit and receive ends of Weichselberger’s channel model are jointly considered rather
than separately characterized in the Kronecker model. Then, we can write the downlink channel
matrix in (2) as
Hk = Uk Gk VkH ∈ C Nk × M , (3)
where the average power of [Gk ]i,j is represented by [Ωk ]i,j . As Ωk has the property of remaining
approximately constant while the frequency changes widely, the statistical CSI can be obtained
accurately and efficiently [32].
The vast number of antenna arrays employed at the BS brings about new channel properties for
massive MIMO systems. For example, as the BS antenna number M tends to infinity, the eigenvector
matrices of the transmit correlation matrices between the BS and all UTs tend to be the same and are
229
Electronics 2019, 8, 857
only affected by the BS array topology [32,33]. Denote the corresponding deterministic unitary matrix
as V, and then in the massive MIMO scenario, the downlink channel matrix becomes
M→∞
Hk = Uk Gk V H . (5)
Please note that many of the previous works on massive MIMO adopted the channel model
mentioned in (5) such as [26,29,34], and it can achieve quite accurate performance [34].
Each UT will decode the common multicast signal and its desired unicast signal in order by
applying successive interference cancellation (SIC) method.
During the process of multicast decoding, the kth UT regards the term Hk xm in (6) as the desired
message while treating the others as interference. For the covariance matrix of the interference and
noise, we have
! "
k = σ I Nk + ∑ E Hk Qk Hk
Km 2 u H
∈ C Nk × Nk . (7)
2 34 5 k ∈K
noise 2 34 5
interference
Since UT k has the knowledge of its own instantaneous CSI and the covariance matrix Km
k ,
during the multicast transmission, we denote by Rm
k the ergodic rate of the kth UT
! ! ""
k = E log det Kk + Hk Q Hk
Rm − log det {Km
m m H
k }. (8)
+ ,
Then the terms E Gk V H Quk VGkH in (10) can be rewritten as Ak V H Quk V .
230
Electronics 2019, 8, 857
For the unicast signal decoding, with SCI, the multicast signal is removed, so the interference only
contains the unicast signal meant for other UTs. For the covariance matrix of the interference and noise
at the kth UT, we have ! "
Kuk = σ2 I Nk + ∑ E Hk Quk HkH ∈ C Nk × Nk . (12)
2 34 5 k =k
noise 2 34 5
interference
Then we denote by Ruk the ergodic rate of the kth UT during the unicast transmission
! ! ""
Ruk = E log det Kuk + Hk Quk HkH − log det {Kuk } . (13)
By inserting the massive system model in (5) and the Sylvester’s determinant identity into (13),
the unicast rate Ruk at the kth UT becomes
! ! "" ! "
u u
Ruk = E log det Kk + Gk V H Quk VGkH − log det Kk , (14)
u
where Kk is defined as
Ak V H Quk V ∈ C Nk × Nk ,
u
Kk σ2 I Nk + ∑ (15)
k =k
where the constant-coefficient μ ≥ 1 accounts for the reciprocal of the transmit amplifier drain efficiency.
+ ,
tr {Qm } means the multicast transmit power, and ∑k∈K tr Quk denotes the total unicast transmit
power. Pc stands for the constant circuit power consumption per antenna and is unaffected by the
actual transmit power. Ps represents the BS static power consumption and is irrelevant to the number
of antennas.
In the following, we formulate the EE optimization problem for NOUM massive MIMO system.
We aim at identifying the optimal transmit covariance matrices Qm and Quk for multicast and
unicast transmission that can maximize the system EE, respectively. We define a weight matrix
u = [u0 , u1 , . . . , uK ] with u0 being the weight of multicast rate and uk being the weight of kth unicast
rate. Then we can denote by R the weighted sum rate as follows:
R u0 K (min Rm
k
k )+ ∑ uk Ruk , (17)
k ∈K
231
Electronics 2019, 8, 857
W u0 K (min Rmk ) + ∑ uk Ruk
k k ∈K
max ,
Qm ,Quk ,∀k ∈K + u,
μ tr {Qm } + ∑ tr Qk + MPc + Ps
k ∈K (19)
s.t. tr {Qm } + ∑ tr {Quk } ≤ Pmax ,
k ∈K
Qm 0, Quk 0 (∀k ∈ K) ,
the transmitted signals. Meanwhile, Λm and Λuk are diagonal matrices with their diagonal elements
constituted by the eigenvalues of Qm and Quk , respectively, which denote the allocated power over the
corresponding directions.
The following theorem determines the values of the eigenvectors of Qm and Quk .
Theorem 1. The optimal multicast and unicast transmit covariance matrices of problem (19) is
u,opt
Qm,opt = VΛm V H , Qk = VΛuk V H , ∀k, (20)
where Λm and Λuk (∀k) are both diagonal matrices, and the matrix V equals to the eigenvector matrices of the
correlation matrices between the BS and all UTs and only depends on the BS array topology. The eigenvectors of
Qm and Quk are given by the columns of the matrix V,
Theorem 1 above indicates that when solving problem (19), since the eigenvectors
are deterministic, we only have to determine the power allocation matrix denoted by
+ m u u ,
Λ Λ , Λ1 , Λ2 , . . . , ΛuK , which reduces the number of variables to be optimized and the
computational complexity significantly. Therefore, the large-dimensional complex-matrix-valued
EE maximization problem can be transformed into a real-vector-valued power allocation problem in
the beam domain.
m u
Rewrite Kk and Kk as follows
m
Kk (Λ) σ2 I Nk + ∑ Ak Λuk , (21)
k ∈K
u
Kk (Λ) σ2 I Nk + ∑ Ak Λuk , (22)
k =k
and without loss of optimality, we can simplify the problem in (19) into the problem below
W u0 K min Rmk ( Λ ) + ∑ u R
k k
u (Λ)
k k ∈K
max ,
Λ + u,
μ tr {Λm } + ∑ tr Λk + MPc + Ps
k ∈K (23)
s.t. tr {Λ } +
m
∑ tr {Λuk } ≤ Pmax ,
k ∈K
Λm 0, Λ m
diagonal, Λuk 0, Λuk diagonal (∀k ∈ K) ,
232
Electronics 2019, 8, 857
233
Electronics 2019, 8, 857
⎤ ⎧
⎡ ⎪ Nk [Ωk ]n,i
∂t− ⎪
⎨ a = k,
k Λ( p) ∑ ,
( p) M
Δtk =⎣ ⎦ = n =1 σ 2 + ∑ ∑ [Ωk ]n,m Λuk ,( p) (29)
i,i ∂Λua ⎪
⎪ k =k m =1 m,m
⎩
i,i 0, a = k,
respectively.
Since t−
k Λ( p) and Λ a,( p) in (27) are constant in each iteration, we can ignore them and obtain
u
Λ ( p +1) =
T #
k ( Λ( p) )
∂t−
W u0 Kη + ∑ uk (Λ) − ∑ trt+
k ∂Λua Λua
k ∈K a =k
arg max ,
Λ
+ ,
μ tr {Λm } + ∑ tr Λuk + MPc + Ps
k∈K
⎧⎛ ⎞T ⎫ (30)
⎪
⎨ ∂s− Λ ⎪
⎬
k ( p )
s.t. s+ (Λ) − s− Λ( p) − ∑ tr ⎝ ⎠ Λua − Λu
a,( p) ⎪ − η ≥ 0 (∀ k ∈ K) ,
k k ⎪
a∈K ⎩
∂Λ u
a ⎭
Although the numerator of the objective function and constraint of the transformed
sub-problem (30) are concave, the computational complexity can still be quite high if the expectation
operation is manipulated using Monte-Carlo methods. Via applying the large-dimensional random
matrix theory in [36,37], we further reduce the optimization complexity by substituting the minuends
k ( Λ ) and Rk ( Λ ) with their DEs, respectively.
of Rm u
First, we define a diagonal matrix-valued function Yk (X) of size M × M, and its ith diagonal
element is ! ! " "
[Yk (X)]i,i = tr diag [Ωk ]:,i X . (31)
where Γm 6m 6m
k , Γk and Φk are given by
−1
6 m Km ∈ C M× M ,
k = Yk
Γm Φ k k (Λ)
6 m = Ak −1
Γ k Λm (I M + Λm Γmk ) ∈ C Nk × Nk , (33)
−1
6 m = IN
Φ 6 m Km
+Γ k (Λ) ∈ C Nk × Nk ,
k k k
234
Electronics 2019, 8, 857
6u and Φ
where Γuk , Γ 6 u are given by
k k
−1
Γuk = Yk 6 u Kuk (Λ)
Φ ∈ C M× M ,
k
6u = Ak Λu (I M + Λu Γu )−1 ∈ C Nk × Nk ,
Γ (35)
k k k k
−1
6 u = IN + Γ
Φ 6u Kuk (Λ) ∈ C Nk × Nk .
k k k
Λ ( p +1) =
T #
k ( Λ( p) )
∂t−
W u0 Kη + ∑ uk Tk+ (Λ) − ∑ tr ∂Λua Λ u
a
k ∈K a =k
arg max ,
Λ
+ ,
μ tr {Λm } + ∑ tr Λuk + MPc + Ps
k ∈K
⎧⎛ ⎞T ⎫ (36)
⎪
⎨ ∂s− ⎪
⎬
k Λ( p)
s.t. Sk+ (Λ) − s−
k Λ( p) − ∑ tr ⎝ ⎠ Λua − Λu
( ) − η ≥ 0 (∀k ∈ K) ,
⎪
a∈K ⎩
∂Λ a
u a, p ⎪
⎭
We can observe from the optimization problem in (36) that the denominator and numerator of
the objective function are linear and concave functions of Λ, respectively. We invoke Dinkelbach’s
transform [38] to deal with this concave-linear program. We can obtain the solution to (36) via solving
a series of problems below
! "
( q +1)
Λ( p) , η ( q +1) =
⎛ ⎛ ⎧⎛ ⎞T ⎫⎞⎞
⎪
⎨ ∂t− ⎪
⎬
⎜ ⎜ k Λ( p) ⎟⎟ (q)
arg max W ⎝u0 Kη + ∑ uk ⎝ Tk+ (Λ) −
∑ tr ⎪⎝ ∂Λua ⎠ Λa ⎪⎠⎠ − χ( p) P (Λ) ,
u
Λ k ∈K a =k ⎩ ⎭
⎧⎛ ⎞T ⎫
⎪
⎨ ∂s− Λ( p) ⎪
⎬ (37)
k
s.t. Sk+ (Λ) − s−
k Λ( p) − ∑ tr ⎪
⎝ ⎠ Λua − Λu
a,( p) ⎪ − η ≥ 0 (∀ k ∈ K) ,
a∈K ⎩ ∂Λ u
a ⎭
From the analysis above, we can observe that the proposed EE optimization algorithm involves
two-layer iterations. During the outer iteration, via invoking the MM algorithm, we replace the
numerator of the objective function in (26) with its lower bound function, thus making the numerator
235
Electronics 2019, 8, 857
concave. The MM-based algorithm is guaranteed to converge to the locally optimal solution [39–41];
in the inner iteration, we transform the fractional problem in (36) into solvable convex optimization
problems in (37) via Dinkelbach’s transform, which can derive the global optimum solution to (36)
with guaranteed convergence [42]. After several iterations, we can obtain the optimal beam domain
power allocation matrix Λ. Please note that Λ is locally optimal due to the local optimality of MM
algorithm. We present our algorithm in Algorithm 1.
Algorithm 1 Energy-Efficient Power Allocation Algorithm in the Beam Domain for Massive MIMO
NOUM Transmission
Input: Beam domain channel statistics Ωk , initial power allocation matrix Λ(0) , outer iteration
! "
W u0 K min Sk+ Λ( p) − s−
k Λ ( p) + ∑ u k Tk
+
Λ ( p) − t −
k Λ ( p)
k
EE( p) = k∈K (39)
P Λ( p)
3: while EE( p) − EE( p−1) ≥ 1 do
(q) (q)
4: Initialization: q = 0, let Λ( p) = Λ( p) , calculate χ( p) with (38)
(q) ( q −1)
5: while χ( p) − χ( p) ≥ 2 do
6: Let q = q + 1
(q) ( q −1)
7: Calculate Λ( p) via solving problem (37) with χ( p)
(q)
8: Calculate χ( p) using (38)
9: end while
10: Let p = p + 1
(q)
11: Let Λ( p) = Λ( p−1)
12: Calculate EE( p) with (39)
13: end while
14: return Λ = Λ( p)
4. Numerical Results
We provide numerical simulation results to demonstrate the performance of the EE optimization
algorithm proposed above for NOUM transmission massive MIMO scenario with statistical CSI.
Table 1 illustrates how the numerical simulation parameters are set.
First of all, in Figure 2, we illustrate the convergence performance by showing the iteration
process of our EE optimization algorithm under different transmit power budgets Pmax . The horizontal
ordinate is the outer iteration index. As we can see, the EE converges after only a few iterations. Also,
we can observe that in the lower power budget regime, the EE performance convergences faster than
that in the higher power budget regime.
236
Electronics 2019, 8, 857
Parameter Value
Scenario Suburban macro
Channel model 3GPP SCM
Pathloss −120 dB (∀k)
Array topology ULA with antenna spacing half wavelength
Noise variance σ2 = −131 dBm
Number of UTs K=8
Number of BS antennas M = 128
Number of UT antennas Nk = 4 (∀k)
Transmission bandwidth W= 10 MHz
Amplifier drain efficiency μ=5
Circuit power consumption per antenna Pc = 30 dBm
Static power consumption Ps = 40 dBm
Weights u0 = 0.7, uk = 0.3 (∀k )
G%P
G%P
G%P
(QHUJ\HIILFLHQF\ ELWV-RXOH
G%P
1XPEHURILWHUDWLRQV
Figure 2. The convergence performance of the proposed EE optimization algorithm for different power
budgets Pmax .
Then, we evaluate the EE of the NOUM transmission versus the power budget Pmax under
different numbers the antennas M at the BS in Figure 3. As we can see, the EE performance decreases
when the BS antenna number M increases for the reason that in the power consumption model we
adopted in (16), the total circuit power consumption grows linearly with M, the BS antenna number.
Next, the comparison of the EE performance of the power allocation algorithm proposed above
with the rate maximization approach [11] is shown in Figure 4. We notice that the EE performance of
the two approaches are similar at low transmit power budget regime. However, when the transmit
power budget gets high, the EE performance of the rate maximization approach decreases, while that
of our EE maximization approach remains high. This indicates that the rate maximization approach can
achieve almost EE optimal when Pmax is low. However, our EE maximization approach outperforms
the rate maximization one at high transmit power budget regime.
237
Electronics 2019, 8, 857
(QHUJ\HIILFLHQF\ ELWV-RXOH 0
0
0
7UDQVPLWSRZHUEXGJHW3PD[ G%P
Figure 3. The EE performance of the NOUM transmission versus the power budget Pmax for different
numbers of BS antennas M.
((PD[
5DWHPD[
(QHUJ\HIILFLHQF\ ELWV-RXOH
7UDQVPLWSRZHUEXGJHW3PD[ G%P
Figure 4. The EE performance of the proposed beam domain power allocation algorithm compared
with the rate maximization approach.
Finally, in Figure 5, the EE performance of our power allocation approach and that of full CSI
approach, which assumes instantaneous CSI is known at the BS, is compared. Since full CSI is an ideal
case, it can achieve better EE performance than other imperfect CSI situation. However, the full CSI
case suffers from pilot overhead. As Figure 5 illustrates, our proposed algorithm surpasses the full CSI
approach with 3/7 pilot overhead [43] in the EE performance.
238
Electronics 2019, 8, 857
6WDWLVWLFDO&6,
(QHUJ\HIILFLHQF\ ELWV-RXOH )XOO&6,
)XOO&6,SLORWRYHUKHDG
7UDQVPLWSRZHUEXGJHW3PD[ G%P
Figure 5. The comparison on the EE performance of proposed algorithm, full CSI case and full CSI
with 3/7 overhead.
5. Conclusions
To conclude, we considered the EE optimization problem in NOUM transmission systems
with statistical CSI available at the BS. We first formulated the EE maximization problem, and then
determined the closed-form optimal eigenvectors of the multicast and unicast transmit covariance
matrices for optimal EE, respectively. Next, with guaranteed convergence, we proposed a beam domain
power allocation algorithm adopting the MM algorithm, DE and Dinkelbach’s transform and derived
the locally optimal power allocation strategy to achieve the EE optimization. Finally, with numerical
results, we presented the performance gain of our EE maximization algorithm compared with the
conventional approach.
Author Contributions: W.W. and Y.H. perceived the idea and wrote the manuscript. L.Y., J.X., J.L. and X.G. gave
valuable suggestions on the structuring of the paper and assisted in the revising and proofreading.
Funding: This work was supported by the National Key R&D Program of China under Grant 2018YFB1801103,
the National Natural Science Foundation of China under Grants 61631018, 61801114, 61761136016, 61771264,
61871465, and 61501113, the Natural Science Foundation of Jiangsu Province under Grant BK20170688,
the Fundamental Research Funds for the Central Universities, and the Huawei Cooperation Project.
Acknowledgments: The authors would like to thank the Editor and the anonymous reviewers for their valuable
comments and suggestions.
Conflicts of Interest: The authors declare no conflict of interest.
239
Electronics 2019, 8, 857
References
1. Lecompte, D.; Gabin, F. Evolved multimedia broadcast/multicast service (eMBMS) in LTE-advanced:
Overview and Rel-11 enhancements. IEEE Commun. Mag. 2012, 50, 68–74. [CrossRef]
2. Wang, W.; Liu, A.; Zhang, Q.; You, L.; Gao, X.; Zheng, G. Robust multigroup multicast transmission for
frame-based multi-beam satellite systems. IEEE Access 2018, 6, 46074–46083. [CrossRef]
3. You, L.; Liu, A.; Wang, W.; Gao, X. Outage constrained robust multigroup multicast beamforming for
multi-beam satellite communication systems. IEEE Wirel. Commun. Lett. 2019, 8, 352–355. [CrossRef]
4. You, L.; Wang, J.; Wang, W.; Gao, X. Secure multicast transmission for massive MIMO with statistical channel
state information. IEEE Signal Process. Lett. 2019, 26, 803–807. [CrossRef]
5. Chen, E.; Tao, M.; Liu, Y. Joint base station clustering and beamforming for non-orthogonal multicast and
unicast transmission with backhaul constraints. IEEE Trans. Wirel. Commun. 2018, 17, 6265–6279. [CrossRef]
6. Sadeghi, M.; Björnson, E.; Larsson, E.G.; Yuen, C.; Marzetta, T. Joint unicast and multi-group multicast
transmission in massive MIMO systems. IEEE Trans. Wirel. Commun. 2018, 17, 6375–6388. [CrossRef]
7. Wang, J.; Xu, H.; Zhu, B.; Fan, L.; Zhou, A. Hybrid beamforming design for mmWave joint unicast and
multicast transmission. IEEE Commun. Lett. 2018, 22, 2012–2015. [CrossRef]
8. Ngo, H.Q.; Larsson, E.G.; Marzetta, T.L. Energy and spectral efficiency of very large multiuser MIMO
systems. IEEE Trans. Commun. 2013, 61, 1436–1449.
9. Hoydis, J.; ten Brink, S.; Debbah, M. Massive MIMO in the UL/DL of cellular networks: How many antennas
do we need? IEEE J. Sel. Areas Commun. 2013, 31, 160–171. [CrossRef]
10. Sadeghi, M.; Björnson, E.; Larsson, E.G.; Yuen, C.; Marzetta, T.L. Max–min fair transmit precoding for
multi-group multicasting in massive MIMO. IEEE Trans. Wirel. Commun. 2018, 17, 1358–1373. [CrossRef]
11. You, L.; Xiong, J.; Li, K.; Wang, W.; Gao, X. Non-orthogonal unicast and multicast transmission for massive
MIMO with statistical channel state information. IEEE Access 2018, 6, 66841–66849. [CrossRef]
12. Alibakhshikenari, M.; Virdee, B.S.; See, C.H.; Abd-Alhameed, R.A.; Falcone, F.; Limiti, E. High-isolation
leaky-wave array antenna based on CRLH-metamaterial implemented on SIW with 30o frequency
beam-scanning capability at millimetre-waves. Electronics 2019, 8, 642. [CrossRef]
13. Song, Y.J.; Sarabandi, K. Equivalent circuit model for metamaterial-based electromagnetic band-gap isolator.
IEEE Antennas Wirel. Propag. Lett. 2012, 11, 1366–1369. [CrossRef]
14. Alibakhshikenari, M.; Khalily, M.; Virdee, B.S.; See, C.H.; Abd-Alhameed, R.A.; Limiti, E. Mutual-coupling
isolation using embedded metamaterial EM bandgap decoupling slab for densely packed array antennas.
IEEE Access 2019, 7, 51827–51840. [CrossRef]
15. Farsi, S.; Aliakbarian, H.; Schreurs, D.; Nauwelaers, B.; Vandenbosch, G.A.E. Mutual coupling reduction
between planar antennas by using a simple microstrip U-section. IEEE Antennas Wirel. Propag. Lett. 2012,
11, 1501–1503. [CrossRef]
16. Alibakhshikenari, M.; Khalily, M.; Virdee, B.S.; See, C.H.; Abd-Alhameed, R.A.; Limiti, E. Mutual coupling
suppression between two closely placed microstrip patches using EM-bandgap metamaterial fractal loading.
IEEE Access 2019, 7, 23606–23614. [CrossRef]
17. Singh, K.; Gupta, A.; Ratnarajah, T. QoS-driven energy-efficient resource allocation in multiuser
amplify-and-forward relay networks. IEEE Trans. Signal Inf. Process. Over Netw. 2017, 3, 771–786. [CrossRef]
18. Singh, K.; Gupta, A.; Ratnarajah, T. QoS-driven resource allocation and EE-balancing for multiuser two-way
amplify-and-forward relay networks. IEEE Trans. Wirel. Commun. 2017, 16, 3189–3204. [CrossRef]
19. Singh, K.; Gupta, A.; Ratnarajah, T. A utility-based joint subcarrier and power allocation for green
communications in multi-user two-way regenerative relay networks. IEEE Trans. Commun. 2017, 65, 3705–3722.
[CrossRef]
20. Li, G.Y.; Xu, Z.; Xiong, C.; Yang, C.; Zhang, S.; Chen, Y.; Xu, S. Energy-efficient wireless communications:
tutorial, survey, and open issues. IEEE Wirel. Commun. 2011, 18, 28–35. [CrossRef]
21. Khan, T.A.; Yazdan, A.; Heath, R.W. Optimization of power transfer efficiency and energy efficiency for
wireless-powered systems with massive MIMO. IEEE Trans. Wirel. Commun. 2018, 17, 7159–7172. [CrossRef]
22. Björnson, E.; Sanguinetti, L.; Hoydis, J.; Debbah, M. Optimal design of energy-efficient multi-user MIMO
systems: Is massive MIMO the answer? IEEE Trans. Wirel. Commun. 2015, 14, 3059–3075. [CrossRef]
240
Electronics 2019, 8, 857
23. Tervo, O.; Tran, L.; Chatzinotas, S.; Juntti, M.; Ottersten, B. Energy-efficient joint unicast and multicast
beamforming with multi-antenna user terminals. In Proceedings of the 2017 IEEE 18th International
Workshop on Signal Processing Advances in Wireless Communications (SPAWC), Sapporo, Japan,
3–6 July 2017; pp. 1–5.
24. He, S.; Huang, Y.; Jin, S.; Yang, L. Energy efficient coordinated beamforming design in multi-cell multicast
networks. IEEE Commun. Lett. 2015, 19, 985–988. [CrossRef]
25. Tervo, O.; Tran, L.; Pennanen, H.; Chatzinotas, S.; Ottersten, B.; Juntti, M. Energy-efficient multicell
multigroup multicasting with joint beamforming and antenna selection. IEEE Trans. Signal Process.
2018, 66, 4904–4919. [CrossRef]
26. You, L.; Gao, X.; Xia, X.; Ma, N.; Peng, Y. Pilot reuse for massive MIMO transmission over spatially correlated
rayleigh fading channels. IEEE Trans. Wirel. Commun. 2015, 14, 3352–3366. [CrossRef]
27. You, L.; Gao, X.; Swindlehurst, A.L.; Zhong, W. Channel acquisition for massive MIMO-OFDM with
adjustable phase shift pilots. IEEE Trans. Signal Process. 2016, 64, 1461–1476. [CrossRef]
28. You, L.; Chen, X.; Wang, W.; Gao, X. Coordinated multicast precoding for multi-cell massive MIMO
transmission exploiting statistical channel state information. Electronics 2018, 7, 338. [CrossRef]
29. You, L.; Wang, W.; Gao, X. Energy-efficient multicast precoding for massive MIMO transmission with
statistical CSI. Energies 2018, 11, 3175. [CrossRef]
30. Weichselberger, W.; Herdin, M.; Ozcelik, H.; Bonek, E. A stochastic MIMO channel model with joint
correlation of both link ends. IEEE Trans. Wirel. Commun. 2006, 5, 90–100. [CrossRef]
31. Gao, X.; Jiang, B.; Li, X.; Gershman, A.B.; McKay, M.R. Statistical eigenmode transmission over jointly
correlated MIMO channels. IEEE Trans. Inf. Theory 2009, 55, 3735–3750. [CrossRef]
32. You, L.; Gao, X.; Li, G.Y.; Xia, X.; Ma, N. BDMA for millimeter-wave/terahertz massive MIMO transmission
with per-beam synchronization. IEEE J. Sel. Areas Commun. 2017, 35, 1550–1563. [CrossRef]
33. Sun, C.; Gao, X.; Jin, S.; Matthaiou, M.; Ding, Z.; Xiao, C. Beam division multiple access transmission for
massive MIMO communications. IEEE Trans. Commun. 2015, 63, 2170–2184. [CrossRef]
34. Wen, C.; Jin, S.; Wong, K.; Chen, J.; Ting, P. Channel estimation for massive MIMO using Gaussian-mixture
Bayesian learning. IEEE Trans. Wirel. Commun. 2015, 14, 1356–1368. [CrossRef]
35. Ng, D.W.K.; Lo, E.S.; Schober, R. Energy-efficient resource allocation in OFDMA systems with large numbers
of base station antennas. IEEE Trans. Wirel. Commun. 2012, 11, 3292–3304. [CrossRef]
36. Lu, A.; Gao, X.; Xiao, C. Free deterministic equivalents for the analysis of MIMO multiple access channel.
IEEE Trans. Inf. Theory 2016, 62, 4604–4629. [CrossRef]
37. Couillet, R.; Debbah, M. Random Matrix Methods for Wireless Communications; Cambridge University Press:
Cambridge, UK, 2011. [CrossRef]
38. Dinkelbach, W. On nonlinear fractional programming. Manag. Sci. 1967, 13, 492–498. [CrossRef]
39. Lipp, T.; Boyd, S. Variations and extension of the convex-concave procedure. Optim. Eng. 2016, 17, 263–287.
[CrossRef]
40. Yuille, A.L.; Rangarajan, A. The concave-convex procedure. Neural Comput. 2003, 15, 915–936. [CrossRef]
41. Hunter, D.R.; Lange, K. A tutorial on MM algorithms. Am. Stat. 2004, 58, 30–37. [CrossRef]
42. Zappone, A.; Jorswieck, E. Energy efficiency in wireless networks via fractional programming theory.
Found. Trends R
Commun. Inf. Theory 2015, 11, 185–396. [CrossRef]
43. Marzetta, T.L. Noncooperative Cellular Wireless with Unlimited Numbers of Base Station Antennas.
IEEE Trans. Wirel. Commun. 2010, 9, 3590–3600. [CrossRef]
44. Tulino, A.M.; Lozano, A.; Verdú, S. Capacity-achieving input covariance for single-user multi-antenna
channels. IEEE Trans. Wirel. Commun. 2006, 5, 662–671. [CrossRef]
c 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access
article distributed under the terms and conditions of the Creative Commons Attribution
(CC BY) license (http://creativecommons.org/licenses/by/4.0/).
241
electronics
Article
Artificial Intelligence-Based Discontinuous Reception
for Energy Saving in 5G Networks
Mudasar Latif Memon 1 , Mukesh Kumar Maheshwari 2 , Navrati Saxena 3, *, Abhishek Roy 4 and
Dong Ryeol Shin 3, *
1 College of Information and Communication Engineering, Sungkyunkwan University, Suwon 16419, Korea
2 Department of Electrical Engineering, Bahria University, Karachi 75260, Pakistan
3 College of Software, Sungkyunkwan University, Suwon 16419, Korea
4 Advanced Communication Technology, Wireless System Design, MediaTek USA Inc.,
San Jose, CA 95134, USA
* Correspondence: [email protected] (N.S.); [email protected] (D.R.S.); Tel.: +82-31-299-4676 (N.S.)
Abstract: 5G is expected to deal with high data rates for different types of wireless traffic. To enable
high data rates, 5G employs beam searching operation to align the best beam pairs. Beam searching
operation along with high order modulation techniques in 5G, exhausts the battery power of user
equipment (UE). LTE network uses discontinuous reception (DRX) with fixed sleep cycles to save UE
energy. LTE-DRX in current form cannot work in 5G network, as it does not consider multiple beam
communication and the length of sleep cycle is fixed. On the other hand, artificial intelligence (AI) has
a tendency to learn and predict the packet arrival-time values from real wireless traffic traces. In this
paper, we present AI based DRX (AI-DRX) mechanism for energy efficiency in 5G enabled devices.
We propose AI-DRX algorithm for multiple beam communications, to enable dynamic short and long
sleep cycles in DRX. AI-DRX saves the energy of UE while considering delay requirements of different
services. We train a recurrent neural network (RNN) on two real wireless traces with minimum root
mean square error (RMSE) of 5 ms for trace 1 and 6 ms for trace 2. Then, we utilize the trained RNN
model in AI-DRX algorithm to make dynamic short or long sleep cycles. As compared to LTE-DRX,
AI-DRX achieves 69% higher energy efficiency on trace 1 and 55% more energy efficiency on trace 2,
respectively. The AI-DRX attains 70% improvement in energy efficiency for trace 2 compared with
Poisson packet arrival model for λ = 1/20.
1. Introduction
The use of cellular gadgets, like smartphones, notebooks, and tablets has comforted our life.
The Ericsson mobility report predicts the rise of cellular traffic to 8.8 billion by 2024 [1]. These extensive
growing cellular users require improved data rates with heterogeneous services in next generation
networks. 5G expects to deal with various types of traffics including periodic and delay tolerant traffic
for IoT devices or burst type of traffic for delay intolerant services [2,3]. 3rd Generation Partnership
Project (3GPP) planned the standardization of GHz spectrum (mm-wave) to address the users’ demand
of high bandwidth. However, communication over high-frequency bands of the mm-wave requires
directional air interface and narrower beams to reduce the path loss. In directional air interface, UE has
to search for best beam pairs and make adequate beam alignment with the next generation nodeB (gNB)
[4]. In addition to beam searching process in 5G networks, massive MIMO, higher order modulation
schemes and advanced coding techniques also increase UE energy expenses.
Long term evolution (LTE) networks utilize DRX mechanism to reduce the energy consumption
of the UE [2,5]. DRX enables a UE to save energy by switching off the radio circuitry part, in case of no
incoming data. LTE-DRX turns off radio part during long and short sleep cycles in order to reduce the
energy consumption of UE. The sleep cycles in LTE-DRX are of the fixed time period. If any new packet
arrives during the fixed time sleep cycle, the packets will be buffered at evolved NodeB (eNB). The eNB
serves the stored packets after the completion of each sleep cycle in LTE-DRX. Hence, the LTE-DRX
saves the UE energy at the cost of delay [6,7]. LTE-DRX in its current form is not suitable for energy
savings in 5G enabled cellular devices [8,9], due to two main reasons.
• The communications in LTE do not consider beamforming, whereas in 5G networks UE has to
align the best beam pair before the start of communications.
• The LTE-DRX mechanism has a fixed length of sleep duration, which increases the energy
consumption and delay. Hence, LTE-DRX is not suitable for low latency communications in
5G networks.
Authors in [3] propose DRX for 5G network, which requires UE to search for best beam pairs
after completion of each sleep cycle in order to serve the packets. The additional beam searching
operation after each sleep cycle enhances the energy consumption of UE. The work in [10] suggests
beam aware DRX approach for 5G enabled machine to machine communications, in which UE has
prior information about best beam pairs. Similarly, beam aware DRX mechanism for energy saving
in the 5G network is proposed in [11]. The beam aware mechanism may not perform well in case
of UE mobility and beam misalignment [12]. Kwon et al. [12] show that the probability of beam
misalignment increases with an increase in UE velocity. The probability of misalignment is 0.1 and
0.38 if a UE moves with the velocity of 30 Km/h and 60 Km/h, respectively.
The DRX with built-in state of beamforming is proposed by Liu et al. [13]. Authors present the
concept of DRX for multiple beam communications and utilize the semi-Markov model to design
eleven-states of DRX. Their approach considers the beam training process only in case of beam
misalignment and after completion of long sleep duration. Hence, authors save the energy of UE while
minimizing the delay for 5G services. However, the fixed duration of sleep cycles in their approach
may cause more energy consumption.
On the other hand, Recurrent Neural Network (RNN) in AI has shown incredible results to predict
the upcoming value of a given sequence [14]. Long Short-Term Memory (LSTM) is a popular type
of RNN that is specially designed to learn long-term dependencies of a sequence for predicting the
upcoming value of a sequence [15]. The term long-term dependency refers to the sequence, whose
desired/current output values (prediction results) depend on long-sequence of previous input values
rather than the only single previous input value.
Motivated by the success of RNN to learn and predict long-term dependent sequence values in
various applications [11,14,16], we use RNN to extract the pattern of packet arrival time from real
wireless traffic traces and to predict the values of the next packet arrival time. Based on the prediction
results, we propose an AI-DRX algorithm that works on a ten-state DRX model to enable energy saving.
AI-DRX for multiple beam communications in 5G network saves the UE energy by enabling dynamic
short and long sleep cycles, respectively. To be more specific, the following are our key contributions
to save UE energy in multiple beam communications scenario of 5G networks.
• We perform the training of the LSTM neural network on wireless traffic. During training, LSTM
network extracts the packet arrival time pattern from the wireless traffic trace. The prediction
results show that the trained model predicts with minimum RMSE of 5 ms on random test set
from trace 1 and 6 ms on random test set from trace 2, respectively.
• We devise DRX as a ten-state model.
• We propose an artificial intelligence based DRX mechanism for multiple beam communications in
5G networks. We suggest AI-DRX algorithm using ten state model to enable dynamic short or
long sleep cycles, depending on the prediction results.
244
Electronics 2019, 8, 778
• We evaluate the performance of AI-DRX in terms of energy efficiency and mean delay. AI-DRX
achieves the energy efficiency of 59% on trace 1 and 95% on trace 2, respectively; while considering
the mean delay requirements of different services.
The remaining paper is organized as follows. In Section 2 we present an overview of existing DRX
mechanism and introduce the RNN with focus on LSTM neural network. Section 3 proposes AI-DRX
algorithm with a ten-state DRX model, to enable dynamic short or long sleep cycles in multiple beams
communications of the 5G network. Section 4 presents the performance analysis of AI-DRX in terms of
energy efficiency and mean delay. Finally, we conclude our work in Section 5.
2. Related Work
• Active State
• Sleep State
RRC Inactivity
Timer
RRC_
RRC_ idle
connected
DRX
Inactivity
Timer Packet Activity
Active
RRC_connected Mode
Short
Long RRC_idle Mode
Sleep
Sleep
DRX Sleep State
Based on the above two states, different DRX parameters are configured in RRC_connected mode
while considering various services’ delay requirements.
245
Electronics 2019, 8, 778
• Inactivity timer
• Active time
All the data packets are transmitted/received during the active time. The inactivity timer is a
countdown timer in DRX active state. This timer re-starts every time a new data packet arrives at eNB
and then eNB serves the received packets to UE. In case of no new packet arrival, the inactivity timer
gets expired and the UE switches to sleep state for a certain time period.
• ON time
• Short sleep cycle
• Short sleep timer
• Long sleep cycle
• Long sleep timer
During ON time, a UE monitors the PDCCH. ON time always starts after completion of each
sleep cycle. A short sleep cycle is a small duration of time that saves the UE energy by switching off
the transceiver part. A short sleep cycle is repeated up to Nsc number of short sleep cycles. After
the expiry of Nsc , a UE transits to the long sleep cycle. The long sleep cycle and long sleep timer are
similar to short sleep cycle and short sleep timer but have a longer time period than the counterpart,
respectively.
Figure 2 delineates the timing diagram of LTE-DRX. As shown in Figure 2, all the data transmission
and reception take place during the active state. After reception of each new packet, inactivity timer
restarts. In the case that no new packet reaches and inactivity timer finishes the countdown timer, then
UE switches to short sleep cycle. After every short sleep cycle, a UE monitors the PDCCH for any
incoming packet during the ON time. If no new packet arrives before the completion of short sleep
timer, the UE switches to the long sleep cycle and remains there until intimation of the new packet
is received. UE transits from a long sleep to idle state if no new packets arrive and the long sleep
timer expires.
246
Electronics 2019, 8, 778
Time
ON Time
ON Time
Long Sleep Cycle
Short Sleep Cycle
ON Time
Short Sleep Cycle
Inactivity
Timer
Active State
ON Time Sleep Cycle
247
Electronics 2019, 8, 778
Tx BeamsTx Beam 1
Rx Beams
… …
Beam
Alligned
UE
gNB
Figure 3. Directional communication with multiple beams in 5G network.
IT = Sig(wx x T + wh h T −1 + b I ) (2)
248
Electronics 2019, 8, 778
Current
Packet Time
6 LJ Input Gate Output Gate 6 LJ
[7
,7 27
*7 &7 Hyp
Hyp
Tan Cell Tan K7
Input Output Predicted
K7 Modulation Gate Modulation Gate Time of
Previous
& 7 Next Packet
Packet Time )7
Value 6 LJ Forget Gate
6 LJ Sigmoid Hyp
Tan
Hyberbolic Tangent Pointwise Multiplicative Pointwise Additive
Operation
Layer Layer Operation
Cell state CT and the output layer OT can be calculated by Equations (4) and (5), respectively.
The final prediction results can be observed by output modulation gate (h T ) using Equation (6).
CT = FT ⊗ CT −1 + IT ⊗ GT (4)
OT = Sig(wx x T + wh h T −1 + bO ) (5)
h T = OT ⊗ tanh(CT ) (6)
During the training process, LSTM learns to extract the relationship between the input and desired
output (upcoming value of sequence) by adjusting various weight values in Equations (1)–(3) and (5).
Once the model is trained with the least error, the learned weight values can be used in Equations (1)–(6)
to calculate the output values of a sequence (prediction). The error between prediction result and
observed (actual) value can be computed by the root mean square error (RMSE) and is given as:
7
∑ TN=1 ( Predicted T − Observed T )2
RMSE = (7)
N
where the notation T represents the number of samples and N shows the maximum number of samples.
249
Electronics 2019, 8, 778
• S0 is the active time. A UE sends and receives the data during S0 state. UE predicts the dynamic
time TDY during S0 state.
• S1 state is threshold comparison state. S1 compares TDY with both threshold values (Th Min and
Th Max ) to decide whether to remain in the active time or switch to the dynamic sleep cycle.
• S2 state is dynamic short sleep cycle. It saves the UE power for a short period of time up to
TDY . The predicted value of TDY in the dynamic sleep cycle is always lower than that of value in
S3 state.
• S3 state highlights the dynamic long sleep cycle. It is similar to S2 state but has a larger sleep
period than S2 . The predicted value of TDY always has larger value of dynamic long sleep cycle
than that of the dynamic short sleep cycle.
• S4 state shows beam training during active time. This state allows UE to train and search for best
available beam pairs between gNB and UE.
• S5 state represents feedback after beam training operation in active time.
• S6 state shows active time after best beam pairs are aligned. S6 is different from S0 as S6 always
occurs after beam training and feedback process. Moreover, S6 enables UE to search beam pairs in
active time until best beam pairs are found.
• S7 state delineates beam training process after the execution of dynamic long or short sleep cycle.
• S8 state is feedback operation that occurs after beam training and after the execution of dynamic
long or short sleep cycle, respectively.
• S9 state is the ON period, a UE only monitors PDCCH for an incoming packet. UE could not send
or receive data packets during ON time.
• AI-DRX algorithm takes ON timer TON , minimum threshold value Th Min and maximum threshold
value Th Max as input (Line 1).
• AI-DRX examines the buffer for any incoming data packets (Line 2).
• If any packet is received in the buffer, the packets will be served and the value of dynamic sleep
time TDY will be predicted, simultaneously (Line 4).
• During the active time, if no beam misalignment occurs between UE and gNB, AI-DRX compares
the predicted value of dynamic sleep time TDY with threshold values Th Min and Th Max .
• If the predicted value of dynamic sleep time TDY is less than the minimum threshold value Th Min
(Line 8), UE continues to remain in the active time (Line 9).
250
Electronics 2019, 8, 778
• If the predicted value of dynamic sleep time TDY is greater than or equal to the minimum threshold
value Th Min and less than maximum threshold value Th Max (Line 11). UE observes dynamic short
sleep cycle (Line 12).
• In case of any beam misalignment after UE wakes up from dynamic short sleep cycle (Line 13), UE
starts the beam training process (Line 14) and Feedback (Line 15) followed by ON time (Line 16).
• During ON time, if beam pairs are still misaligned (Line 17), UE searches beam pairs again
(Line 18).
• During ON time, if a new packet arrives (Line 21), UE switches to active mode and start receiving
data packets (Line 22).
• After completion of ON time, if no new packet arrives, UE continues to sleep for previous
predicted time value TDY of the long sleep cycle or short sleep cycle. respectively (Line 24).
• If TDY is greater than Th Max (Line 28) the UE will go to dynamic long sleep cycle (Line 29).
• UE will perform beam training and feedback after completion of each dynamic long sleep cycle
(Line 30).
• In the case of beam misalignment during an active time (Line 35), UE performs beam training
(Line 36) and feedback (Line 37) and then re-enter active time (Line 38).
• During the active time (Line 38), if UE still find beam misalignment (Line 39), AI-DRX executes
beam training process again (Line 40).
• After the alignment of beam pairs, the UE transits to active time (Line 42).
Packet
0
Arrives or
LQ
Misaligned
7'
<
Beam
Train
7K
Dynamic
6
0
6 Long Sleep
D[
7'<
Beam Dynamic
No New Packet, Continue Sleep for
6
Beam Misaligned
6
Beam Misaligned
Feed
back
6
Beam Misaligned
Feed
back
7'<
6
6
Active 6
ON
251
Electronics 2019, 8, 778
3.3. AI-DRX for Enabling Dynamic Long and Short Sleep Cycles
Algorithm 1 (AI-DRX) demonstrates the use of artificial intelligence in the implementation of
DRX for 5G networks. AI-DRX makes dynamic short and long sleep cycles in DRX. AI-DRX utilizes
trained LSTM model to predict the upcoming value of packet arrival time and subsequently to enable
dynamic sleep cycles in DRX. The training process is conducted offline on two traces of real wireless
traffic acquired from the University of Massachusetts (UMass) trace repository [27] and Crawdad
data set repository [28]. Training process learns the packet arrival time pattern of increasing sequence
values from both traces. Once the LSTM network is trained offline with least prediction error, the
trained model predicts the upcoming packet time of real wireless traffic. AI-DRX algorithm calculates
the dynamic sleep cycles by using prediction results of upcoming packet arrival time value.
252
Electronics 2019, 8, 778
ON Time
Active State
Beam Training
Feedback ON Time
The second case considers the dynamic long sleep cycle up to the predicted value of TDY , if TDY is
greater than Th Max . The concept of the dynamic long sleep cycle is depicted in Figure 7. The dynamic
long sleep cycle saves more energy than that of short sleep cycle. Moreover, there are more chances of
beam misalignment after the dynamic long sleep cycles. Hence, AI-DRX performs beam training and
feedback after completion of each dynamic long sleep cycle.
The third case deals with dynamic inactivity timer. If the predicted value of TDY is less than
Th Min , the UE will remain active until the period TDY or any new packet arrives. The concept of
dynamic inactivity timer can be seen in Figures 6 and 7. AI-DRX also addresses the problem of beam
misalignment during active time. AI-DRX performs beam training and feedback in case of any beam
misalignment during active time.
It may be noted that AI-DRX utilizes dynamic short and long sleep cycles instead of static fixed
time sleep cycles. Moreover, the inactivity timer value is also dynamic according to the prediction
results (using trained LSTM model). Our proposed algorithm keeps updating the predicted time value
based on most recent received packets and their packet arrival time.
253
Electronics 2019, 8, 778
Active
Dynamic Long
Beam Training
Beam Training
Beam Training
Feedback
Feedback
Feedback
ON Time
ON Time
ON Time
Sleep Sleep
DRX saves the power of UE at the cost of delay. Hence, we considered two performance
parameters: energy efficiency (EE) and mean delay. The performance parameter; EE of the UE
is the ratio of dynamic short sleep time and dynamic long sleep time to the sum of active time (TAC ),
ON time (TON ), beam training time, feedback time, dynamic short sleep (TDY ) time and dynamic long
sleep time (TDY ). EE can be calculated by Equation (8). The beam training and feedback processes are
considered during active time (TAC ) and dynamic short and long sleep cycles take place during (TDY ).
TDY
EE = (8)
TAC + TON + TDY
Similarly, for packet inter-arrival time λκ , the mean holding period of active state can be calculated
as [5]:
1 − e−λκ TI N
TAC = −λ T (9)
e κ I N (1 − e − λκ )
Furthermore, the packets arrived during the sleep state and ON duration are stored in the buffer
until next ON period. The packet arrival events are the random observer to the sleep period and ON
state. Therefore, the mean delay is defined as the sum of mean sleep time and ON duration and is
given as:
T T
Mean Delay = DY + ON (10)
2 2
4. Performance Analysis
We use MATLAB 2019a for training and testing of RNN on two different traces (data sets).
These traces of burst traffic type are taken from the Crawdad dataset repository (trace 1) [28] and
UMass trace repository (trace 2) [27]. The trace 2 shows the traffic pattern of HTTP and video streaming
applications. The video streaming trace is used as it is expected by 2024 over three-quarters of mobile
data traffic will be video traffic [1]. The trace 1 and trace 2 include seven parameters as shown in [11].
These parameters are: (1) serial number; (2) packet arrival time in seconds; (3) source address of the
packets; (4) destination address of the packet; (5) protocol used; (6) length of packets in bytes; and
(7) additional information. We have utilized the time parameter from both traces to train the LSTM
network. Time parameter contains the information of packet arrival time. Our purpose is to train the
LSTM network until it learns the time interval pattern of packet arrival time from a given sequence.
Data pre-processing makes the training process simple and avoid training from the divergence [11,29].
Hence, we standardize the data with zero value of mean and unit value of the variance of the training
set during the training process. Moreover, we also standardize the test set during the prediction time.
Hyper-parameters in LSTM network are selected manually to make the training process more
efficient. These parameters include the number of hidden units, learning rate drop factor, the maximum
number of epochs, initial learning rate, and the optimizer used. The best selection of hyper-parameters
254
Electronics 2019, 8, 778
may result in the learning with least prediction error. The training process performance can be
measured in terms of RMSE. The less the value of RMSE, the better is the prediction results of the
trained model on unseen data (test set). We have selected 200 and 125 hidden units in LSTM network
during training process over trace 1 and trace 2, respectively. The value of the learning rate drop factor
remains 0.2 for both traces. Maximum number of epochs during the training process of trace 1 are
considered to be 600 while for trace 2 are 1000 epochs. Initial learning rate during the training process
of both traces remains 0.004. Furthermore, the optimizer used during the training process for both
traces is Adam optimizer. The above mentioned hyper-parameters reduce the RMSE during training
and testing processes.
We have considered the total length up to 130,820 sample values (323.886 s) of trace 1, while
trace 2 has 77,470 values of time samples (417.642 s) [11]. We have divided both traces into 10% of
samples as the training set and a number of different test sets randomly selected from the remaining
90% of both traces. Each test set has an equal number of samples. Figure 8a shows the RMSE for the
initial test set of trace 1 that is as small as 12 ms. Whereas, Figure 8b shows the RMSE value for trace 2
is 10 ms for the first random test set.
7LPH PV
2EVHUYHG
3UHGLFWHG
2EVHUYHG
3UHGLFWHG
6DPSOHV
6DPSOHV
506( 506(
(UURU
(UURU
6DPSOHV 6DPSOHV
(a) Trace 1. (b) Trace 2.
Figure 8. Prediction result and root mean square error (RMSE) for first test set.
The RMSE values for random test sets from the remaining 90% samples of trace 1 and trace 2 are
shown in Figure 9. It can be seen from Figure 9a,c that the minimum value of RMSE is least at 6 ms
and 5 ms on random test sets from samples 70,500 to 71,953 and 90,700 to 92,153 of trace 1, respectively.
Whereas, Figure 9b,d show the RMSE value of 6 ms and 8 ms from 10,000 to 10,899 and 65,000 to
65,899 samples of trace 2, respectively.
AI-DRX can be implemented at the gNB of 5G network. During the execution of AI-DRX,
we consider packet generation event, active event, dynamic long sleep event, dynamic short sleep
event, ON duration event, beam searching event and feedback event. Moreover, our approach
enables dynamic sleep cycles in multiple beam communications scenario for 5G networks. The packet
generation event in our simulation scans the time column of both traces and produces the data packets
of the identical length on the same instant in the respective trace. The generated data packets are
collected in the buffer and served to the UE during the active event. UE checks the buffer during ON
period, in case of any packet in buffer, the UE switches to the active event, or else continues to sleep.
During an active event, the buffered packets are served to UE after getting the beam pairs alignment
between UE and gNB. At the same time, packet arrival time is inputted to the trained LSTM model to
predict the upcoming packet arrival time. We can obtain the dynamic sleep duration by subtracting
the previous packet arrival time value from the predicted dynamic time value TDY (upcoming packet
arrival time).
255
Electronics 2019, 8, 778
7LPH PV
7LPH PV
2EVHUYHG
2EVHUYHG )RUHFDVW
)RUHFDVW
6DPSOHV 6DPSOHV
506( 506(
(UURU
(UURU
6DPSOHV 6DPSOHV
(a) Test set from trace 1 (samples: 70,500 to 71,953). (b) Test set from trace 2 (samples: 10,000 to 10,899).
7LPH PV
7LPH PV
2EVHUYHG
2EVHUYHG
)RUHFDVW )RUHFDVW
6DPSOHV 6DPSOHV
506( 506(
(UURU
(UURU
Figure 10a,b, demonstrate the energy efficiency and mean delay with varying ON Period (TON )
for AI-DRX and LTE-DRX over trace 1 & trace 2, respectively. In Figure 10, TON is varied from 1 ms
to 160 ms while the values for Th Min = 20 & Th Max = 100 are considered. Figure 10a highlights the
decrease in energy efficiency with increase in TON . The reason for the drop in energy efficiency is,
UE waits for longer period in ON state prior switching to active state (to serve packets). It can be
seen from Figure 10b, the mean delay for trace 1 ranges from 150 ms to 194 ms with an increase in
ON period. The reason for rise in mean delay lies in the feet that with an increase in ON duration,
the time spend by UE in ON state will be higher, which results in higher delay. Moreover, by selecting
the optimum value of TON , delay observed by UE can be minimized. From Figure 10a, it is noticed
that the energy efficiency of trace 2 is higher than that of trace 1 due to higher arrival rate of trace 1 as
compared to trace 2.
We have compared the performance of AI-DRX with LTE-DRX. To implement the LTE-DRX, we
select the value of short sleep cycle to Th Min , the value of long sleep cycle to Th Max and fed trained
model with trace 1 and trace 2. It is observed in Figure 10a AI-DRX energy efficiency for trace 1 is 69%
higher than that of LTE-DRX, at the cost of higher delay. The reason arises from the fact that AI-DRX
256
Electronics 2019, 8, 778
calculates sleep time form real wireless traffic trace based on arrival rate, while LTE-DRX uses Th Min
and Th Max to set the short and long sleep time. For AI-DRX, the small value of Th Min and Th Max
achieves higher energy efficiency as UE easily transits to long sleep time, which results in a larger
delay. Whereas, for small values of Th Min and Th Max in LTE-DRX, a UE sleeps for a short period that
results in less energy efficiency and smaller mean delay. AI-DRX achieves 55% higher energy efficiency
as compared to that of LTE-DRX using trace 2.
(QHUJ\(IILFLHQF\
0HDQ'HOD\ PV
$,'5;7UDFH
$,'5;7UDFH
$,'5;7UDFH /7('5;7UDFH
$,'5;7UDFH /7('5;7UDFH
/7('5;7UDFH
/7('5;7UDFH
Figure 10. AI-DRX energy efficiency and mean delay with varying TON (Th Min = 20, Th Max = 100).
Figures 11 and 12 present the energy efficiency and mean delay with varying ON period TON for
Th Min = 200 & Th Max = 1000 and Th Min = 300 & Th Max = 1600, respectively. It is observed from
Figures 11 and 12 that the energy efficiency and mean delay of LTE-DRX increase with an increase
in Th Min & Th Max , as UE sleep longer. The energy efficiency of AI-DRX decreases with an increase
in Th Min & Th Max , as UE will not able to transit to sleep state if TDY < Th Min . With a small value of
sleep time TDY or higher value of Th Min & Th Max , UE remains in the active state, which results in less
energy efficiency.
(QHUJ\(IILFLHQF\
0HDQ'HOD\ PV
$,'5;7UDFH $,'5;7UDFH
$,'5;7UDFH $,'5;7UDFH
/7('5;7UDFH /7('5;7UDFH
/7('5;7UDFH /7('5;7UDFH
213HULRG7 21 PV 213HULRG7 21 PV
Figure 11. AI-DRX energy efficiency and mean delay with varying TON (Th Min = 200, Th Max = 1000).
257
Electronics 2019, 8, 778
$,'5;7UDFH
$,'5;7UDFH
/7('5;7UDFH
/7('5;7UDFH
(QHUJ\(IILFLHQF\
0HDQ'HOD\ PV
$,'5;7UDFH
$,'5;7UDFH
/7('5;7UDFH
/7('5;7UDFH
213HULRG7 21 PV 213HULRG7 21 PV
To validate our proposal, we have compared our work to traditional Poisson arrival model.
We generated three data sets considering Poisson arrival rate (λ) with mean value of λ = 1/20,
λ = 1/10, and λ = 1. We have trained model using Poisson arrival rate. The generated traces are
fed to AI-DRX algorithm to analyze the energy efficiency and mean delay. Figure 13a,b shows the
energy efficiency and mean delay for AI-DRX trace 2 and Poisson arrival with varying TON . The energy
efficiency of AI-DRX is 70% higher than Poisson arrival rate (λ = 1/20). The corresponding mean delay
of AI-DRX is 100 ms (on an average) higher as compared to Poisson arrival for λ = 1/20. The gain in
energy efficiency is achieved as AI-DRX considers real traffic arrival rate for selection of sleep cycles
and inactivity timer, while Poisson arrival considers mean arrival rate. The energy efficiency and mean
delay both are zero for higher Poisson arrival rate λ = 1. The reason lies in the fact that for higher
arrival rate, UE could not transit to sleep state to save the power.
(QHUJ\(IILFLHQF\
0HDQ'HOD\ PV
$,'5;7UDFH
3RLVVRQ$UULYDO
3RLVVRQ$UULYDO $,'5;7UDFH
3RLVVRQ$UULYDO 3RLVVRQ$UULYDO
3RLVVRQ$UULYDO
3RLVVRQ$UULYDO
213HULRG7 21 PV 213HULRG7 21 PV
Figure 13. AI-DRX energy efficiency and mean delay comparison with Poisson Arrival with varying
TON (Th Min = 300, Th Max = 1600).
Various services in a wireless network can tolerate different delay levels while not compromising
quality of service (QoS). QoS class identifier (QCI) is a metric that is used to identify the characteristics
of traffic. QCI measures the QoS with two parameters; (1) packet loss rate (PLR) and (2) packet
delay budget (PDB). PDB can be defined as maximum tolerable waiting time by a packet during
its delivery from eNB to UE. Standardized QCI characteristics are shown in Table 2 [6,30,31]. In
258
Electronics 2019, 8, 778
various kinds of non-real-time services like email, web browsing, after a certain time period, a
UE does not require to monitor PDCCH continuously [6,32]. Hence, these types of services can
tolerate higher delays up to 300 ms [6,30,31]. These types of services require DRX with higher values
of sleep time (TDY > Th Min & TDY > Th Max ) and smaller values of ON timer for better energy
efficiency. Whereas, the real-time services like voice and live video streaming cannot tolerate delay
[33]. Therefore, the delay should be a higher priority than energy saving. If we limit the mean
delay to 125 ms, the energy efficiency will be 95.02%. For this energy efficiency we have to select
Th Min = 200 , Th Max = 1000, TON = 6 ms. If we increase the TON the energy efficiency of UE
decreases as UE remains in active state. The mean delay also increases with increase in TON as UE
does not receive the data during ON period but only monitors the Physical Downlink Control Channel
(PDCCH). The mean delay observed by UE increases to 150 ms when TON = 80, with energy efficiency
of 80.2%. The network can maximize the energy efficiency of UE by selecting optimum value of
Th Min , Th Max , TON depending on QCI value of different services.
Type Qos Class Identifier Packet Loss Rate Packet Delay Budget Examples
GBR 1 10−2 100 ms Voice services
GBR 2 10−3 150 ms Live streaming services
GBR 3 10−3 50 ms Real time gaming services
GBR 4 10−6 300 ms Buffered streaming services
Non-GBR 5 10−6 100 ms IMS Signaling services
Non-GBR 6 10−6 300 ms TCP based application services
Non-GBR 7 10−3 100 ms Interactive gaming services
Non-GBR 8 10−3 300 ms TCP based video services
Non-GBR 9 10−6 300 ms TCP based video services
5. Conclusions
In this work, we have suggested an AI-based DRX mechanism for energy saving in multiple
beams communications scenario. We have modeled DRX as a ten-state model and suggested AI-DRX
algorithm depending on these 10 states. AI-DRX algorithm enables dynamic short and long sleep
cycles for energy efficiency of UE in the 5G network. We have trained LSTM network, a popular type
of RNN, to extract the packet arrival time pattern from real wireless traffic traces. Later, we have
utilized the learned model in AI-DRX algorithm for energy saving in 5G enabled devices. AI-DRX
economizes power consumption of a UE by enabling dynamic short and long sleep cycles. Extensive
training with selected hyper-parameters achieves the least RMSE of 5 ms on a random test set from
trace 1 and 6 ms on a random test set from trace 2, respectively. The energy efficiency obtained with
AI-DRX is approximately 60% and 95% for trace 1 and trace 2, respectively. AI-DRX achieves 69%
higher energy efficiency on trace 1 and 55% more energy efficiency on trace 2 as compared to LTE-DRX,
respectively. We also validated the performance of AI-DRX with traditional Poisson packet arrival
model. AI-DRX attains 70% more energy efficiency on trace 2 as compared to Poisson packet arrival
rate for λ = 1/20.
Author Contributions: Conceptualization, M.L.M. and M.K.M.; methodology, A.R.; investigation, M.L.M. and
M.K.M.; resources, M.L.M.; data curation, M.L.M.; writing—original draft preparation, M.L.M. and M.K.M.;
writing—review and editing, N.S. and A.R.; supervision, N.S., A.R. and D.R.S.; project administration, D.R.S.;
funding acquisition, D.R.S.
Funding: This research received no external funding.
Acknowledgments: This research was supported by the Basic Science Research Program through the National
Research Foundation of Korea (NRF) funded by the Ministry of Education (NRF-2016R1D1A1B03935633).
259
Electronics 2019, 8, 778
References
1. Ericsson Mobility Report June 2019. Available online: https://www.ericsson.com/assets/local/mobility-
report/documents/2019/ericsson-mobility-report-june-2019.pdf (accessed on 12 June 2019).
2. Fowler, S.; Shahidullah, A.O.; Osman, M.; Karlsson, J.M.; Yuan, D. Analytical evaluation of extended DRX
with additional active cycles for light traffic. Comput. Netw. 2015, 77, 90–102. [CrossRef]
3. Maheshwari, M.K.; Agiwal, M.; Saxena, N.; Roy, A. Directional Discontinuous Reception (DDRX) for
mmWave enabled 5G Communications. IEEE Trans. Mobile Comput. 2018. [CrossRef]
4. Agiwal, M.; Roy, A.; Saxena, N. Next generation 5G wireless networks: A comprehensive surve.
IEEE Commun. Surv. Tut. 2016, 18, 1617–1655. [CrossRef]
5. Zhou, K.; Nikaein, N.; Spyropoulos, T. LTE/LTE-A discontinuous reception modeling for machine type
communications. IEEE Wireless Commun. Lett. 2013, 2, 102–105. [CrossRef]
6. Maheshwari, M.K.; Roy, A.; Saxena, N. DRX over LAA-LTE—A New Design and Analysis based on
Semi-Markov Model. IEEE Trans. Mob. Comput. 2018, 18, 276–289. [CrossRef]
7. Ferng, H.-W.; Tseng, Y.-T.; Wang, T.-H. Parameter-Adaptive Dynamic and Adjustable DRX Schemes for
LTE/LTE-A. IEEE Trans. Green Commun. Netw. 2019. [CrossRef]
8. Gódor, I. Revitalizing DRX for enhanced mobile sleep modes. Comput. Commun. 2019, 138, 98–105. [CrossRef]
9. 3GPP TSG-RAN WG2 #94 Tdoc R2-164023 Std., 2016. Available online: https://www.3gpp.org/ftp/tsg_
ran/WG2_RL2/TSGR2_94/Docs/R2-164023.zip (accessed on 4 April 2019).
10. Ho, C.H.; Huang, A.; Hsieh, H.-J.; Wei, H.-Y. Energy-Efficient Millimeter-Wave M2M 5G Systems with
Beam-Aware DRX Mechanism. In Proceedings of the 2017 IEEE 86th Vehicular Technology Conference
(VTC-Fall), Toronto, ON, Canada, 24–27 September 2017.
11. Mudasar, L.M.; Maheshwari, M.K.; Shin, D.R.; Roy, A. Deep-DRX: A framework for deep learning–based
discontinuous reception in 5G wireless networks. Trans. Emerg. Telecommun. Technol. 2019, 30, e3579.
12. Kwon, S.-W.; Hwang, J.; Agiwal, A.; Kang, H. Performance analysis of DRX mechanism considering analogue
beamforming in millimeter-wave mobile broadband system. In Proceedings of the 2014 IEEE Globecom
Workshops (GC Wkshps), Austin, TX, USA, 8–12 December 2014.
13. Liu, D.; Wang, C.; Rasmussen, L.K. Discontinuous Reception for Multiple-Beam Communication. IEEE Access
2019, 7, 46931–46946. [CrossRef]
14. Wang, J.; Tang, J.; Xu, Z.; Wang, Y.; Xue, G.; Zhang, X.; Yang, D. Spatiotemporal modeling and prediction
in cellular networks: A big data enabled deep learning approach. In Proceedings of the IEEE INFOCOM
2017—IEEE Conference on Computer Communications, Atlanta, GA, USA, 1–4 May 2017.
15. Gers, A.F.; Schmidhuber, J.; Cummins, F.A. Learning to forget: Continual prediction with LSTM.
Neural Comput. 1999, 12, 850–855.
16. Google. Google AI Blog. Available online: https://ai.googleblog.com/2018/05/duplex-ai-system-for-
natural-conversation.html (accessed on 1 June 2019).
17. Bontu, C.S.; Illidge, E. DRX mechanism for power saving in LTE. IEEE Commun. Mag. 2009, 47, 48–55.
[CrossRef]
18. Evolved universal terrestrial radio access (E-UTRA); Medium access control (MAC) protocol specification.
Available online: https://portal.3gpp.org/desktopmodules/Specifications/SpecificationDetails.aspx?
specificationId=2437 (accessed on 2 March 2018)
19. Chang, H.-L.; Tsai, M.-H. Optimistic DRX for Machine-Type Communications in LTE-A Network. IEEE Access
2018, 6, 9887–9897. [CrossRef]
20. Chang, H.-L.; Lu, S.-L.; Chuang, T.-H.; Lin, C.-Y.; Tsai, M.-H.; Sou, S.-I.Optimistic DRX for machine-type
communications. In Proceedings of the 2016 IEEE International Conference on Communications (ICC),
Kuala Lumpur, Malaysia , 22–27 May 2016 .
21. Agiwal, M.; Maheshwari, M.K.; Saxena, N.; Roy, A. Directional-DRX for 5G wireless communications.
Electron. Lett. 2016, 52, 1816–1818. [CrossRef]
22. Maheshwari, M.K.; Agiwal M.; Saxena, N.; Roy, A. Hybrid directional discontinuous reception (HD-DRX)
for 5G communication. IEEE Commun Lett. 2017, 21, 1421–1424. [CrossRef]
260
Electronics 2019, 8, 778
23. Maheshwari, M.K.; Roy, A.; Saxena, N. Analytical modeling of DRX with flexible TTI for 5G communications.
Trans. Emerg. Telecommun. Technol. 2018, 29, e3275. [CrossRef]
24. Study on new radio access technology Physical layer aspects. Available online: https://portal.3gpp.org/
desktopmodules/Specifications/SpecificationDetails.aspx?specificationId=3066 (accessed on 2 March 2018)
25. Study on new radio access technology Radio interface protocol aspects. Available online: https://
portal.3gpp.org/desktopmodules/Specifications/SpecificationDetails.aspx?specificationId=3070 (accessed
on 2 March 2018).
26. Rumelhart, D.E.; Hinton, G.E.; Williams, R.J. Learning Internal Representations by Error Propagation; Technical
Report; Institute for Cognitive Science, University of California: San Diego, CA, USA, 1985.
27. UMass Trace Repository. ACM MMSys conference dataset archive. Available online: http://traces.cs.umass.
edu/index.php/Mmsys/Mmsys (accessed on2 March 2018).
28. CRAWDAD Data Set Cambridge/Inmotion. Available online: https://crawdad.org/cambridge/inmotion/
20051001 (accessed on 2 March 2018).
29. Baughman, M.; Haas, C.; Wolski, R.; Foster, I.; Chard, K. Predicting Amazon spot prices with LSTM networks.
In Proceedings of the 9th Workshop on Scientific Cloud Computing, Tempe, AZ, USA, 11 June 2018.
30. Policy and charging control architecture. Available online: https://portal.3gpp.org/desktopmodules/
Specifications/SpecificationDetails.aspx?specificationId=810 (accessed on 2 March 2018).
31. Liang, J.M.; Chen, J.-J.; Cheng, H.-H.; Tseng, Y.-C. An energy-efficient sleep scheduling with QoS
consideration in 3GPP LTE-advanced networks for internet of things. IEEE J. Emerg. Sel. Top. Circ. Syst. 2013,
3, 13–22. [CrossRef]
32. Tseng, C.-C.; Wang, H.-C.; Kuo, F.-C.; Ting, K.-C.; Chen, H.-H.; Chen, G.-Y. Delay and power consumption in
LTE/LTE-A DRX mechanism with mixed short and long cycles. IEEE Trans. Veh. Technol. 2015, 65, 1721–1734.
[CrossRef]
33. Huang, B.; Tian, H.; Chen, L.; Zhu, J. DRX-aware scheduling method for delay-sensitive traffic.
IEEE Commun. Lett. 2010, 14, 1113–1115.
c 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access
article distributed under the terms and conditions of the Creative Commons Attribution
(CC BY) license (http://creativecommons.org/licenses/by/4.0/).
261
electronics
Article
Planar Array Diagnostic Tool for Millimeter-Wave
Wireless Communication Systems
Oluwole J. Famoriji 1, *, Zhongxiang Zhang 2 , Akinwale Fadamiro 1 , Rabiu Zakariyya 1 and
Fujiang Lin 1
1 Micro-/Nano Electronic System Integration R & D Centre (MESIC), University of Science and Technology of
China (USTC), Hefei 230026, China; [email protected] (A.F.); [email protected] (R.Z.);
[email protected] (F.L.)
2 Applied Electromagnetic Field Group, Microwave and Radio Frequency Laboratory, Hefei Normal
University, Hefei 230601, China; [email protected]
* Correspondence: [email protected]
Abstract: In this paper, a diagnostic tool or procedure based on Bayesian compressive sensing
(BCS) is proposed for identification of failed element(s) which manifest in millimeter-wave planar
antenna arrays. With adequate a priori knowledge of the reference antenna array radiation pattern,
a diagnostic problem of faulty elements was formulated. Sparse recovery algorithms, including total
variation (TV), mixed 1 /2 norm, and minimization of the 1 , are readily available in the literature,
and were used to diagnose the array under test (AUT) from measurement points, consequently
providing faster and better diagnostic schemes than the traditional mechanisms, such as the back
propagation algorithm, matrix method algorithm, etc. However, these approaches exhibit some
drawbacks in terms of effectiveness and reliability in noisy data, and a large number of measurement
data points. To overcome these problems, a methodology based on BCS was adapted in this paper.
From far-field radiation pattern samples, planar array diagnosis was formulated as a sparse signal
recovery problem where BCS was applied to recover the locations of the faults using relevance vector
machine (RVM). The resulted BCS approach was validated through simulations and experiments
to provide suitable guidelines for users, as well as insight into the features and potential of the
proposed procedure. A Ka-band (28.9 GHz) 10 × 10 rectangular microstrip patch antenna array
that emulates failure with zero excitation was designed for far-field measurements in an anechoic
chamber. Both simulated and measured far-field samples were used to test the proposed approach.
The proposed technique is demonstrated to detect diagnostic problems with fewer measurements
provided the prior knowledge of the array radiation pattern is known, and the number of faults is
relatively smaller than the array size. The effectiveness and reliability of the technique is verified
experimentally and via simulation. In addition to a faster diagnosis and better reconstruction
accuracy, the BCS-based technique shows more robustness to additive noisy data compared to
other compressive sensing methods. The proposed procedure can be applied to next-generation
transceivers, aerospace systems, radar systems, and other communication systems.
Keywords: far-field; antenna array; diagnosis procedure; noisy data; BCS; millimeter-wave
1. Introduction
Antenna array is a key technology component in various communication systems such as radar,
radio-astronomy, remote sensing, satellite communications, and next-generation (fifth generation,
5G) wireless communications [1], where a very large number (in the hundreds) of radiating
elements are particularly used to meet the increasing demands of high radiation performance and
reconfigurability [2]. Conversely, the higher the number of radiating elements in the beam-forming
configuration, the higher the probability of failed element(s) will be. This causes abrupt field variations
across the aperture of the array, and distortion in the radiation features (e.g., beamwidth, peak sidelobe,
and boresight). Therefore, the availability of reliable and effective diagnosis tools for large arrays
remains an asset, because manual dismantling and replacement operations consume excessive time
and cost, and are even unfeasible in satellite-borne installations. Currently, failure identification in
antenna arrays is a theoretical and practical important research domain. Detection of faulty elements in
antenna arrays is of great interest in both military and civilian markets. Upcoming technologies adopt
active or passive antenna arrays with a large number of elements [1–5]. For instance, millimeter-wave
transceivers implement multiple-input multiple-output (MIMO) features and beamforming for future
5G applications, as shown in Figure 1. The block diagram shows the location of the AWMF-0108 in a
5G MIMO system [6]. The integrated circuit (IC) contents in the circle are the gain and phase control
blocks with amplification and RX/TX switching. The first industrial and commercial millimeter-wave
quad-core IC transceiver for 5G applications is the AWMF-0108 [6]. Many compactable antennas
were designed for that purpose. Thus, some communication systems will evolve for 5G technology,
even before full deployment, which is not expected until 2020. The large number of elements in the
planar antenna required by the transceiver must function optimally. Failure in the element(s) causes
far-field degradation of antenna systems. The detection of failed elements from field measurements
taken from a suitable observation point is very important to re-calibrate the feeding network and
to reinstall the needed radiation characteristics by reconfiguring the excitations of the healthy
elements [1,3]. Testing of antennas is then a necessity when a certain number of elements exhibit fault.
Therefore, the fast diagnosis of complex antenna structures is always a fundamental need.
Far-field measurements are a very powerful approach for antenna array testing. The measurement
data are sequentially presented for probable failure identification in the array under test (AUT).
The matrix method algorithm (MMA) and back propagation algorithm (BPA) are the most commonly
used mechanisms to detect the number and corresponding positions of defective elements using
a reference antenna (healthy) and the AUT (defective). BPA [7] was established using the Fourier
relationship between radiated far-field and the field situated on the array aperture, and it is applicable
to planar antenna arrays. A generalized form of BPA is MMA [8,9]. MMA uses linear algebra standard
tools to stabilize the inversion matrix, which relates the array aperture field to radiated far-field.
However, MMA and BPA demand a large number of measurements, thus causing long post-processing
of large arrays. One approach to mitigate the problem is the use of a priori knowledge of the array
without failure; consequently, only the defective array elements are identified. The modeled diagnostic
problem is solved by employing available customs whose computation time is a little longer than
standard methodologies. At this point, it is evident that the total time taken to get the antenna array
diagnosed greatly depends on measurement time, with post-processing times having a higher order
of magnitude. This is why sparse recovery-based methods require fewer measurement numbers and
provide faster antenna array diagnosis.
Recently, compressive sensing emerged rapidly as a potential technique for solving sparse
recovery problems [10–20]. Within this context, the appropriateness of compressive sensing in
addressing the array diagnosis problem was examined in References [3,12,14,16,18,21]. Evidently,
faulty element distribution in array configurations in practice were found to be highly sparse because it
accounts for small non-null entries in the excitation vector of the transmit/receive modules. Beginning
from that hypothesis, 1 -norm minimization mechanism was applied successfully to detect failures
in planar arrays using a small number of near-field [9] or far-field [14] measurements. Conversely,
deterministic compressive sparse techniques require a “measurement matrix” to comply with the
restricted isometric property (RIP) condition, for which the estimation of large matrices remains an
open challenge [3,14,21]. An alternative is the probabilistic compressive sensing approach reported
in Reference [21] to diagnose linear arrays from far-field measurements. However, most of these
techniques were not tested experimentally.
264
Electronics 2018, 7, 383
Figure 1. Ka-band active antenna array formation which integrates multiple-input multiple-output
(MIMO) and beamforming.
In this work, the problem of antenna element excitation level was not examined; however,
we estimated field distribution on the array aperture. This helps us identify the modifications
of aperture field distribution as a result of factors that cannot be quantified by simple failure of
elements, such as different reflections of the array and its feed. The problem faced in getting more
information about the AUT is the larger number of unknowns. However, in sparse recovery methods,
the required number of measurements increases slowly and logarithmically with the number of
unknowns [10–13]. Hence, the field reconstruction scheme benefits more in sparse recovery-based
mechanisms. Different sparse recovery algorithms used to conduct antenna array diagnosis were
unveiled [14] and compared to the traditional BPA and MMA. In particular, total variation (TV) norm,
mixed 1 /2 norm, and minimization of the 1 norm were used to proffer solutions to the resulting
inversion issues. From the field reconstructed on the antenna aperture, Fuchs et al. [20] acquired a
good antenna diagnosis. The approach was applied to far-field simulation data generated from a
100-element antenna array. The performance of the diagnosis was evaluated and compared to the two
standard techniques (BPA and MMA) under different conditions. The approaches were also applied
to far-field measurement data of an antenna array with failure to justify the practical applicability
of the proposed schemes. Although there were many more works on sparse recovery methods in
applied electromagnetics and microwaves involving the diagnosis processes of antenna arrays [13,14],
experimental data, which are fundamental for testing any procedure, were reported in few of them.
In References [15–20], differential scenarios with sparse recovery algorithms were employed to
perform antenna diagnosis and retrieve element excitations. Reference [21] proposed a joint scheme
for adaptive diagnosis of antenna arrays using communication signal fusion (radar-communication
scheme) and the echoes of probe signals received at the same antenna. This method equally solved
the antenna diagnosis problem at low signal-to-noise ratios (SNRs) to ensure optimal performance
of smart sensors in wireless sensor networks. Also, Reference [22] attempted array diagnosis in
millimeter waves using compressive sensing. This work considered both full and partial blockage,
which occurs from a plethora of particles (such as ice, water droplets, salt, and dirt) and the technique
jointly computed the locations of the blocked elements, and the induced phase-shifts and induced
attenuation provided the prior knowledge of the angles of departure/arrival. Reference [23] proposed a
deterministic sampling strategy for failure detection in uniform linear arrays via compressed sensing or
a sparse recovery approach. This is an extension of the Weyl formula which is basically used for prime
265
Electronics 2018, 7, 383
numbers. The strategy obtained was good for nonprime number (i.e., valid for any number of array
elements). This sampling approach is good for sparse electromagnetic (EM) problems encompassing
Fourier matrices. Reference [24] gives a review of different capacities of sparse recovery by analyzing
how compressive sensing can be applied to antenna array synthesis, diagnosis, and processing.
Illustrations of a set of applicative examples were given, including direction-of-arrival estimation,
along with present challenges and current trends in compressive sensing applications to the solution
of innovative and traditional antenna array challenges. In general, compressive sensing generates
few unknown numbers; however, it needs a comprehensive array model with exact knowledge of the
radiating element patterns to produce useful results. The technique is sparse with respect to the whole
array structure, and requires a priori information to recast it as the function of minimization of 1 norm.
However, these techniques are applicable only if the relationships between the data and the unknowns
satisfy the restricted isometry property (RIP). To overcome this challenge, the Bayesian compressive
sensing (BCS) approach is adopted. This technique was explored in many electromagnetic problems,
such as antenna design and synthesis [25,26], microwave imaging [27–30], and direction-of-arrival
estimation [31–33]. It is employed in this paper to estimate the number, magnitude, and location of
failures in antenna arrays from far-field measurements. The BCS approach was attempted to diagnose
large linear arrays [11], and more recently, planar array configurations [34]; however, no experimental
validation was reported. Hence, there is a need for a more reliable procedure tested experimentally
and via simulation, because experiments are fundamental tests of any given procedure.
Specifically, this work is an extension of that described in References [15,16]. The BCS method is
applied to both the simulated and measured far-field data of a millimeter-wave 100-element microstrip
patch antenna array in which failures were added intentionally. A new regularization technique
was unveiled and applied to field distribution in order to enhance the efficiency and reliability of
antenna array diagnosis. The proposed BCS-based approach is a better choice due to its fast nature
and robustness under different noise conditions. The key contributions of this paper are summarized
as follows:
1. The BCS technique was applied to diagnose a millimeter-wave planar array, and the result was
compared with other approaches reported in the literature.
2. The procedure shows high effectiveness and reliability with fewer measurement points compared
to the other methods, and is highly robust to additive noisy data. This was validated
experimentally and via simulations.
However, some boundary conditions were observed. The BCS-based approach detects diagnostic
problems with few measurements, provided prior knowledge of the reference array radiation pattern,
and the number of faults is relatively smaller than the array size. The remainder of this paper is
arranged as follows: Section 2 contains the problem formulation of antenna array diagnosis. Section 3
presents compressed sparse recovery methods. Resolution via the BCS-based approach is given in
Section 4. The numerical simulations are presented in Section 5. Diagnoses from experimental data are
presented and discussed in Section 6. Finally, some conclusions are drawn in Section 7.
where Exu ( x, y) x̂ and Eyu ( x, y)ŷ are the x and y planes of the aperture’s electric field,
respectively. Far-field F u (r, θ, ∅) is the measured field on part of the hemispherical surface
(0 ≤ θ ≤ π/2, 0 ≤ φ ≤ 2π ) at radius r from the phase center of the AUT, and r > 2D2 /λ, where D is
the diameter. Also, the amplitude and phase of a reference array (RA; array without failures) shown
266
Electronics 2018, 7, 383
in Figure 2b are assumed to be available. Associated quantities are marked with superscript “o”.
Eo ( x, y) is the field on the aperture Σ of the reference array (RA) and Fo (r, θ, ∅) represents the far-field
radiation. For the differential antenna (DA) shown in Figure 2c, the tangential distribution E( x, y) on
the aperture Σ is equal to the difference between the field distributions of the reference array and the
antenna under test, and the corresponding far-field F(r, θ, φ) is expressed as the difference between the
fields of reference array (RA) and AUT as
E( x, y) = Eu ( x, y) − Eo ( x, y), (2)
*+
*-+
*+
Figure 2. Antenna array: (a) reference antenna without failures; (b) antenna under test (AUT);
(c) differential antenna (DA). The number of failures is 2 within the total element number N = 21.
267
Electronics 2018, 7, 383
The differential antenna gives a resulting problem in which only the corresponding area to the
field modification radiates as a result of failure. By visually monitoring the field distribution on the
DA, the identification of faulty elements in the AUT can be observed.
where ∗ q represents lq norm, and γ is a function of noise and factors influencing the data. There are
various routines available to effectively solve the convex optimization problem of Equation (4) such
as References [25–27]. The three norms lq , selected based on a priori knowledge of the differential
antenna set-up with the diagnosis problem, can now be described for regularization of the inversion.
We applied them to conduct diagnosis of both the simulated and measured radiating antennas.
268
Electronics 2018, 7, 383
⎡ ⎤
−1 0
⎢ .. ⎥
⎢ 1 . ⎥
⎢ ⎥
∇y = ⎢ ⎥.
⎢ .. ⎥
⎣ . −1 ⎦
0 1
Then, the optimization problem in Equation (4) transforms to
Minimizing 1 -norm imposes the pointwise sparsity of solution per sample xk of the field on the
aperture of the DA.
N
6(ul , vl ) =
E ∑ β n e j λ ( xn ul +yn vl ) + vl ,
2π
(10)
n =1
where (ul , vl ) for l = 1, . . . , L is the angular location of the l-th angular sample, and vl is the noise effect
considered as Gaussian-distributed with zero mean and variance σ2 . β n , n = 1, . . . , N, is the failed
excitations vector, expressed as
269
Electronics 2018, 7, 383
hαn with probability Φ
βn = , n = 1, . . . , N. (11)
αn otherwise
h ∈ (0, 1) is the failure factor, while Φ is the rate of failure, and αn is the weighting coefficient.
From knowledge of the difference in field pattern, W (ul , vl ) = E(ul , vl ) − E 6(ul , vl ), l = 1, . . . , L,
array failures can be estimated by determining the minimum 0 − norm vector
Υ = {Υn = αn − β n ; n = 1, . . . , N }, (12)
which satisfies
W − ΨΥ = v. (13)
which gives
1 ΨT Ψ
Υ̂ = + diag f NP , (16)
σNP σNP
where T is the transpose operator, and σNP and f are the figures that are used to maximize the
NP
likelihood function
1
L σ, f = − Nlog2π + logC + W T C −1 W . (17)
2
Equation (11) is computed using RVM [29], with C = σ + ΨF −1 Ψ T , where F = diag( f ).
The implementation of the BCS technique (as shown in Figure 3) is summarized in Algorithm 1.
a. Step 1—Array parameter selection and definition of problem: For each element (n = 1, . . . N), accumulate the
array field at N measurement points (reference antenna) and randomly sample the AUT field.
Appropriate the noise variance and threshold for the maximization of L σ, f .
b. Step 2—Radiation pattern measurement matrix Ψ definition. Input the parameters of Ψ.
c. Step 3—Posterior mode estimation. Maximize Equation (10) iteratively to estimate σ and f using the RVM
procedure [29].
d. Step 4—Source difference reconstruction. Determine W(ul , vl ) for l = 1, . . . , L.
e. Step 5—Failed field excitation reconstruction. Determine the vector of failed excitations
β n (n = 1, . . . , N) using Equation (11).
270
Electronics 2018, 7, 383
271
Electronics 2018, 7, 383
Figure 5. Normalized array excitation with estimated failure vector Υ̂ and location and amplitude
of entries.
To analyze the impact on performance metrics of the technique of the noise on far-field patterns,
we ranged the SNR between 0 dB and 100 dB. Figure 6 shows the obtained result. The estimation error
ζ is high for low SNRs irrespective of failed element percentage. Also, for higher SNR, the robustness
of the approach increases, which shows that the best performances are attained at Φ = 3%. The impact
of the percentage of failed elements on the performance of the method proposed was also assessed by
varying the percentage of the failed elements from Φ = 2% to Φ = 20% (see Figure 7). Expectedly,
the performance metrics of the approach reduced for a higher percentage of failed elements, even for
very low noise levels. Conversely, the approach achieved a degree acceptable accuracy until about
10% of damaged elements at any SNR. This result validates the efficiency of the BCS technique in the
diagnosis of sparse failures in arrays.
272
Electronics 2018, 7, 383
Figure 6. Detection index against signal-to-noise ratio (SNR) for different failure percentages.
N (0, 1) + jN (0, 1)
nq = √ max |yq | × 10−SNRdB /20 , (19)
2
273
Electronics 2018, 7, 383
Figure 8. Designed microstrip patch antenna array in Ansys HFSS for diagnosis.
Also, considering an AUT with K = 5 element failures (Φ = 5%) (elements with zero excitation)
because failed elements are usually of small number in practice, the resulted excitation coefficients
are presented in Figure 13, and the estimated excitations by 30 random noisy measurement points are
presented in Figure 14, while the dB excitation error is depicted in Figure 15. The result is an indication
of good estimation of RA excitations and the locations of the faulty elements.
274
Electronics 2018, 7, 383
*+
*-+
*+
Figure 9. The antenna array for diagnostic purposes: (a) S-parameter; (b) RA radiation pattern; (c)
AUT radiation pattern.
275
Electronics 2018, 7, 383
Figure 11. Reconstructed excitation field of reference array employing 30 random noisy
measurement points.
Figure 12. Reference array reconstructed excitation error in dB by 30 random noisy measurement points.
276
Electronics 2018, 7, 383
Figure 14. AUT with K = 5 failures Φ = 5%: reconstructed excitation error field by 30 random noisy
measurement points.
Figure 15. AUT with K = 5 failures Φ = 5%: reconstructed excitation error in dB by 30 random noisy
measurement points.
277
Electronics 2018, 7, 383
Figure 16. Schematic of experimental set-up.
The antenna was particularly designed and fabricated for this purpose, and each element had its
feeding port which was excited using a power divider. Five radiating elements in the array were not
excited (zero excitation) to successfully emulate the failure of elements. The AUT set-up is depicted in
Figure 18. About 1000 co-polar and cross-polar measurements were taken on the far-field half-sphere
at 28.9 GHz in an anechoic chamber (see Figure 18).
Radiation pattern measurements obtained from the array with five faulty elements were fed into
the proposed algorithm for post-processing. The reconstructed excitation error which identified the
specific faulty elements is depicted in Figure 19, and the corresponding dB equivalence is shown
in Figure 20. Moreover, the performance metrics of the BCS-based approach were experimentally
tested, and are shown in Figure 21. Figure 21a shows the obtained reconstruction error versus
the measurement number at different degrees of failure. The error decreased as the number of
measurement points increased irrespective of failure percentage. The reconstruction error profile
increased with increased failure rate. In Figure 21b, it is demonstrated how the reconstruction error
profile changes with SNR for different failure rates. The reconstruction error degraded exponentially
with increased SNR independent of failure rate. The reconstruction error increased with increased
failure rate. Figure 21c depicts reconstruction error versus different levels of failure for various SNRs.
It can be observed that the error increased with increased failure rate. Also, the reconstruction error
decreased as SNR increased.
278
Electronics 2018, 7, 383
*+
*-+
*+
Figure 17. Photograph of (a) the fabricated antenna array, (b) measured radiation pattern without fault,
and (c) measured radiation pattern with emulated failures.
279
Electronics 2018, 7, 383
Figure 18. Measurement set-up of AUT for diagnostic purpose.
The imperfection of the curves (compared to the simulation) could be attributed to measurement
errors, and errors due to experimental set-ups which provide different conditions from those in the
simulations. The experiment was conducted in an anechoic chamber, which is a controlled environment.
Hence, the results presented here may show a little variation if the experiment is conducted in a
more practical environment (i.e., uncontrolled environment). Moreover, the BCS-based procedure
presented here can be trusted to effectively and reliably address sparse recovery problems, particularly
the detection of faulty radiators in planar arrays for next-generation 5G wireless communications.
Once the suitable data are collected, and used to diagnose the array, then the array feeding network
can be recalibrated to restore the needed radiation features via excitation reconfiguration of the healthy
antenna elements. However, prior knowledge of the golden array must be provided, and the failure
rate is relatively smaller than the array size. Therefore, from the simulation results verified by the
experiment, the BCS-based approach is adequate and reliable for noisy data. This technique overcomes
the shortcomings of BPA, MMA, etc., demanding off-line phase training to form accurate mapping
between the response of the AUT and the failure location. Hence, the proposed procedure will be
highly useful for millimeter-wave planar array optimal performance.
Figure 19. Reconstructed excitation error field by 30 random noisy measurement points with
20 dB signal-to-noise-ratio.
280
Electronics 2018, 7, 383
6.2. Antenna Array Diagnosis from Simulated and Measured Far-Field Radiation Patterns
There are differences between the simulated and measured antenna patterns due to measurements
errors, uncontrollable array fabrication errors, and experimental set-ups that give different conditions
from the simulations. For example, in our design, a finite flange was employed to feed the ground
plane. Hence, the induced current on flange rescattered and redistributed somehow against the very
large ground plane which was used for the simulations. Antenna array diagnosis procedures based
on simulated radiation pattern (such as References [3,16–18,27,35]) may not be reliable and accurate,
except when tested with the corresponding measured data. Although, in this work, simulation
and measurement data exhibit little difference in field intensity of the identified faulty elements
(in Figures 14 and 19, respectively) caused by the electromechanical coupling effect. In general,
the proposed BCS-based approach shows good reliability and accuracy against both simulated and
measured far-field radiation patterns.
281
Electronics 2018, 7, 383
the proposed method shows its robustness to noisy measured data, and a reliable diagnosis was
obtained for low SNRs.
*+
*-+
*+
Figure 21. Experimental performance assessment of the proposed BCS-based diagnostic procedure.
(a) Reconstruction error versus number of measurements, (b) Reconstruction error against SNR,
(c) Reconstruction error with different failure rate
282
Electronics 2018, 7, 383
Table 1. Comparison between Bayesian compressive sensing (BCS)-based approach and previous
compressive sensing techniques. SNR—signal-to-noise ratio.
7. Conclusions
A faster and robust antenna array diagnosis procedure from far-field radiation pattern
measurement points using Bayesian compressive sensing (BCS) approach was proposed in this paper.
Previous compressive sensing procedures exhibit shortcomings based on reliability with noisy data,
and require a large number of far-field measurement points. The proposed method solves these
problems by formulating planar array diagnosis within the concept of the BCS framework, resolved
using fast relevance machine (RVM). We are not the first to apply the BCS approach to antenna
array diagnosis. It was applied only to linear configurations in References [15,16] without practical
measurements, which are fundamental for testing any procedure. To the best knowledge of the
authors, this is the first attempt to apply the BCS approach to planar antenna array diagnosis from
far-field measurement points, validated with experimental measurements. Diagnoses from simulated
and measured far-field points from a designed microstrip patch antenna array show the method’s
robustness to additive data noise, as well as its reconstruction accuracy and faster diagnosis speed,
which is desired in practical applications. Hence, the proposed method is a better practical choice
whenever an efficient, faster, and more reliable antenna array diagnosis (testing) is needed.
Also, it is important to comment on the choice of sampling strategy. We considered a random
selection of measurement points from a uniform lattice. The choice of the sampling technique is not
critical because it affects all the techniques in the same manner at the far-field. However, it was pointed
out, from a non-uniform near-field lattice, that proper non-uniform random sampling (NURS) using
a priori information on the problem provides meaningful reduction in the cardinality of the set of
measured data compared to uniform random sampling and random sampling from a λ/2 equispaced
dataset [39]. Moreover, the BCS-based technique was compared to other methods using the data
reported in the literature. In the future, we will compare different techniques using experimental data
from controlled and uncontrolled environments, and the same parameters in order to quantify the
error affecting the result of different techniques. For example, we will determine what happens if we
use 30 measurements instead of 64 measurements in the experimental data using 1 minimization,
i.e., the same number of data used by the BCS, as well as the error compared to BCS. A complete
comparison among the techniques using real data is still absent in the present literature.
Author Contributions: Conceptualization, methodology, writing-review and editing, original draft preparation,
O.J.F.; resources, Z.Z.; visualization, software, A.F. and R.Z.; supervision, F.L.
Funding: This research received no external funding. The APC was funded by University of Science and
Technology of China.
Acknowledgments: The majority of the work was conducted at MESIC (a joint lab of USTC and IMECAS),
and partially carried out at the USTC Center for Micro and Nanoscale Research and Fabrication. The authors
would like to thank the Information Science Laboratory Center of USTC for software and hardware services.
The authors acknowledge the Applied Electromagnetics and Microwave Engineering Group of Hefei Normal
University for the provision of the anechoic chamber. The support of the Chinese Academy of Science and the
World Academy of Science (CAS-TWAS) is appreciated. The authors appreciate Zhang [40] and the CVX research
group [41] for making useful codes accessible online.
Conflicts of Interest: The authors declare no conflict of interest.
283
Electronics 2018, 7, 383
References
1. Oliveri, G.; Donelli, M.; Massa, A. Linear array thinning exploiting almost difference sets. IEEE Trans.
Antennas Propag. 2009, 57, 3800–3812. [CrossRef]
2. Oliveri, G.; Manica, L.; Massa, A. ADS-based guidelines for thinned planar arrays. IEEE Trans.
Antennas Propag. 2010, 58, 1935–1948. [CrossRef]
3. Carlin, M.; Oliveri, G.; Massa, A. On the robustness to element failures of linear ADS-thinned arrays. IEEE
Trans. Antennas Propag. 2011, 59, 4849–4853. [CrossRef]
4. 5G Vision White Paper, DMC R&D Center. Samsung Electronics Co., Ltd., 2015. Available online:
http://www.samsung.com/global/business-images/insights/2015/Samsung-5G-Vision-0.pdf (accessed
on 4 October 2018).
5. 5G: A Technological Vision, HUAWEI white Paper. Huawei Technologies Co.: Shenzhen. Available online:
http://www.huawei.com/5gwhitepaper/ (accessed on 4 October 2018).
6. Frenzel, L. Ka-Band Transceiver Implements MIMO and Beamforming for 5G. Available online:
Electronicdesign.com (accessed on 8 October 2018).
7. Lee, J.J.; Ferrer, E.M.; Woollen, D.P.; Lee, K.M. Near-field probe used as a diagnostic tool to locate defective
elements in an array antenna. IEEE Trans. Antennas Propag. 1988, 36, 884–889. [CrossRef]
8. Gattoufi, L.; Picard, D.; Samii, Y.R.; Bolomey, J.C. Matrix method for near-field diagnostic techniques of
phased array antennas. In Proceedings of the IEEE International Symposium on Phased Array Systems and
Technology, Boston, MA, USA, 15–18 October 1996; pp. 52–57.
9. Bucci, O.M.; Migliore, M.D.; Panariello, G.; Sgambato, G. Accurate diagnosis of conformal arrays from
near-field data using the matrix method. IEEE Trans. Antennas Propag. 2005, 53, 1114–1120. [CrossRef]
10. Candes, E.J.; Romberg, J.; Tao, T. Robust uncertainty principles: Exact signal reconstruction from highly
incomplete frequency information. IEEE Trans. Inf. Theory 2006, 52, 489–509. [CrossRef]
11. Donoho, D.L.; Tanner, J. Precise undersampling theorems. Proc. IEEE 2010, 98, 913–923. [CrossRef]
12. Migliore, M.D.; Fuchs, B.; le Coq, L.; Ferro-Famil, L. Compressed sensing approach for reflectarray diagnostic
from far field measurements. In Proceedings of the European Microwave Conference, Paris, France,
9–11 September 2015; pp. 289–292.
13. Massa, A.; Rocca, P.; Oliveri, G. Compressive sensing in electromagnetics—A review. IEEE Antennas
Propag. Mag. 2015, 57, 224–238. [CrossRef]
14. Fuchs, B.; Coq, L.L.; Migiliore, M.D. Fast antenna array diagnosis from a small number of far-field
measurements. IEEE Trans. Antennas Propag. 2016, 64, 2227–2235. [CrossRef]
15. Migliore, M.D. A compressed sensing approach for array diagnosis from a small set of near-field
measurements. IEEE Trans. Antennas Propag. 2011, 59, 2127–2133. [CrossRef]
16. Oliveri, G.; Rocca, P.; Massa, A. Reliable diagnosis of large linear arrays—A Bayesian compressive sensing
approach. IEEE Trans. Antennas Propag. 2012, 60, 4627–4636. [CrossRef]
17. Migliore, M.D. Array diagnosis from far-field data using the theory of random partial Fourier matrices.
IEEE Trans. Antennas Wirel. Propag. Lett. 2013, 12, 745–748. [CrossRef]
18. Fuchs, B.; Migliore, M.D. Accurate array diagnosis from near-field measurements using reweighted
minimization. In Proceedings of the IEEE Antennas Propagation Symposium, Orlando, FL, USA, 7–13 July
2013; pp. 2255–2256.
19. Fuchs, B.; le Coq, L.; Ferro-Famil, L.; Migliore, M.D. Comparison of methods for reflectarray diagnostic from
far field measurements. In Proceedings of the IEEE International Symposium on Antennas Propagation,
Boston, MA, USA, 19–25 July 2015; pp. 398–399.
20. Donoho, D.L. Compressive sensing. IEEE Trans. Inf. Theory 2006, 52, 1289–1306. [CrossRef]
21. Li, Q.; Zhang, Y.; Dai, K.; Huang, L.; Pan, C.; Song, J. An adaptive diagnosis scheme for integrated
radar-communication antenna array with huge noise. IEEE Access 2018, 6, 25785–25796. [CrossRef]
22. Eltayeb, M.E.; Al-Naffouri, T.; Heath, R.W., Jr. Compressive sensing for millimeter wave antenna array
diagnosis. IEEE Trans. Commun. 2018, 66, 2708–2721. [CrossRef]
23. Li, W.; Deng, W.; Migiliore, M.D. A deterministic far-field sampling strategy for array diagnosis using sparse
recovery. IEEE Antennas Wirel. Propag. Lett. 2018, 17, 1261–1265. [CrossRef]
284
Electronics 2018, 7, 383
24. Massa, A.; Bertolli, M.; Gottardi, G.; Hannan, A.; Marcantonio, D.; Oliver, G.; Polo, A.; Robol, F.;
Rocca, P.; Viani, F. Compressive sensing as applied to antenna arrays: Synthesis, diagnosis, and processing.
In Proceedings of the IEEE International Symposium on Circuits and Systems (ISCAS), Florence, Italy,
27–30 May 2018.
25. Oliveri, G.; Massa, A. Bayesian compressive sampling for pattern synthesis with maximally sparse
non-uniform linear arrays. IEEE Trans. Antennas Propag. 2011, 59, 467–481. [CrossRef]
26. Oliveri, G.; Carlin, M.; Massa, A. Complex-weight sparse linear array synthesis by Bayesian Compressive
Sampling. IEEE Trans. Antennas Propag. 2012, 60, 2309–2326. [CrossRef]
27. Jaehyun, P.; Lee, J.; Chun, J.; Kwak, S. Robust monopulse beam synthesis with sparse elements in linear and
planar arrays with element failure detection. IET Radar Sonar Navig. 2017, 11, 1251–1258.
28. Poli, L.; Oliveri, G.; Massa, A. Microwave imaging within the first-order Born approximation by means of the
contrast-field Bayesian Compressive Sensing. IEEE Trans. Antennas Propag. 2012, 60, 2865–2879. [CrossRef]
29. Poli, L.; Oliveri, G.; Rocca, P.; Massa, A. Bayesian compressive sensing approaches for the reconstruction
of two-dimensional sparse scatterers under TE illuminations. IEEE Trans. Geosci. Remote Sens. 2013, 51,
2920–2936. [CrossRef]
30. Poli, L.; Oliveri, G.; Viani, F.; Massa, A. MT-BCS-based microwave imaging approach through
minimum-norm current expansion. IEEE Trans. Antennas Propag. 2013, 61, 4722–4732. [CrossRef]
31. Carlin, M.; Rocca, P. A Bayesian compressive sensing strategy for direction-of-arrival estimation.
In Proceedings of the 6th European Conference on Antennas Propag. (EuCAP 2012), Prague, Czech Republic,
26–30 March 2012; pp. 1508–1509.
32. Carlin, M.; Rocca, P.; Oliveri, G.; Viani, F.; Massa, A. Directions-of-arrival estimation through Bayesian
compressive sensing strategies. IEEE Trans. Antennas Propag. 2013, 61, 3828–3838. [CrossRef]
33. Carlin, M.; Rocca, P.; Oliveri, G.; Massa, A. Bayesian compressive sensing as applied to directions-of-arrival
estimation in planar arrays. J. Elect. Comput. Eng. 2013, 2013, 1–12. [CrossRef]
34. Salucci, M.; Gelmini, A.; Oliver, G.; Massa, A. Planar array diagnosis by means of an advanced Bayesian
compressive processing. IEEE Trans. Antenna Propag. 2018, 66, 5892–5906. [CrossRef]
35. Oliveri, G.; Rocca, P.; Massa, A. A Bayesian-compressive-sampling-based inversion for imaging sparse
scatters. IEEE Trans. Geosci. Remote Sens. 2011, 49, 3993–4006. [CrossRef]
36. Candès, E.J.; Wakin, M.B.; Boyd, S.P. Enhancing sparsity by reweighted minimization. J. Fourier Anal. Appl.
2008, 14, 877–905. [CrossRef]
37. Rudin, L.; Osher, S.; Fatemi, E. Nonlinear total variation based noise removal algorithms. Phys. D 1992, 60,
259–268. [CrossRef]
38. Bach, F.; Jenatton, R.; Mairal, J.; Obozinski, G. Optimization with sparsity-inducing penalties.
Found. Trends Mach. 2012, 4, 1–106. [CrossRef]
39. Migliore, M.D. On the sampling of the electromagnetic field radiated by sparse sources. IEEE Trans.
Antennas Propag. 2014, 63, 553–564. [CrossRef]
40. Zhang, Y.; Yang, J.; Yin, W. YALL1: Your ALgorithms for L1. Available online: Yall1.blogs.rice.edu (accessed
on 8 October 2018).
41. CVX Research, Inc. CVX: Matlab Software for Disciplined Convex Programming, Version 2.0 Beta. 2012.
Available online: http://cvxr.com/cvx (accessed on 8 October 2018).
© 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access
article distributed under the terms and conditions of the Creative Commons Attribution
(CC BY) license (http://creativecommons.org/licenses/by/4.0/).
285
electronics
Article
Mobile-Phone Antenna Array with Diamond-Ring
Slot Elements for 5G Massive MIMO Systems
Naser Ojaroudi Parchin 1, *, Haleh Jahanbakhsh Basherlou 2 , Mohammad Alibakhshikenari 3 ,
Yasser Ojaroudi Parchin 4 , Yasir I. A. Al-Yasir 1 , Raed A. Abd-Alhameed 1 and Ernesto Limiti 3
1 Faculty of Engineering and Informatics, University of Bradford, Bradford BD7 1DP, UK;
[email protected] (Y.I.A.A.-Y.); [email protected] (R.A.A.-A.)
2 Bradford College, Bradford BD7 1AY, UK; [email protected]
3 Electronic Engineering Department, University of Rome Tor Vergata, Via del Politecnico 1, 00133 Rome, Italy;
[email protected] (M.A.); [email protected] (E.L.)
4 Young Researchers and Elite Club, Germi Branch, Islamic Azad University, Germi 63764-56517, Iran;
[email protected]
* Correspondence: [email protected]; Tel.: +44-7341436156
Abstract: A design of mobile-phone antenna array with diamond-ring slot elements is proposed for
fifth generation (5G) massive multiple-input/multiple-output (MIMO) systems. The configuration of
the design consists of four double-fed diamond-ring slot antenna elements placed at different corners
of the mobile-phone printed circuit board (PCB). A low-cost FR-4 dielectric with an overall dimension
of 75 × 150 mm2 is used as the design substrate. The antenna elements are fed by 50-Ohm L-shaped
microstrip-lines. Due to the orthogonal placement of microstrip feed lines, the diamond-ring slot
elements can exhibit the polarization and radiation pattern diversity characteristic. A good impedance
bandwidth (S11 ≤ −10 dB) of 3.2–4 GHz has been achieved for each antenna radiator. However, for
S11 ≤ −6 dB, this value is 3–4.2 GHz. The proposed design provides the required radiation coverage
of 5G smartphones. The performance of the proposed MIMO antenna design is examined using both
simulation and experiment. High isolation, high efficiency and sufficient gain-level characteristics
have been obtained for the proposed MIMO smartphone antenna. In addition, the calculated total
active reflection coefficient (TARC) and envelope correlation coefficient (ECC) of the antenna elements
are very low over the whole band of interest which verify the capability of the proposed multi-antenna
systems for massive MIMO and diversity applications. Furthermore, the properties of the design in
Data-mode/Talk-mode are investigated and presented.
Keywords: 5G; diamond-ring slot; dual-polarized antenna; massive MIMO; mobile-phone antenna;
pattern diversity
1. Introduction
Nowadays, there is an increased interest in research on MIMO systems in wireless
communication [1,2]. It has incomparable advantages in improving the wireless link transmission
capacity and reliability. In MIMO systems, multiple antennas are deployed at both transmitter and
receiver sides [3]. This technology is a key component and probably the most established to truly reach
the promised transfer data rates of fifth generation (5G) communication systems [4,5]. MIMO antennas
are important for increasing channel capacity and link reliability [6,7]. Standard MIMO networks tend
to use two or four antennas in a single physical package. However, massive MIMO is a MIMO system
with an especially high number of antennas [8]. A 2 × 2 MIMO system has been successfully applied
for fourth generation (4G) mobile communications and it is expected that the massive MIMO system
with a large number of MIMO antennas is very promising for 5G wireless communications [9]. The
greater number of elements in a network will make it more resistant to interference and intentional
jamming [10].
Among the antennas which are used for MIMO applications, printed antennas are more appropriate
due to their low cost, easy fabrication and their capability of easily being integrated to small terminal
devices [11]. However, placing multiple antennas in the limited space of a transceiver poses a significant
challenge in the incorporation of the MIMO technique. According to the requirement of cellular
communications, compact, wideband and high isolation MIMO antenna is an urgent demand in the
future mobile terminal and the portable applications [12–17]. Recently, several techniques have been
introduced to design massive MIMO antennas for 4G and sub-6 GHz 5G mobile terminals [18–27].
We propose here a new design of Eight-port mobile-phone antenna with compact dual-polarized
radiation elements providing wide impedance bandwidths for 5G applications. Eight-element MIMO
smartphone antenna can achieve the channel capacity of 37 bps/Hz which is close to eight times
that of a single antenna for single-input/single-output operation. With such a channel capacity and
a wide frequency spectrum (200 MHz, at least), the data rate can be much higher than 1 Gbps.
The antenna is designed to operate at 3.6 GHz, a candidate frequency band for sub-6-GHz 5G
cellular networks, proposed by Ofcom, UK [28]. The design configuration contains four elements of
double-fed/dual-polarized slot-ring antennas placed at corners of the printed circuit board (PCB). The
antenna elements exhibit wide impedance bandwidth with low mutual coupling function providing
pattern and polarization diversity characteristics at different sides of the mobile-phone PCB. As
a result, the design not only can provide full radiation coverage but also it can support different
polarizations. The computer simulation technology (CST) software was used to investigate antenna
characteristics [29]. Fundamental properties of the single-element and its MIMO design in terms of S
parameters, efficiency, radiation pattern, envelope correlation coefficient (ECC), total active reflection
coefficient (TARC) are investigated. In addition, the performance of the designed mobile-phone
antenna in Data-Mode/Talk-Mode are studied.
288
Electronics 2019, 8, 521
( )
(a)
(b) (c)
Figure 1. The antenna schematic, (a) side view, (b) top and (c) bottom layers.
Table 1. Parameter values of the single-element antenna and its MIMO array design.
Configurations and S parameters of the square-ring slot with rectangular feed line, a diamond-ring
slot with rectangular feed line, and the proposed diamond-ring slot radiator with L-shaped feed line
are illustrated and compared in Figure 2a–c, respectively. It can be observed that by using the proposed
design (Figure 2c), the antenna not only provides wider impedance bandwidth but it also exhibits high
isolation with low mutual coupling characteristic (less than −20 dB) at the desired operation band. As
shown, the operation frequency of the slot radiator with L-shaped feed lines spans from 3.2 to 4 GHz
(800 MHz bandwidth). For S11 ≤ −6 dB, this value is 3–4.2 GHz.
S11 characteristics of varying design parameters including Wx , L1 , x and W1 are illustrated in
Figure 3. Figure 3a depicts the effects of diamond-ring size (Wx ) on the resonance frequency: when
its size decreases from 8 to 6 mm, the antenna resonance varies from 3.2 to 4.6 GHz. The frequency
resonance of the antenna is also affected by the size of the L-shaped feed line arm (L1 ). As shown
in Figure 3b, the antenna operation frequency tunes to lower frequencies (without any changes on
its bandwidth or isolation). Figure 3c illustrates the S11 results for various values of x (width of the
diamond-ring slot-line). As shown, it mainly affects the impedance bandwidth of the antenna: when its
size changes from 0.5 to 2 mm, the antenna operation bandwidth varies from 0.3 to 1.2 GHz. Another
important parameter of the dual-polarized diamond-ring slot antenna design is the length of the feed
line arm (L1 ) which tunes the isolation characteristic. As can be observed from the results shown in
Figure 3d, the antenna reflection coefficient tunes from −18 to less than −40 dB.
289
Electronics 2019, 8, 521
(a)
(b)
(c)
Figure 2. Various structures and S parameter results of: (a) square-ring slot with rectangular feed line,
(b) diamond-ring slot with rectangular feed line, and (c) the proposed diamond-ring slot radiator with
L-shaped feed line.
(a) (b)
(c) (d)
Figure 3. S11 results of the diamond-ring antenna for various values of (a) Wx , (b) L1 , (c) x, and (d) W1 .
290
Electronics 2019, 8, 521
Figure 4 shows the surface current distributions in the ground plane of the antenna at 3.6 GHz. As
shown, the surface currents are mainly distributed around the diamond-ring slot radiator. In addition,
for the different feeding ports of the antenna, the currents densities are equal and opposite due to the
polarization diversity function [32–34]. Figure 5 illustrates the 3D radiation patterns of the antenna
when it is fed differently (Port 1 and Port 2). As seen, the antenna exhibits similar radiation patterns
with different orthogonal polarizations and more than 3 dB realized-gain. Radiation characteristics
of the dual-polarized diamond-ring slot antenna in terms of radiation efficiency, total efficiency, and
maximum gain are illustrated in Figure 6. As seen, the antenna provides high efficiencies. More than
80% radiation and total efficiency properties are obtained over the entire operation band (3.2–4 GHz).
It can be observed that the antenna exhibits almost similar radiation and total efficiency. In addition,
the antenna has around 2.5 dBi directivity.
(a) (b)
Figure 4. Simulated current densities at 3.6 GHz for (a) 1st feeding port and (b) 2nd feeding port.
(a) (b)
Figure 5. 3D views of the dual-polarized radiation patterns from (a) feeding port 1 and (b) feeding
port 2.
A prototype of the design was fabricated and its S parameters were tested. Figure 7 shows a
photograph of the fabricated prototype in the measurement setup. Figure 8 illustrates the measured
and simulated S parameter results of the fabricated antenna. It is observed that the fabricated antenna
works properly at the desired frequency range.
291
Electronics 2019, 8, 521
Figure 6. Radiation, total efficiencies, maximum gain of the diamond-ring slot antenna.
292
Electronics 2019, 8, 521
(a)
(b) (c)
Figure 9. Designed mobile-phone antenna (a) transparent view, (b) top-layer and (c) bottom layer.
293
Electronics 2019, 8, 521
(a)
(b)
Figure 10. (a) Snn and (b) Smn results of the mobile-phone antenna.
(a) (b)
Figure 11. 3D transparent views of the radiation patterns for (a) feeding port 1 and (b) feeding port 2.
Top-views of the radiation patterns for the proposed mobile-phone antenna design are displayed
in Figure 12. It can be seen that each side of the mobile-phone PCB has been covered with differently
polarized radiation patterns. Thus, the MIMO antenna exhibited good radiation coverage and
can support different polarizations which make it more suitable to be used in future smartphones.
Furthermore, the antenna provides high radiation and total efficiencies over the operation band, as
294
Electronics 2019, 8, 521
illustrated in Figure 13: more than 70% radiation and total efficiencies were obtained for the radiation
elements at 3.6 GHz.
The proposed mobile-phone antenna was fabricated and its characteristics were tested in the
Antenna Laboratory at the University of Bradford. Top and bottom views of the prototype are shown
in Figure 14a,b, respectively. The mobile-phone antenna is constructed on a cheap FR4 dielectric
with an overall dimension of 75 × 150 × 1.6 mm3 . During the measurement process, 50-Ω RF loads
are employed for the elements not under test to avoid their effects, as shown in Figure 14c. The
measured and simulated S parameters (Snn : S11 –S88 and Smn : S21 –S81 ) of the fabricated design are
illustrated in Figure 15. As illustrated, the diamond-ring slot resonators achieve good S parameters
with sufficient impedance bandwidth and low mutual coupling characteristic in the desired frequency
range. Some deviations from the measurements arise from the errors in fabrication, feeding and
experiment processes.
Figure 12. 3D radiation patterns of the fifth generation (5G) mobile-phone antenna.
295
Electronics 2019, 8, 521
Figure 13. Efficiencies of the antenna elements (Ant. 1–Ant. 8) for the proposed design.
(a) (b)
(c)
Figure 14. (a) Top, (b) bottom views of the fabricated design and (c) the prototype connected to the
cables and 50-Ohm loads.
296
Electronics 2019, 8, 521
(a)
(b)
Figure 15. Measured and simulated (a) Snn (S11 –S88 ) and (b) Smn (S21 –S81 ) of the fabricated prototype.
According to the point that the radiation elements with the same placements and polarizations
provide similar radiation patterns, 2D polar radiation patterns of the adjacent resonators (including
Ant. 1 and Ant. 2) were measured at center operating frequency (3.6 GHz) and illustrated in Figure 16.
As shown, the sample prototype exhibits good radiation patterns and provides acceptable agreement
with simulations. In addition, the antenna elements with different polarizations provide sufficient gain
values at the center frequency of the operation band.
(a) (b)
Figure 16. Measured and simulated 2D radiation patterns for (a) Ant.1 and (b) Ant.2.
297
Electronics 2019, 8, 521
In order to ensure that the MIMO antenna can work properly, ECC and TARC characteristics
are two important parameters which should be considered in MIMO antennas [35,36]. The ECC and
TARC of two elements can be calculated from the S parameters using the formula described as:
2
S∗mm Snm + S∗mn Snn
ECC = ∗ , (1)
1 − |Smm |2 − |Smn |2 1 − |Snm |2 − |Snn |2
1
(Smm + Smn )2 + (Snm + Snn )2
TARC = − . (2)
2
Figure 17 shows the calculated ECC and TARC results from simulated and measured S parameters
of the mobile-phone antenna design. As evident from figures, the calculated ECC and TARC results
are very low over the whole band of interest. The design provides less than 0.01 ECC over the
entire operating band and proves that two adjacent antenna elements are irrelevant. In addition,
its TARC value is less than −30 at 3.6 GHz. Table 2 summarizes and compares the fundamental
characteristics of the presented mobile-phone antenna with the recently reported 5G mobile-phone
antenna designs [16–25]. It can be observed that the proposed design can provide better performances
in terms of efficiency, isolation and ECC. In addition, it exhibits wider bandwidth with pattern and
polarization diversity characteristics to cover different sides of the mobile-phone PCB.
(a)
(b)
Figure 17. Calculated (a) envelope correlation coefficient (ECC) and (b) total active reflection coefficient
(TARC) from measured S parameters.
298
Electronics 2019, 8, 521
Table 2. Comparison between the presented and recently reported 5G mobile-phone antennas.
Reference Bandwidth (GHz) Efficiency (%) Size (mm2 ) Isolation (dB) ECC
[18] 3.4–3.6 55–60 100 × 50 10 -
[19] 3.4–3.6 - 136 × 68 15 -
[20] 3.55–3.65 52–76 150 × 75 11 -
[21] 3.4–3.6 35–50 150 × 75 11 <0.40
[22] 3.4–3.6 30–50 145 × 70 15 <0.2
[23] 3.4–3.6 40–60 136 × 68 14 <0.2
[24] 3.4–3.6 60–75 150 × 80 17 <0.05
[25] 3.4–3.6 50–70 150× 73 17 <0.07
[26] 3.4–3.6 50–80 150 × 75 15 <0.2
[27] 3.4–3.6 60–70 150 × 75 18 <0.015
Proposed 3.3–3.9 60–80 150 × 75 17 <0.01
(a) (b)
(c) (d)
Figure 18. Placement and total efficiencies of the design for different user-hand scenarios
(a) right-hand/top-layer, (b) right-hand/back-layer, (c) left-hand/top-layer and (d) left-hand/back-layer.
3D radiation patterns of the mobile-phone antenna in Talk-Mode at 3.6 GHz are illustrated in
Figure 19a. It should be noted that the radiation performance of each antenna element mainly depends
on its locations in the Talk-Mode scenario. As shown, the realized gain characteristic of the design
varies from 3.2 to 4.9 dB. Compared with the radiation patterns in free space (Figure 12), due to the
existence of the user’s head and hand, radiation patterns are a bit distorted and become weaker. One
299
Electronics 2019, 8, 521
can see that antenna elements are touched by different parts of the hand and head phantoms in the
presented Talk-Mode.
(a)
(b)
Figure 19. (a) 3D and (b) 2D linearly-scaled radiation patterns of the mobile-phone antenna in
Talk-Mode scenario.
The maximum reductions of the radiation properties are observed for the elements that are located
near to user-head (Ant. 3 and Ant. 4) [40]. However, the difference is not very significant. The 2D-polar
300
Electronics 2019, 8, 521
(linear-scaling) radiation patterns of the design are illustrated in Figure 19b. As can be observed, the
directivity of the antenna radiation pattern is maximum in the opposite direction of the head, which is
most important part of the body to protect from the radiation. The main lobe of each single-element
directs most of the power while the other lobes should be negligible.
Figure 20 depicts the total efficiencies and reflection coefficient (Snn ) of the antenna elements in
the presence of the user-head and user-hand in Talk-Mode scenario. As seen, the diversion of the Snn
characteristic of the design is not significant. In addition, the proposed MIMO design exhibit sufficient
efficiency in its operation bandwidth. Based on the above analysis, we can conclude the MIMO design
provides sufficient efficiency, radiation coverage and gain levels for diamond-ring slot radiators.
(a) (b)
Figure 20. (a) Snn and (b) total efficiencies of the proposed smartphone antenna in Talk-Mode scenario.
5. Conclusions
A mobile-phone antenna design with dual-polarized radiators is proposed for 5G massive MIMO
communications. The antenna configuration contains eight-port/four elements of diamond-rings
slot radiators with L-shaped microstrip feed lines deployed at four corners of the PCB. The antenna
elements exhibit wide bandwidth with the center frequency of 3.6 GHz. S parameters, radiation
patterns, efficiency, ECC and TARC results of the design are studied and sufficient results are achieved.
In addition, a prototype of the mobile-phone antenna was fabricated and measured. Moreover, the
performances of the antenna in Hand-Mode and Talk-Mode scenarios are investigated. The obtained
results demonstrated that the proposed smartphone antenna provides good characteristics and meets
the requirements for use in future mobile handsets.
Author Contributions: Writing—original draft preparation, N.O.P., H.J.B., M.A., Y.O.P., Y.I.A.A.-Y., R.A.A.-A.,
and E.L.; writing—review and editing, N.O.P. and R.A.A.-A.; investigation, N.O.P., H.J.B., M.A., Y.O.P., Y.I.A.A.-Y.;
resources, N.O.P., R.A.A.-A., E.L. and; For other cases, all authors have participated.
Funding: This project has received funding from the European Union’s Horizon 2020 research and innovation
programme under grant agreement H2020-MSCA-ITN-2016 SECRET-722424.
Acknowledgments: The authors wish to express their thanks to the support provided by the innovation
programme under grant agreement H2020-MSCA-ITN-2016 SECRET-722424.
Conflicts of Interest: The authors declare no conflict of interest.
References
1. Jensen, M.; Wallace, J. A review of antennas and propagation for MIMO wireless communications. IEEE
Trans. Antennas Propag. 2004, 52, 2810–2824. [CrossRef]
2. Gesbert, D.; Shafi, M.; Shiu, D.S.; Smith, P.J.; Naguib, A. From theory to practice: An overview of MIMO
space-time coded wireless systems. IEEE J. Sel. Areas Commun. 2003, 21, 281–302. [CrossRef]
3. Votis, C.; Christofilakis, V.; Raptis, V.; Tatsis, G.; Chronopoulos, S.K.; Kostarakis, P. Design and analysis of
a multiple-output transmitter based on DDS architecture for modern wireless communications. AIP Conf.
Proc. 2010, 1203, 421–426.
301
Electronics 2019, 8, 521
4. Kammoun, A.; Debbah, M.; Alouini, M.S. Design of 5G full dimension massive MIMO systems. IEEE Trans.
Commun. 2018, 66, 726–740.
5. Osseiran, A.; Boccardi, F.; Braun, V.; Kusume, K.; Marsch, P.; Maternia, M.; Queseth, O.; Schellmann, M.;
Schotten, H.; Taoka, H.; et al. Scenarios for 5G mobile and wireless communications: The vision of the METIS
project. IEEE Commun. Mag. 2014, 52, 26–35. [CrossRef]
6. Angelis, C.T.; Chronopoulos, S.K. System performance of an LTE MIMO downlink in various fading
environments. In Proceedings of the Ambient Media and Systems, Berlin, Germany, 27–28 January 2011;
pp. 36–43.
7. Ojaroudiparchin, N.; Shen, M.; Pedersen, G.F. Multi-layer 5G mobile phone antenna for multi-user MIMO
communications. In Proceedings of the 23rd Telecommunications Forum (TELFOR), Belgrade, Serbia, 24–26
November 2015; pp. 559–562.
8. Hassan, N.; Fernando, X. Massive MIMO Wireless Networks: An Overview. Electronics 2017, 6, 63. [CrossRef]
9. Larsson, E.; Edfors, O.; Tufvesson, F.; Marzetta, T. Massive MIMO for next generation wireless Systems. IEEE
Commun. Mag. 2014, 52, 186–195. [CrossRef]
10. Yang, H.H.; Quek, T.Q.S. Massive MIMO Meet Small Cell; Springer Briefs in Electrical and Computer
Engineering; Springer International Publishing: Basel, Switzerland, 2017.
11. Sharawi, M.S. Printed MIMO Antenna Engineering; Artech House: Norwood, MA, USA, 2014.
12. Parchin, N.O.; Basherlou, H.J.; Al-Yasir, Y.I.A.; Abd-Alhameed, R.A.; Abdulkhaleq, A.M.; Noras, J.M. Recent
developments of reconfigurable antennas for current and future wireless communication systems. Electronics
2019, 8, 128. [CrossRef]
13. Parchin, N.O.; Al-Yasir, Y.; Abdulkhaleq, A.M.; Elfergani, I.; Rayit, A.; Noras, J.M.; Rodriguez, J.;
Abd-Alhameed, R.A. Frequency reconfigurable antenna array for mm-Wave 5G mobile handsets. In
Proceedings of the 9th International Conference on Broadband Communications, Networks, and Systems,
Faro, Portugal, 19–20 September 2018.
14. Al-Yasir, Y.; Abdullah, A.; Ojaroudi Parchin, N.; Abd-Alhameed, R.; Noras, J. A new
polarization-reconfigurable antenna for 5G applications. Electronics 2018, 7, 293. [CrossRef]
15. Hussain, R.; Alreshaid, A.T.; Podilchak, S.K.; Sharawi, M.S. Compact 4G MIMO antenna integrated with a
5G array for current and future mobile handsets. IET Microw. Antennas Propag. 2017, 11, 271–279. [CrossRef]
16. Chen, Q.; Lin, H.; Wang, J.; Ge, L.; Li, Y.; Pei, T. Single ring slot based antennas for metal-rimmed 4G/5G
smartphones. IEEE Trans. Antennas Propag. 2019, 67, 1476–1487. [CrossRef]
17. Ojaroudi Parchin, N.; Abd-Alhameed, R.A. A compact Vivaldi antenna array for 5G channel sounding
applications. In Proceedings of the 12th European Conference on Antennas and Propagation, London, UK,
9–13 April 2018.
18. Al-Hadi, A.A.; Ilvonen, J.; Valkonen, R.; Viikan, V. Eight-element antenna array for diversity and MIMO
mobile terminal in LTE 3500MHz band. Microw. Opt. Technol. Lett. 2014, 56, 1323–1327. [CrossRef]
19. Abdullah, M. Compact 4-Port MIMO antenna system for 5G mobile terminal. In Proceedings of the
International Applied Computational Electromagnetics Society Symposium, Florence, Italy, 26–30 March
2017.
20. Ojaroudi Parchin, N.; Al-Yasir, Y.I.A.; Abd-Alhameed, R.A.; Noras, J.M. Dual-polarized MIMO antenna array
design using miniaturized self-complementary structures for 5G smartphone applications. In Proceedings of
the EuCAP Conference, Krakow, Poland, 31 March–5 April 2019. accepted.
21. Wong, K.L.; Lu, J.Y.; Chen, L.Y.; Li, W.Y.; Ban, Y.L. 8-antenna and 16-antenna arrays using the quad-antenna
linear array as a building block for the 3.5-GHz LTE MIMO operation in the smartphone. Microw. Opt.
Technol. Lett. 2016, 58, 174–181. [CrossRef]
22. Rao, L.-Y.; Tsai, C.-J. 8-loop antenna array in the 5 inches size smartphone for 5G communication the
3.4 GHz–3.6 GHz band MIMO operation. In Proceedings of the Progress in Electromagnetics Research
Symposium (PIERS), Toyama, Japan, 1–4 August 2018.
23. Abdullah, M.; Ban, Y.L.; Kang, K.; Li, M.Y.; Amin, M. Eight-element antenna array at 3.5GHz for MIMO
wireless application. Prog. Electromagn. Res. C 2017, 78, 209–217. [CrossRef]
24. Li, Y.; Luo, Y.; Yang, G. High-isolation 3.5-GHz 8-antenna MIMO array using balanced open slot antenna
element for 5G smartphones. IEEE Trans. Antennas Propag. 2019. [CrossRef]
25. Sun, L.; Feng, H.; Li, Y.; Zhang, Z. Compact 5G MIMO mobile phone antennas with tightly arranged
orthogonal-mode pairs. IEEE Trans. Antennas Propag. 2018, 66, 6364–6369. [CrossRef]
302
Electronics 2019, 8, 521
26. Li, M.Y.; Ban, Y.L.; Xu, Z.Q.; Guo, J.; Yu, Z.F. Tri-polarized 12-antenna MIMO array for future 5G smartphone
applications. IEEE Access 2018, 6, 6160–6170. [CrossRef]
27. Zhao, A.; Ren, Z. Size reduction of self-isolated MIMO antenna system for 5G mobile phone applications.
IEEE Antennas Wirel. Propag. Lett. 2019, 18, 152–156. [CrossRef]
28. Statement: Improving Consumer Access to Mobile Services at 3.6 GHz to 3.8 GHz. Available online:
https://www.ofcom.org.uk/consultations-and-statements/category-1/future-use-at-3.6-3.8-ghz (accessed on
21 October 2018).
29. CST Microwave Studio; ver. 2018; CST: Framingham, MA, USA, 2018.
30. Chang, K. Microwave Ring Circuits and Antennas; Wiley: New York, NY, USA, 1996.
31. Batchelor, J.C.; Langley, R.J. Microstrip annular ring slot antennas for mobile applications. Electron. Lett.
2019, 32, 1635–1636. [CrossRef]
32. Yoshimura, Y. A microstripline slot antenna. IEEE Trans. Microw. Theory Tech. 1972, 20, 760–762. [CrossRef]
33. Elfergani, I.; Hussaini, A.S.; Rodriguez, J.; Abd-Alhameed, R. Antenna Fundamentals for Legacy Mobile
Applications and Beyond; Springer Nature: Basingstoke, UK, 2017; pp. 1–659.
34. Al-Nuaimi, M.K.T.; Whittow, W.G. Performance investigation of a dual element IFA array at 3 GHz for
MIMO terminals. In Proceedings of the Antennas and Propagation Conference (LAPC), Loughborough, UK,
14–15 November 2011; pp. 1–5.
35. Nguyen, Q.D.; Le, T.T.; Le, D.T.; Tran, X.N.; Yamada, Y. A compact MIMO ultra-wide band antenna with low
mutual coupling. Appl. Comput. Electromagn. Soc. J. 2018, 31, 252–260.
36. Parchin, N.O.; Al-Yasir, Y.I.; Ali, A.H.; Elfergani, I.; Noras, J.M.; Rodriguez, J.; Abd-Alhameed, R.A.
Eight-element dual-polarized MIMO slot antenna system for 5G smartphone applications. IEEE Access 2019,
9, 15612–15622. [CrossRef]
37. Syrytsin, I.; Zhang, S.; Pedersen, G.F. Performance investigation of a mobile terminal phased array with user
effects at 3.5 GHz for LTE advanced. IEEE Antennas Wirel. Propag. Lett. 2017, 16, 1847–1850. [CrossRef]
38. Sharawi, M.S. Printed multi-band MIMO antenna systems and their performance metrics [wireless corner].
IEEE Antennas Propag. Mag. 2013, 55, 218–232. [CrossRef]
39. Ojaroudi Parchin, N.; Alibakhshikenari, M.; Jahanbakhsh Basherlou, H.; AAbd-Alhameed, R.; Rodriguez, J.;
Limiti, E. mm-Wave phased array quasi-Yagi antenna for the upcoming 5G cellular communications. Appl.
Sci. 2019, 9, 978. [CrossRef]
40. Basar, M.R.; Malek, M.F.; Juni, K.M.; Saleh, M.I.; Idris, M.S.; Mohamed, L.; Saudin, N.; Mohd Affendi, N.A.;
Ali, A. The use of a human body model to determine the variation of path losses in the human body channel
in wireless capsule endoscopy. Prog. Electromagn. Res. 2013, 133, 495–513. [CrossRef]
© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access
article distributed under the terms and conditions of the Creative Commons Attribution
(CC BY) license (http://creativecommons.org/licenses/by/4.0/).
303
electronics
Article
High-Isolation Leaky-Wave Array Antenna Based on
CRLH-Metamaterial Implemented on SIW with ±30o
Frequency Beam-Scanning Capability
at Millimetre-Waves
Mohammad Alibakhshikenari 1, *, Bal Singh Virdee 2 , Chan H. See 3,4 , Raed A. Abd-Alhameed 5 ,
Francisco Falcone 6 and Ernesto Limiti 1
1 Electronic Engineering Department, University of Rome “Tor Vergata”, Via del Politecnico 1, 00133 Rome,
Italy; [email protected]
2 Center for Communications Technology, School of Computing & Digital Media, London Metropolitan
University, London N7 8DB, UK; [email protected]
3 School of Engineering & the Built Environment, Edinburgh Napier University, 10 Colinton Rd.; Edinburgh,
EH10 5DT, UK; [email protected]
4 School of Engineering, University of Bolton, Deane Road, Bolton BL3 5AB, UK
5 Faculty of Engineering & Informatics, University of Bradford, Bradford, West Yorkshire BD7 1DP, UK;
[email protected]
6 Electrical and Electronic Engineering Department, Public University of Navarre, 31006 Pamplona, Spain;
[email protected]
* Correspondence: [email protected]
Abstract: The paper presents a feasibility study on the design of a new metamaterial leaky-wave
antenna (MTM-LWA) used in the construction of a 1 × 2 array which is implemented using
substrate-integrated waveguide (SIW) technology for millimetre-wave beamforming applications.
The proposed 1 × 2 array antenna consists of two LWAs with metamaterial unit-cells etched on the top
surface of the SIW. The metamaterial unit-cell, which is an E-shaped transverse slot, causes leakage
loss and interrupts current flow over SIW to enhance the array’s performance. The dimensions
of the LWA are 40 × 10 × 0.75 mm3 . Mutual-coupling between the array elements is suppressed
by incorporating a metamaterial shield (MTM-shield) between the two antennas in the array. The
LWA operates over a frequency range of 55–65 GHz, which is corresponding to 16.66% fractional
bandwidth. The array is shown to exhibit beam-scanning of ±30◦ over its operating frequency range.
Radiation gain in the backward (−30◦ ), broadside (0◦ ), and forward (+30◦ ) directions are 8.5 dBi,
10.1 dBi, and 9.5 dBi, respectively. The decoupling slab is shown to have minimal effect on the array’s
performance in terms of impedance bandwidth and radiation specifications. The MTM-shield is
shown to suppress the mutual coupling by ~25 dB and to improve the radiation gain and efficiency
by ~1 dBi and ~13% on average, respectively.
Keywords: Metamaterials (MTM); leaky-wave antenna (LWA); antenna arrays; substrate integrated
waveguide (SIW); transverse slots; beam-scanning; mutual coupling isolation; millimetre-wave;
composite right/left-handed transmission line (CRLH-TL)
1. Introduction
Leaky-wave antennas (LWAs) are travelling wave antennas with electrically large radiating
aperture [1,2]. Such antennas can provide high gain directive beam without using a complex feeding
network [3,4]. The advantage of LWA over a conventional array antenna is a simple feed structure
with low loss [5–7]. Conventional planar LWAs radiate higher order modes in the forward direction [8].
However, the periodic structure-based LWA can radiate in both forward and backward directions. It
has been shown that metamaterial-based LWA designs can achieve a continuous main beam scanning
from the backward to the forward direction as a function of frequency [9]. Various LWA designs based
on metamaterial structures have been considered in the past. Such designs include (1) an LWA with
a composite right/left-hand (CRLH)-folded substrate-integrated waveguide (SIW) structure that is
shown to provide beam scanning from of −58◦ to 65◦ with a gain of 1 dBi [10]; (2) an interdigital-shaped
slotted-SIW-based LWA that is reported to achieve a scanning angle of −60◦ to 70◦ with gain of around
8 dBi [11]; (3) a CRLH LWA based on a rectangular waveguide structure that has been demonstrated
for a continuous main beam scanning range from −70◦ to 70◦ with gain of 8.64 dBi [12]; and (4) a planar
slotted SIW LWA that provides a scanning range of −66◦ to 78◦ with consistent gain [13].
In this paper, a new type of LWA in array configuration is proposed based on SIW with metamaterial
inclusions for millimetre-wave applications. Mutual coupling between the closely-spaced antennas
in the array can undermine the array’s performance. To circumvent this, a metamaterial shield is
embedded between the two LWAs. With this approach, mutual coupling is shown to reduce by an
average of ~25 dB over the array’s operating frequency range. Engraved on the upper layer of the SIW
LWA are several metamaterial unit-cells comprising of transverse E-shaped slots. Dimensions of the
LWAs were modified for optimum array performance.
Figure 1. Cont.
306
Electronics 2019, 8, 642
Figure 1. Proposed 1 × 2 array antenna based on MTM-LWA using SIW technology. (a) Top view; (b)
view to show the Substrate integrated waveguide slots; (c) back side to show the ground plane.
The structural parameters of the MTM-LWA array are summarized in Table 1. Each antenna has
dimensions of 40 × 10 × 0.75 mm3 . The overall ground plane dimensions are 50 × 35 × 0.75 mm3 . The
S-parameter responses of the proposed array antenna are exhibited in Figure 2, which shows it operates
throughout the frequency range of 55–65 GHz, which corresponds to 16.66% fractional bandwidth.
307
Electronics 2019, 8, 642
Figure 2. S-parameter responses of the proposed array antennas. Since the structure is symmetrical,
we have not plotted all curves.
Radiation gain patterns of the proposed array antenna at three spot frequencies within its operating
frequency range are plotted in Figure 3. It is evident that the array antenna is capable of beam-scanning
◦
from −30◦ to +30◦ with backward radiation at −30◦ , broadside radiation at 0 , and forward radiation
at +30◦ . In backward, broadside, and forward directions, the gain is 8.5, 10.1, and 9.5 dBi, respectively.
Figure 3. Cont.
308
Electronics 2019, 8, 642
Figure 3. Radiation characteristics of the proposed 1 × 2 MTM-LWA array at 55 GHz, 60 GHz, and 65
GHz. (a) Backward-radiation at 55 GHz; (b) broadside-radiation at 60 GHz; (c) forward-radiation at
65 GHz.
3. Suppress the Mutual Coupling Between the Closely Spaced Mtm-Lwa Arrays
Mutual coupling between closely-spaced radiation elements can severely undermine the array’s
radiation performances. Here the isolation between the two MTM-LWAs is increased by introducing
a metamaterial shield which is based on the SIW structure, as indicated in Figure 4. It comprises of
transverse slots that are tapered. The slots have a width of 0.5 mm and essentially play the role of
the series left-handed capacitances (CL ), where the metallic via-holes with diameter of 0.25 mm act as
shunt left-handed inductances (LL ). The MTM-shield suppresses surface waves created by the LWAs
to increase isolation between the two antennas in the array. The overall dimensions of the shield are
40 × 4 mm2 .
Figure 5 shows the S-parameter response before and after applying the MTM-shield. After
applying MTM-shield, the minimum, average and maximum suppression observed are 8 dB, ~25 dB,
and 42.5 dB, respectively. This shows the effectiveness of the proposed isolation technique. The shield
has no influence on the reflection coefficient response which is S11 ≤ −10 dB.
Radiation patterns of the proposed antenna arrays with no and with MTM-shield through its
operational bandwidth at spot frequencies of 55 GHz, 60 GHz & 65 GHz are plotted in Figure 6. It is
clear from this figure that with the shield the cross-polarized radiation over its operating range is
substantially reduced. The average gain of the co-polarized radiation is only marginally affected.
All details are tabulated in Table 2.
In addition, the radiation gain and efficiency curves over frequency bandwidth for both antennas
without and with the proposed shield are shown in Figure 7. Obviously, after realizing the metamaterial
shield based on SIW, the radiation gain and efficiency performances improved by ~1dBi and ~13% on
average, respectively.
Surface current density distributions without and with the MTM-shield are exhibited in Figure 8.
This figure shows that the MTM-shield is an effective EM band-gap structure to remarkably block
surface currents from EM interacting with adjacent radiation elements in the antenna array. Destructive
influences of surface currents in the antenna are dramatically suppressed from effecting the far-field of
the array antennas.
309
Electronics 2019, 8, 642
Figure 4. Proposed SIW-based leaky-wave antenna array with MTM-shield. (a) Proposed MTM-SIW
shield located between the array antennas; (b) top-view of the leaky-wave array antennas; (c) back-side
to show ground plane.
Figure 5. Reflection and transition coefficients of the proposed antenna array before and after applying
the MTM-shield. Since the structure is symmetrical, we have not plotted all curves.
310
Electronics 2019, 8, 642
Figure 6. Co- and cross-polarized radiation gain patterns of the proposed structure without (WO) and
with (W) metamaterial-shield.
311
Electronics 2019, 8, 642
Radiation Gain
Without MTM With MTM
Minimum 8.69 dBi 9.6 dBi
Maximum 9.11 dBi 10.16 dBi
Average 8.90 dBi 9.86 dBi
Improvement in Average ~1 dBi
Radiation Efficiency
Without MTM With MTM
Minimum 62.53% 73.44%
Maximum 65.21% 78.12%
Average 63.5% 76.38
Improvement in Average ~13%
Figure 7. Radiation gain and efficiency curves over frequency band for both cases with no and with
MTM shield. (a) Radiation-gain; (b) Radiation-efficiency.
Figure 8. Surface current density distributions without and with the metamaterial shield at 60 GHz. (a)
without the metamaterial shield; (b) with the metamaterial shield.
4. Circuit Model of The Proposed MTM-Lwa Array and its Dispersion Phenomenon
One way to explain the metamaterials (MTMs) is the transmission-line theory in terms of the circuit
models. The concept of the composite right/left-handed metamaterial transmission lines (CRLH-MTM
TLs) is investigated and realized based on this approach. This solution has been broadly recognized
and adopted as a powerful analysis tool for the understanding and modelling of MTM devices. By
312
Electronics 2019, 8, 642
considering the right-handed (RH) effects within a purely left-handed (LH) circuit, it demonstrates
a general configuration of a practical MTM-TL. The circuit model of a generic symmetrical CRLH
transmission-line unit-cell has exhibited in Figure 9 where the loss is neglected for simplicity. The
series capacitance (CL ) and the shunt inductance (LL ), which have been realized by the slots and
via-holes, respectively, contribute to the left-handedness while the series inductance (LR ) and the shunt
capacitance (CR ), which have been implemented by the unwanted currents flowing on the patches and
the gap space between the patches and ground plane, respectively, actualize its right-handed (RH) dual
counterpart. The one indicated in Figure 9a is called T-type model with the LH capacitance placed
at the two ends. The mushroom unit cell belongs to this type [14]. The circuit exhibited in Figure 9b
is called π-type model with the LH capacitance in the centre. One example is the CRLH-SIW unit
cell [15–17]. Therefore, each unit cell of the proposed leaky wave array antennas is based on the π-type
model, which has been identified in Figure 4b.
Figure 9. Equivalent circuit models for the symmetrical CRLH-metamaterial unit-cells. (a) T-type
circuit model. (b) π-type circuit model.
By using the periodic boundary conditions corresponding to the Bloch–Floquet theorem, these two
CRLH transmission line unit-cells basically become equal and their dispersion relevance is determined
to be [9]:
1 ω ω2 ω ω2
2 2
1
β(ω) = cos−1 (1 − ( L2 + 2 − 2L − 2L )) (1)
p 2 ω ωR ωse ωsh
where p = 5 mm is the length of the unit-cell and
1
ωL = √ (2)
CL LL
1
ωR = √ (3)
CR LR
1
ωse = √ (4)
C L LR
1
ωsh = √ (5)
CR LL
Seemingly, there are two frequency spots referred to as the infinite-wavelength points (β = 0)
with a bandgap in between. In the balanced case (ωse = ωsh ), the bandgap vanishes. Generally, just
one particular zeroth-order resonance will be excited that depends on the boundary conditions and the
circuit values. For the short-ended resonator, it is determined by ωse , while for the open-ended case, it
is represented by ωsh [9]. Multiplex resonances containing the negative-, zeroth-, and positive- order
resonances can be produced by cascading more than one unit-cell. Those resonance frequencies of
313
Electronics 2019, 8, 642
different order modes for an M-stage CRLH-transmission line can be discovered on the dispersion
diagram when the following condition is satisfied [9]:
Figure 10. Dispersion diagrams for the proposed LWA array based on SIW-MTM extracted by CST
Microwave Studio package and the corresponding equivalent circuit shown in Figure 9b.
The proposed LWA is exhibited as a traveling-wave antenna, where the current propagates along
a guiding structure. Since that, the perturbations are introduced along the structure by implementing
the E-shaped transverse and the traveling-wave leaves the structure and radiates into free-space.
Therefore, in the ideal case, no energy reaches the end of the structure. In a practical scenario, any
energy that reaches the end is absorbed by a matching load. Usually, LWA is designed, in which
at the least 90% of the power at the structures leaks away before the traveling-waves reach the end
314
Electronics 2019, 8, 642
of the antenna. Leaky-wave phenomenon is demonstrated with fast propagating waves only. The
propagating wave number Kp is defined by [18,19]
Kp = K02 − Kz2 (8)
Kz = β − jα (9)
where α and β are the attenuation and phase constants respectively. Supposing that there is a standard
free-space wave equation for the above wave, the waves outside the leaking-structure are given by:
If β < Kp i.e., if the phase velocity is smaller than the free-space velocity of light, Vp < C, then it is a
slow-wave and Kp is imaginary. The wave decays exponentially in amplitude along the length of the
structure and it is a bounded wave. If, β > Kp i.e., if the phase velocity is greater than the free space
velocity of light, Vp > C, then it is a fast-wave and Kp is purely real; therefore, the real power at an
angle is radiated with respect to the normal defined by [20]:
* + * +
β cβ
sin(θ) = sin−1 = sin−1 (11)
K0 ω
Since all of the abovementioned terms are functions of the angular frequency, the angle changes with
frequency; hence, this shows frequency scanning behaviour. The main beam-width is
0.91
Δθ0 = (12)
l
λ cosθ0
If the above equation is applied for large antenna lengths, high directivity can be specified as
4πAe
D= (13)
λ0
However, the effect of enhancing directivity is negligible if there is no power left near the end of the
structures. To specify this parameter, the attenuation/leakage constant is determined as [20]:
A2 ( z )
2
er
αz = / l /z (14)
0
A2 (z)dz − er 0
A2 (z)dz
Thus, if α is sufficiently small so that (1 − e−2αl > 0), the improvement of directivity is perceptible as
length l enhances.
315
Electronics 2019, 8, 642
GHz to 65 GHz; (iii) simple design; (iv) improved radiation patterns; (v) enhanced radiation gain; (vi)
low cross-polarization levels; and (vii) mutual coupling suppression on average of ~25dB over its
operating band.
Table 3. Performance parameters of the proposed lw array antennas in comparison with the
recent papers.
Max. Isolation Bandwidth Rad. Gain Pattern No. of Application of Symmetry and
Ref. Method
Improvement (FBW) Deterioration Elements DGS Simplicity
[21] SCSRR 10 dB Narrow Yes 2 Yes No
[22] SCSSRR 14.6 dB Narrow Yes 2 Yes No
[23] Compact EBG 17 dB Narrow Yes 2 Yes No
[24] Fractal MTM-EMBG 37 dB Wide (~15%) No 4 No Yes
[25] Meander Line 10 dB Narrow No 2 Yes No
[26] EBG 8.8 dB Narrow - Yes No
[27] EBG 13 dB Wide (~12%) Yes 2 Yes No
Substrate Integrated
[28] Waveguide (SIW) with 15 dB Wide (22.22%) - 1 No Yes
Transverse Slots
Substrate Integrated
[29] Waveguide (SIW) and 20 dB Wide (16.32%) - 1 No Yes
Slots
Periodic Space-Time
[30] - Wide (58.82%) - 1 No Yes
Modulation
Space Time Modulation
[31] (External Linear - - - 1 No Yes
Momentum)
[32] MTM-DS 57 dB Wide No 2 No Yes
[33] Fractal load + DGS 16 dB Narrow (2.5%) No 2 Yes No
[34] Slotted Meander-Line 16 dB Narrow Yes 2 Yes No
[35] Slots >26 dB Wide No 4 No Yes
[36] I-Shaped Resonator 30dB Narrow Yes 2 Yes No
[37] W/g MTM 18 dB Narrow No 2 Yes No
[38] MSWI 13.5 dB Wide No 2 No Yes
[39] Metamaterial Superstrate 25 Narrow No 2 No Yes
Slot Combined
[40] Complementary Split 19 Narrow Yes 2 No No
Ring Resonator
[41] Metamaterial 40 Wide (8.8%) No 4 No Yes
[42] Fractal Load 37 Wide No 2 No Yes
Metamaterials and
This
Substrate Integrated 42.5 dB Wide (16.66%) No 2 No Yes
work
Waveguide
6. Conclusion
A feasibility study of a new metamaterial leaky-wave array antenna based on substrate-integrated
waveguide (SIW) technology with transverse slots and metallic via-holes for operation over 55 GHz to
65 GHz was proposed and investigated. The array antenna provides beam-scanning capability of ±30◦
◦
with the gain of 8.5, 10.1, and 9.5 dBi at backward (-30◦ ), broadside (0 ), and forward (+30◦ ) directions,
respectively. To increase the isolation between the array’s elements, a metamaterial shield based on
SIW was introduced between the antennas, which has reduced the mutual coupling by an average
value of ~25 dB. In addition, the proposed MTM shield has increased the radiation gain and efficiency
by an average value of ~1dBi and ~13%, respectively.
Author Contributions: Conceptualization, M.A.; B.S.V.; F.F.; E.L.; methodology, M.A.; C.H.S.; F.F.; E.L.; software,
M.A.; B.S.V; C.H.S.; validation, M.A.; B.S.V.; C.H.S.; R.A.A.-A.; F.F.; E.L.; formal analysis, M.A.; B.S.V.; F.F.; E.L.;
investigation, M.A.; B.S.V.; C.H.S.; R.A.A.-A.; F.F.; E.L.; resources, M.A.; C.H.S.; R.A.A.-A.; E.L.; data curation,
M.A.; C.H.S.; R.A.A.-A.; F.F.; writing—original draft preparation, M.A.; writing—review and editing, B.S.V.;
C.H.S.; R.A.A.-A.; F.F.; E.L.; visualization, M.A.; B.S.V.; C.H.S.; F.F.; E.L.; supervision, E.L.; project administration,
R.A.A.-A.; F.F.; E.L.; funding acquisition, R.A.A.-A.; F.F.; E.L..
316
Electronics 2019, 8, 642
Funding: This work is partially supported by grant agreement H2020-MSCA-ITN-2016 SECRET-722424 and the
UK EPSRC under grant EP/E022936/1.
Conflicts of Interest: The authors declare no conflict of interest. The funders had no role in the design of the
study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to
publish the results.
References
1. Jackson, D.R.; Oliner, A.A. Leaky-wave antennas. In Modern Antenna Handbook; John Wiley & Sons: New
York, NY, USA, 2008; pp. 325–367.
2. Sievenpiper, D.F. Forward and backward leaky wave radiation with large effective aperture from an
electronically tunable textured surface. IEEE Trans. Antennas Propag. 2005, 53, 236–247. [CrossRef]
3. Jackson, D.R.; Caloz, C.; Itoh, T. Leaky-wave antennas. Proc. IEEE 2012, 100, 2194–2206. [CrossRef]
4. Lim, S.; Caloz, C.; Itoh, T. Metamaterial-based electronically controlled transmission-line structure as a novel
leaky-wave antenna with tunable radiation angle and beamwidth. IEEE Trans. Microw. Theory Tech. 2004, 52,
2678–2690. [CrossRef]
5. Caloz, C.; Itoh, T.; Rennings, A. CRLH metamaterial leaky-wave and resonant antennas. IEEE Antennas
Propag. Mag. 2008, 50, 25–39. [CrossRef]
6. Paulotto, S.; Baccarelli, P.; Frezza, F.; Jackson, D.R. A novel technique for open-stopband suppression in 1-D
periodic printed leaky-wave antennas. IEEE Trans. Antennas Propag. 2009, 57, 1894–1906. [CrossRef]
7. Chen, Z.N.; Qing, X. Slotted SIW Leaky-Wave Antenna with Improved Backward Scanning Bandwidth
and Consistent Gain. In Proceedings of the 2017 11th European Conference on Antennas and Propagation
(EUCAP), Paris, France, 19–24 March 2017; pp. 752–755.
8. Qian, Y.; Chang, B.C.C.; Itoh, T.; Chen, K.C.; Tzuang, C.K.C. High efficiency and broadband excitation
of leaky mode in microstrip structures. In Proceedings of the 1999 IEEE MTT-S International Microwave
Symposium Digest (Cat. No.99CH36282), Anaheim, CA, USA, 13–19 June 1999; pp. 1419–1422.
9. Caloz, C.; Itoh, T. Electromagnetic Metamaterials: Transmission Line Theory and Microwave Applications;
Wiley-IEEE Press: New York, NY, USA, 2005; 376p.
10. Yang, T.; Chi, P.L.; Xu, R.M. Novel composite right/left-handed leaky-wave antennas based on the folded
substrate-integrated waveguide structures. Prog. Electromagn. Res. 2012, 29, 235–248. [CrossRef]
11. Dong, Y.; Itoh, T. Composite right/left-handed substrate integrated waveguide and half mode substrate
integrated waveguide leaky-wave structures. IEEE Trans. Antennas Propag. 2011, 59, 767–775. [CrossRef]
12. Weitsch, Y.; Eibert, T.F. Analysis and design of a composite left/right-handed leaky wave antenna based on
the H10 rectangular waveguide mode. Adv. Radio Sci. 2008, 6, 49–54. [CrossRef]
13. Chen, Z.N.; Qing, X. Multilayered composite right/left-handed leaky-wave antenna with consistent gain.
IEEE Trans. Antennas Propag. 2012, 60, 5056–5062.
14. Lai, A.; Leong, K.M.; Itoh, T. Infinite wavelength resonant antennas with monopolar radiation pattern based
on periodic structures. IEEE Trans. Antennas Propag. 2007, 55, 868–876. [CrossRef]
15. Dong, Y.; Itoh, T. Miniaturized substrate integrated waveguide slot antennas based on negative order
resonance. IEEE Trans. Antennas Propag. 2010, 58, 3856–3864. [CrossRef]
16. Dong, Y.; Itoh, T. Substrate integrated waveguide negative order resonances and their applications. IEEE Trans.
Antennas Propag. 2010, 4, 1081–1091. [CrossRef]
17. Dong, Y.; Itoh, T. Metamaterial-Based Antennas. Proc. IEEE 2012, 100, 2271–2285. [CrossRef]
18. Johnson, R.C.; Jasik, H. Antenna Engineering Handbook, 4th ed.; McGraw-Hill Education: New York, NY, USA,
2007.
19. Mohsen, M.K.; Isa, M.S.M.; Isa, A.A.M.; Zin, M.S.I.M.; Saat, S.; Zakaria, Z.; Ibrahim, I.M.; Abu, M.; Ahmad, A.;
Abdulhameed, M.K. The Fundamental of Leaky Wave Antenna. J. Telecommun. Electron. Comput. Eng. 2018,
10, 19–127.
20. Frank, B.G. Frontiers in Antennas: Next Generation Design & Engineering; McGraw Hill Professional: New
York, NY, USA, 2010; 512p.
21. Bait-Suwailam, M.M.; Siddiqui, O.F.; Ramahi, O.M. Mutual coupling reduction between microstrip patch
antennas using slotted-complementary split-ring resonators. IEEE Antennas Wirel. Prop. Lett. 2010, 9,
876–878. [CrossRef]
317
Electronics 2019, 8, 642
22. Shafique, M.F.; Qamar, Z.; Riaz, L.; Saleem, R.; Khan, S.A. Coupling suppression in densely packed microstrip
arrays using metamaterial structure. Microw. Opt. Tech. Lett. 2015, 57, 759–763. [CrossRef]
23. Islam, M.T.; Alam, M.S. Compact EBG Structure for Alleviating Mutual Coupling between Patch Antenna
Array Elements. Prog. Electromagn. Res. 2013, 137, 425–438. [CrossRef]
24. Alibakhshikenari, M.; Virdee, B.S.; See, C.H.; Abd-Alhameed, R.; Ali, A.H.; Falcone, F.; Limiti, E. Study
on Isolation Improvement Between Closely Packed Patch Antenna Arrays Based on Fractal Metamaterial
Electromagnetic Bandgap Structures. IET Microw. Ant. Propag. 2018, 12, 2241–2247. [CrossRef]
25. Ghosh, T.; Ghosal, S.; Mitra, D.; Bhadra Chaudhuri, S.R. Mutual Coupling Reduction Between Closely Placed
Microstrip Patch Antenna Using Meander Line Resonator. Prog. Electromag. Res. Lett. 2016, 59, 115–122.
[CrossRef]
26. Yang, F.; Rahmat-Samii, Y. Microstrip Antennas Integrated with Elctromagnetic Band-Gap (EBG) Structures:
A Low Mutual Coupling Design for Array Applications. IEEE Trans. Antennas Propag. 2003, 51, 2936–2946.
[CrossRef]
27. Mu’ath, J.; Denidni, T.A.; Sebak, A.R. Millimeter Wave Compact EBG Structure for Mutual Coupling
Reduction Applications. IEEE Trans. Antennas Propag. 2015, 63, 823–828.
28. Liu, J.; Jackson, D.R.; Long, Y. Substrate integrated waveguide (SIW) leaky-wave antenna with transverse
slots. IEEE Trans. Antennas Propag. 2012, 60, 20–29. [CrossRef]
29. Ettorre, M.; Sauleau, R.; Le Coq, L. Multi-beam multi-layer leaky-wave SIW pillbox antenna for
millimeter-wave applications. IEEE Trans. Antennas Propag. 2011, 59, 1093–1100. [CrossRef]
30. Taravati, S.; Caloz, C. Mixer-duplexer-antenna leaky-wave system based on periodic space-time modulation.
IEEE Trans. Antennas Propag. 2017, 65, 442–452. [CrossRef]
31. Taravati, S.; Caloz, C. Space-time modulated nonreciprocal mixing, amplifying and scanning leaky-wave
antenna system. In Proceedings of the 2015 IEEE International Symposium on Antennas and Propagation &
USNC/URSI National Radio Science Meeting, Vancouver, BC, Canada, 19–24 July 2015; pp. 639–640.
32. Alibakhshikenari, M.; Virdee, B.S.; Shukla, P.; See, C.H.; Abd-Alhameed, R.; Khalily, M.; Falcone, F.; Limiti, E.
Interaction Between Closely Packed Array Antenna Elements Using Metasurface for Applications Such as
MIMO Systems and Synthetic Aperture Radars. Radio Sci. 2018, 53, 1368–1381. [CrossRef]
33. Yang, X.; Liu, Y.; Xu, Y.X.; Gong, S.X. Isolation Enhancement in Patch Antenna Array with fractal UC-EBG
Structure and Cross Slot. IEEE Antennas Wirel. Propag. Lett. 2017, 16, 2175–2178. [CrossRef]
34. Alsath, M.G.N.; Kanagasabai, M.; Balasubramanian, B. Implementation of Slotted Meander Line Resonators
for Isolation Enhancement in Microstrip Patch Antenna Arrays. IEEE Antennas Wirel. Propag. Lett. 2013, 12,
15–18. [CrossRef]
35. Alibakhshikenari, M.; Virdee, B.; Shukla, P.; See, C.; Abd-Alhameed, R.; Khalily, M.; Falcone, F.; Limiti, E.
Antenna Mutual Coupling Suppression Over Wideband Using Embedded Periphery Slot for Antenna Arrays.
Electronics 2018, 7, 198. [CrossRef]
36. Ghosh, C.K.; Parui, S.K. Reduction of Mutual Coupling Between E-Shaped Microstrip Antennas by Using a
Simple Microstrip I-Section. Microw. Opt. Tech. Lett. 2013, 55, 2544–2549. [CrossRef]
37. Qamar, Z.; Park, H.C. Compact Waveguided Metamaterials for Suppression of Mutual Coupling in Microstrip
Array. Prog. Electromagn. Res. 2014, 149, 183–192. [CrossRef]
38. Alibakhshikenari, M.; See, C.H.; Virdee, B.S.; Abd-Alhameed, R.A. Meta-surface Wall Suppression of Mutual
Coupling between Microstrip Patch Antenna Arrays for THz-band Applications. Prog. Electromagn. Res.
Lett. 2018, 75, 105–111. [CrossRef]
39. Qamar, Z.; Naeem, U.; Khan, S.A. Mutual coupling reduction for high performance densely packed patch
antenna arrays on finite substrate. IEEE Trans. Antennas Propag. 2016, 64, 1653–1660. [CrossRef]
40. Qamar, Z.; Riaz, L.; Chongcheawchamnan, M.; Khan, S.A.; Shafique, M.F. Slot combined complementary
split ring resonators for mutual coupling suppression in microstrip phased arrays. IET Microw. Antenna
Propag. 2014, 8, 1261–1267. [CrossRef]
318
Electronics 2019, 8, 642
41. Alibakhshikenari, M.; Khalily, M.; Virdee, B.S.; See, C.H.; Abd-Alhameed, R.A.; Limiti, E. Mutual-Coupling
Isolation Using Embedded Metamaterial EM Bandgap Decoupling Slab for Densely Packed Array Antennas.
IEEE Access 2019, 7, 5182–51840. [CrossRef]
42. Alibakhshikenari, M.; Khalily, M.; Virdee, B.S.; See, C.H.; Abd-Alhameed, R.A.; Limiti, E. Mutual Coupling
Suppression Between Two Closely Placed Microstrip Patches Using EM-Bandgap Metamaterial Fractal
Loading. IEEE Access 2019, 7, 23606–23614. [CrossRef]
© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access
article distributed under the terms and conditions of the Creative Commons Attribution
(CC BY) license (http://creativecommons.org/licenses/by/4.0/).
319
MDPI
St. Alban-Anlage 66
4052 Basel
Switzerland
Tel. +41 61 683 77 34
Fax +41 61 302 89 18
www.mdpi.com