0% found this document useful (0 votes)

11 views332 pages

Massive MIMO Systems, Paper 2020

This document is a reprint of articles from a special issue of the journal Electronics on the topic of massive MIMO systems. It contains an introductory article by the special issue editors and multiple technical research articles on various aspects of massive MIMO.

Uploaded by

john Bronson

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

11 views332 pages

Massive MIMO Systems, Paper 2020

Uploaded by

john Bronson

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 332

Massive MIMO

Systems
Edited by

Kazuki Maruta and Francisco Falcone

Printed Edition of the Special Issue Published in Electronics

www.mdpi.com/journal/electronics
Massive MIMO Systems
Massive MIMO Systems

Special Issue Editors

Kazuki Maruta
Francisco Falcone

MDPI • Basel • Beijing • Wuhan • Barcelona • Belgrade • Manchester • Tokyo • Cluj • Tianjin
Special Issue Editors
Kazuki Maruta Francisco Falcone
Chiba University Public University of Navarre
Japan Spain

Editorial Ofﬁce
MDPI
St. Alban-Anlage 66
4052 Basel, Switzerland

This is a reprint of articles from the Special Issue published online in the open access journal Electronics
(ISSN 2079-9292) (available at: https://www.mdpi.com/journal/electronics/special issues/mimo).

For citation purposes, cite each article independently as indicated on the article page online and as
indicated below:

LastName, A.A.; LastName, B.B.; LastName, C.C. Article Title. Journal Name Year, Article Number,
Page Range.

ISBN 978-3-03936-016-1 (Hbk)

ISBN 978-3-03936-017-8 (PDF)

c 2020 by the authors. Articles in this book are Open Access and distributed under the Creative
Commons Attribution (CC BY) license, which allows users to download, copy and build upon
published articles, as long as the author and publisher are properly credited, which ensures maximum
dissemination and a wider impact of our publications.
The book as a whole is distributed by MDPI under the terms and conditions of the Creative Commons
license CC BY-NC-ND.
Contents

About the Special Issue Editors . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . vii

Kazuki Maruta and Francisco Falcone

Massive MIMO Systems: Present and Future
Reprinted from: Electronics 2020, 9, 385, doi:10.3390/electronics9030385 . . . . . . . . . . . . . . . 1

Qian Lv, Jiamin Li, Pengcheng Zhu, Dongming Wang and Xiaohu You
Downlink Spectral Efﬁciency Analysis in Distributed Massive MIMO with Phase Noise
Reprinted from: Electronics 2018, 7, 317, doi:10.3390/electronics7110317 . . . . . . . . . . . . . . . 5

Jiamin Li, Qian Lv, Jing Yang, Pengcheng Zhu and Xiaohu You
Spectral and Energy Efﬁciency of Distributed Massive MIMO with Low-Resolution ADC
Reprinted from: Electronics 2018, 7, 391, doi:10.3390/electronics7120391 . . . . . . . . . . . . . . . 21

Seoyoung Yu and Jeong Woo Lee

Channel Sounding for Multi-User Massive MIMO in Distributed Antenna System Environment
Reprinted from: Electronics 2019, 8, 36, doi:10.3390/electronics8010036 . . . . . . . . . . . . . . . 37

Omar A. Saraereh, Imran Khan, Byung Moo Lee and Ashraf Tahat
Efﬁcient Pilot Decontamination Schemes in 5G Massive MIMO Systems
Reprinted from: Electronics 2019, 8, 55, doi:10.3390/electronics8010055 . . . . . . . . . . . . . . . 51

Ahmed S. Al-hubaishi, Nor Kamariah Noordin, Aduwati Sali

An Efﬁcient Pilot Assignment Scheme for Addressing Pilot Contamination in Multicell Massive
MIMO Systems
Reprinted from: Electronics 2019, 8, 372, doi:10.3390/electronics8040372 . . . . . . . . . . . . . . . 77

Wei Lu, Yongliang Wang, Xiaoqiao Wen, Shixin Peng and Liang Zhong
Downlink Channel Estimation in Massive Multiple-Input Multiple-Output with Correlated
Sparsity by Overcomplete Dictionary and Bayesian Inference
Reprinted from: Electronics 2019, 8, 473, doi:10.3390/electronics8050473 . . . . . . . . . . . . . . . 97

Hieu Trong Dao and Sunghwan Kim

Multiple-Symbol Non-Coherent Detection for Differential QAM Modulation in Uplink Massive
MIMO Systems
Reprinted from: Electronics 2019, 8, 693, doi:10.3390/electronics8060693 . . . . . . . . . . . . . . . 113

Jong-Gyu Ha, Jae-Hyun Ro and Hyoung-Kyu Song

Throughput Enhancement in Downlink MU-MIMO Using Multiple Dimensions
Reprinted from: Electronics 2019, 8, 758, doi:10.3390/electronics8070758 . . . . . . . . . . . . . . . 123

Qiuna Yan, Yu Sun and Dian-Wu Yue

LOS-Based Equal Gain Transmission and Combining in General Frequency-Selective Ricean
Massive MIMO Channels
Reprinted from: Electronics 2019, 8, 79, doi:10.3390/electronics8010079 . . . . . . . . . . . . . . . 139

Mostafa Hefnawi
Hybrid Beamforming for Millimeter-Wave Heterogeneous Networks
Reprinted from: Electronics 2019, 8, 133, doi:10.3390/electronics8020133 . . . . . . . . . . . . . . . 153

v
Hedi Khammari, Irfan Ahmed, Ghulam Bhatti and Masoud Alajmi
Spatio-Radio Resource Management and Hybrid Beamforming for Limited Feedback
Massive MIMO Systems
Reprinted from: Electronics 2019, 8, 1061, doi:10.3390/electronics8101061 . . . . . . . . . . . . . . 163

Fumiya Muramatsu, Kentaro Nishimori, Ryotaro Taniguchi, Takefumi Hiraguri and Tsutomu
Mitsui
Evaluation of Multi-Beam Massive MIMO Considering MAC Layer Using IEEE802.11ac
and FDD-LTE
Reprinted from: Electronics 2019, 8, 225, doi:10.3390/electronics8020225 . . . . . . . . . . . . . . . 189

Imran Khan, Mohammad Haseeb Zafar, Majid Ashraf and Sunghwan Kim
Computationally Efﬁcient Channel Estimation in 5G Massive Multiple-Input
Multiple-output Systems
Reprinted from: Electronics 2018, 7, 382, doi:10.3390/electronics7120382 . . . . . . . . . . . . . . . 201

Hui Feng, Xiaoqing Zhao, Zhengquan Li and Song Xing

A Novel Iterative Discrete Estimation Algorithm for Low-Complexity Signal Detection in
Uplink Massive MIMO Systems
Reprinted from: Electronics 2018, 8, 980, doi:10.3390/electronics8090980 . . . . . . . . . . . . . . . 213

Wenjin Wang, Yufei Huang, Li You, Jiayuan Xiong, Jiamin Li and Xiqi Gao
Energy Efﬁciency Optimization for Massive MIMO Non-Orthogonal Unicast and Multicast
Transmission with Statistical CSI
Reprinted from: Electronics 2019, 8, 857, doi:10.3390/electronics8080857 . . . . . . . . . . . . . . . 227

Mudasar Latif Memon, Navrati Saxena, Abhishek Roy and Dong Ryeol Shin
Artiﬁcial Intelligence-Based Discontinuous Reception for Energy Saving in 5G Networks
Reprinted from: Electronics 2019, 8, 778, doi:10.3390/electronics8070778 . . . . . . . . . . . . . . . 243

Oluwole J. Famoriji, Zhongxiang Zhang, Akinwale Fadamiro, Rabiu Zakariyya and Fujiang
Lin
Planar Array Diagnostic Tool for Millimeter-Wave Wireless Communication Systems
Reprinted from: Electronics 2018, 7, 383, doi:10.3390/electronics7120383 . . . . . . . . . . . . . . . 263

Naser Ojaroudi Parchin, Haleh Jahanbakhsh Basherlou, Mohammad Alibakhshikenari,

Yasser Ojaroudi Parchin, Yasir I. A. Al-Yasir, Raed A. Abd-Alhameed and Ernesto Limiti
Mobile-Phone Antenna Array with Diamond-Ring Slot Elements for 5G Massive
MIMO Systems
Reprinted from: Electronics 2019, 8, 521, doi:10.3390/electronics8050521 . . . . . . . . . . . . . . . 287

Mohammad Alibakhshikenari, Bal Singh Virdee, Chan H. See, Raed A. Abd-Alhameed,

Francisco Falcone and Ernesto Limiti
High-Isolation Leaky-Wave Array Antenna Based on CRLH-Metamaterial Implemented on
SIW with ±30o Frequency Beam-Scanning Capability at Millimetre-Waves
Reprinted from: Electronics 2019, 8, 642, doi:10.3390/electronics8060642 . . . . . . . . . . . . . . . 305

vi
About the Special Issue Editors
Kazuki Maruta completed his Bachelor of Engineering and Master of Engineering degrees, as
well as his PhD, at Kyushu University, Japan, in 2006, 2008 and 2016, respectively. From 2008 to 2017,
he worked on the research and development of interference compensation techniques for future
wireless communication systems at NTT Access Network Service Systems Laboratories. From 2017
to 2020, he was an Assistant Professor at the Graduate School of Engineering, Chiba University. He is
currently a Specially Appointed Associate Professor at the Academy for Super Smart Society, Tokyo
Institute of Technology. His research interests include MIMO, adaptive array signal processing,
channel estimation, medium access control protocols and moving networks. He is a member of IEEE
and IEICE. He won the IEICE Young Researcher’s Award in 2012, the IEICE Radio Communication
Systems (RCS) Active Researcher Award in 2014, the Asia-Paciﬁc Microwave Conference (APMC)
Prize in 2014, and the IEICE RCS Outstanding Researcher Award in 2018. He was also co-recipient of
the IEICE Best Paper Award in 2018.

Francisco Falcone completed his degree in telecommunication engineering and his PhD in
communication engineering at the Public University of Navarra (UPNA), Spain, in 1999 and 2005,
respectively. From 1999 to 2000, he was a Microwave Commissioning Engineer with Siemens-Italtel,
where he deployed microwave access systems. From 2000 to 2008, he was a Radio Access Engineer
at Telefónica Móviles, performing radio network planning and optimization tasks in mobile network
deployment. In 2009, he was a co-founding member and the Director of Tafco Metawireless, a spin-off
company from UPNA. He was an Assistant Lecturer at the Electrical and Electronic Engineering
Department, UPNA, from 2003 to 2009. In 2009, he became an Associate Professor with the EE
Department, and was the Head of Department from 2012 to 2018. From January 2018 to May 2018,
he was a Visiting Professor at the Kuwait College of Science and Technology, Kuwait. He is also
afﬁliated with the Institute for Smart Cities (ISC), UPNA, which hosts around 140 researchers, and is
the Head of the ICT section. He has more than 500 contributions in indexed international journals,
book chapters and conference contributions. His research interests are related to computational
electromagnetics applied to the analysis of complex electromagnetic scenarios, with a focus on the
analysis, design and implementation of heterogeneous wireless networks to enable context aware
environments. Prof. Falcone was a recipient of the CST 2003 and CST 2005 Best Paper Award, the
PhD Award from the Colegio Oﬁcial de Ingenieros de Telecomunicación (COIT) in 2006, the Doctoral
Award UPNA, 2010, the 1st Juan Gomez Peñalver Research Award from the Royal Academy of
Engineering of Spain in 2010, the XII Talgo Innovation Award 2012, the IEEE 2014 Best Paper Award,
2014, the ECSA-3 Best Paper Award, 2016, and the ECSA-4 Best Paper Award, 2017. He is a Senior
Member of the IEEE.

vii
electronics
Editorial
Massive MIMO Systems: Present and Future
Kazuki Maruta 1, * and Francisco Falcone 2
1 Graduate School of Engineering, Chiba University, 1–33 Yayoi-cho, Inage-ku, Chiba-shi 263-8522, Japan
2 Department of Electrical, Electronic and Communication Engineering & Institute for Smart Cities,
Public University of Navarre, 31006 Pamplona, Spain; [email protected]
* Correspondence: [email protected]

Received: 17 February 2020; Accepted: 21 February 2020; Published: 26 February 2020

1. Introduction
We are going to see the first decade since the fundamental concept of massive multiple-input
multiple-output (MIMO) (also called large-scale MIMO) has emerged [1]. Massive MIMO is expected
to be one of the most promising technologies towards the fifth generation mobile communications (5G)
and beyond. Implementation [2,3] and trials [4,5] are actively proceeded. Especially, massive array
beamforming has a good match to millimeter wave communication [6] which suffers from link
budget shortfall due to its high frequency. Further, thanks to its excessive degree of freedom
(DoF), massive MIMO has unlimited potentiality to further enhance system capabilities [7] and still
expands various research topics with depth. It should be further discussed and believed to break
limitations in wireless communications such as spectral and energy efficiencies for better support of
continuously increasing mobile data traffic, as well as terminals driven by Internet of things (IoT).
The key contribution of this special issue is to provide readers with new insights and facilitate plentiful
discussions in this field.

2. The Present Issue

This special issue consists of nineteen papers covering wide and important topics in the field
of massive MIMO systems, including both fundamental regions such as computation complexity,
energy efficiency, pilot contamination, channel estimation, antenna design, non-orthogonal multiple
access (NOMA) and millimeter-wave beamforming, as well as emerging topic such as machine learning
incorporation. From the system model aspect, variety of scenario have also been covered such as
single/multi-cell, distributed antennas, heterogeneous network, IEEE802.11ac and long term evolution
(LTE) standards.
Distributed antenna systems (DAS) or base station (BS) cooperation have actively investigated
since it can provide array diversity or multiplexing gain due to low spatial correlation of distributed
antennas. Its extension to massive MIMO was analyzed in terms of spectral and energy efficiencies
with considering hardware impairment such as phase noise [8] and analog-to-digital converter (ADC)
resolution [9]. In the distributed massive MIMO structure, sounding reference signal (SRS) design and
channel estimation scheme were proposed in order to mitigate the pilot contamination impact [10].
Work in [11] proposed a path loss based pilot allocation strategy and pseudo-random code
pilot design. In [12], a modified heuristic pilot assignment algorithm was proposed. Its optimization
criteria is to maximize the minimum uplink signal-to-interference plus noise power ratio (SINR).
Efficient channel state information (CSI) estimation scheme was proposed in [13]. It exploits prior CSI
of the previous timeslot having temporal correlation in the angular domain. Differential modulation
unnecessitates channel estimation and is preferable especially in massive MIMO systems. In [14],
incoherent detection for differential modulation was expanded to multiple symbols in the single cell
scenario. For further capacity enhancement, multiplexing in the power domain, i.e., NOMA enabled
by successive interference cancellation (SIC), was introduced [15].

Electronics 2020, 9, 385; doi:10.3390/electronics9030385 1 www.mdpi.com/journal/electronics

Electronics 2020, 9, 385

In millimeter-wave communication, almost line-of-sight (LoS) channel or Ricean fading channel

is expected. Exploiting CSI of the LoS component, spectral efficiency of equal gain transmission
and combining (EGT/EGC) was analyzed in Ricean fading frequency selective fading channel with
cooperative relaying scenarios [16]. Such relaying approach is also effective in heterogeneous network
where small cell BSs play a role of relay the macro cell BS and user terminals. Reference [17] proposed
eigenvector decomposition based hybrid beamforming in the above scenario.
In the practical viewpoint, limited statistical CSI feedback constraint was considered and machine
learning based user grouping aided hybrid beamforming was proposed [18]. Further, CSI estimation
elimination approach, which applies a blind adaptive array signal processing, has been proposed
and its practical performance was evaluated with considering medium access control (MAC) layer
overhead of IEEE802.11ac and frequency division duplex (FDD) based LTE standards [19].
Computation complexity for pre/post coding is also significant problem on massive MIMO
systems. Suppose uplink transmission, iteration-based new detection algorithms were proposed.
One is the extension of linear minimum mean squared error (MMSE) post coding and log-likelihood
ratio (LLR) calculation [20] and another is based on the maximum likelihood (ML) detection and
iterative discrete estimation approaches [21].
Focusing on energy efficiency, reference [22] proposed simplified beamforming as well as power
allocation strategies for the scenario wherein unicast and multicast users are non-orthogonally
multiplexed. Discontinuous reception can also contribute to improve the energy efficiency.
Authors in [23] introduced an artificial intelligence (AI) approach, i.e., recurrent neural network
(RNN), to adapt sleep cycles of user terminals.
In realization of large-scale antenna arrays, we should pay attention to antenna manufacturing.
Reference [24] developed Bayesian compressive sensing based planar array diagnostic tool for efficient
and reliable testing. New antenna structures were designed; dual-polarized diamond-ring slot antenna
array exhibiting wide bandwidth [25], and leaky-wave antenna array incorporating metamaterial
shield [26] which can suppress the mutual coupling.

3. Future
Now discussions towards 6G has started. Massive MIMO is still expected as a promising
contributor for 6G [27–29], e.g., referred as ‘ultra massive MIMO’. Its potentiality will be truly realized
through relentless effort on R&D including the advance of hardware performance. Variety of massive
MIMO technologies, which were widely addressed in this special issue, could be one of the most
important solutions to bring a breakthrough towards beyond 5G or 6G.

Author Contributions: K.M. and F.F. worked together in the whole editorial process of the special issue, ‘Massive
MIMO Systems’ published by journal Electronics. K.M. drafted this editorial summary. K.M. and F.F. reviewed,
edited and finalized the manuscript. All authors have read and agree to the published version of the manuscript.
Acknowledgments: First of all we would like to thank all researchers who submitted articles to this special issue
for their excellent contributions. We are also grateful to all reviewers who contributed evaluations of scientific
merits and quality of the manuscripts and provided countless valuable suggestions to improve their quality and
the overall value for the scientific community. We would like to acknowledge the editorial board of Electronics
journal, who invited us to guest edit this special issue. We are also grateful to the Electronics Editorial Office staff
who worked thoroughly to maintain the rigorous peer-review schedule and timely publication.
Conflicts of Interest: The authors declare no conflicts of interest.

2
Electronics 2020, 9, 385

Abbreviations
The following abbreviations are used in this manuscript:

MIMO Multiple-Input Multiple-Output

5G Fifth generation mobile communications
DoF Degree of freedom
IoT Internet of things
NOMA Non-orthogonal multiple access
LTE Long term evolution
DAS Distributed antenna systems
BS Base station
ADC Analog-to-digital converter
SRS Sounding reference signal
SINR Signal-to-interference plus noise power ratio
CSI Channel state information
SIC Successive interference cancellation
LoS Line-of-sight
EGT Equal gain transmission
EGC Equal gain combining
MAC Medium access control
FDD Frequency division duplex
MMSE Minimum mean squared error
LLR Log-likelihood ratio
ML Maximum likelihood
AI Artiﬁcial intelligence
RNN Recurrent neural network

References
1. Marzetta, T.L. Noncooperative Cellular Wireless with Unlimited Numbers of Base Station Antennas.
IEEE Trans. Wirel. Commun. 2010, 9, 3590–3600. [CrossRef]
2. Malkowsky, S.; Vieira, J.; Liu, L.; Harris, P.; Nieman, K.; Kundargi, N.; Wong, I.C.; Tufvesson, F.; Öwall, V.;
Edfors, O. The World’s First Real-Time Testbed for Massive MIMO: Design, Implementation, and Validation.
IEEE Access 2017, 5, 9073–9088. [CrossRef]
3. Wirth, T.; Mehlhose, M.; Thiele, L.; Haustein, T. Proof-of-concept of flexible massive MIMO beamforming
at 2.4 GHz. In Proceedings of the 2017 51st Asilomar Conference on Signals, Systems, and Computers,
Pacific Grove, CA, USA, 29 October–1 November 2017; pp. 1061–1065. [CrossRef]
4. Obara, T.; Okuyama, T.; Inoue, Y.; Aoki, Y.; Suyama, S.; Lee, J.; Okumura, Y. Experimental Trial of 5G Super
Wideband Wireless Systems Using Massive MIMO Beamforming and Beam Tracking Control in 28GHz
Band. IEICE Trans. Commun. 2017, 100, 1256–1268. [CrossRef]
5. Sakai, M.; Kamohara, K.; Iura, H.; Nishimoto, H.; Ishioka, K.; Murata, Y.; Yamamoto, M.; Okazaki, A.;
Nonaka, N.; Suyama, S.; et al. Experimental Field Trials on MU-MIMO Transmission for High SHF
Wide-band Massive MIMO in 5G. IEEE Trans. Wirel. Commun. 2020. [CrossRef]
6. Wu, W.; Liu, D. Hybrid BD-GMD Precoding for Multiuser Millimeter-Wave Massive MIMO Systems.
IEICE Trans. Commun. 2018. [CrossRef]
7. Maruta, K.; Ahn, C.J. Uplink Interference Suppression by Semi-Blind Adaptive Array With Decision Feedback
Channel Estimation on Multicell Massive MIMO Systems. IEEE Trans. Commun. 2018, 66, 6123–6134.
[CrossRef]
8. Lv, Q.; Li, J.; Zhu, P.; Wang, D.; You, X. Downlink Spectral Efficiency Analysis in Distributed Massive MIMO
with Phase Noise. Electronics 2018, 7, 317. [CrossRef]
9. Li, J.; Lv, Q.; Yang, J.; Zhu, P.; You, X. Spectral and Energy Efficiency of Distributed Massive MIMO with
Low-Resolution ADC. Electronics 2018, 7, 391. [CrossRef]
10. Yu, S.; Lee, J.W. Channel Sounding for Multi-User Massive MIMO in Distributed Antenna System
Environment. Electronics 2019, 8, 36. [CrossRef]

3
Electronics 2020, 9, 385

11. Saraereh, O.A.; Khan, I.; Lee, B.M.; Tahat, A. Efficient Pilot Decontamination Schemes in 5G Massive MIMO
Systems. Electronics 2019, 8, 55. [CrossRef]
12. Al-hubaishi, A.S.; Noordin, N.; Sali, A.; Subramaniam, S.; Mohammed Mansoor, A. An Efficient Pilot
Assignment Scheme for Addressing Pilot Contamination in Multicell Massive MIMO Systems. Electronics
2019, 8, 372. [CrossRef]
13. Lu, W.; Wang, Y.; Wen, X.; Peng, S.; Zhong, L. Downlink Channel Estimation in Massive Multiple-Input
Multiple-Output with Correlated Sparsity by Overcomplete Dictionary and Bayesian Inference. Electronics
2019, 8, 473. [CrossRef]
14. Dao, H.T.; Kim, S. Multiple-Symbol Non-Coherent Detection for Differential QAM Modulation in Uplink
Massive MIMO Systems. Electronics 2019, 8, 693. [CrossRef]
15. Ha, J.G.; Ro, J.H.; Song, H.K. Throughput Enhancement in Downlink MU-MIMO Using Multiple Dimensions.
Electronics 2019, 8, 758. [CrossRef]
16. Yan, Q.; Sun, Y.; Yue, D.W. LOS-Based Equal Gain Transmission and Combining in General
Frequency-Selective Ricean Massive MIMO Channels. Electronics 2019, 8, 79. [CrossRef]
17. Hefnawi, M. Hybrid Beamforming for Millimeter-Wave Heterogeneous Networks. Electronics 2019, 8, 133.
[CrossRef]
18. Khammari, H.; Ahmed, I.; Bhatti, G.; Alajmi, M. Spatio-Radio Resource Management and Hybrid
Beamforming for Limited Feedback Massive MIMO Systems. Electronics 2019, 8, 1061. [CrossRef]
19. Muramatsu, F.; Nishimori, K.; Taniguchi, R.; Hiraguri, T.; Mitsui, T. Evaluation of Multi-Beam Massive
MIMO Considering MAC Layer Using IEEE802.11ac and FDD-LTE. Electronics 2019, 8, 225. [CrossRef]
20. Khan, I.; Zafar, M.; Ashraf, M.; Kim, S. Computationally Efficient Channel Estimation in 5G Massive
Multiple-Input Multiple-Output Systems. Electronics 2018, 7, 382. [CrossRef]
21. Feng, H.; Zhao, X.; Li, Z.; Xing, S. A Novel Iterative Discrete Estimation Algorithm for Low-Complexity
Signal Detection in Uplink Massive MIMO Systems. Electronics 2019, 8, 980. [CrossRef]
22. Wang, W.; Huang, Y.; You, L.; Xiong, J.; Li, J.; Gao, X. Energy Efficiency Optimization for Massive MIMO
Non-Orthogonal Unicast and Multicast Transmission with Statistical CSI. Electronics 2019, 8, 857. [CrossRef]
23. Memon, M.L.; Maheshwari, M.K.; Saxena, N.; Roy, A.; Shin, D.R. Artificial Intelligence-Based Discontinuous
Reception for Energy Saving in 5G Networks. Electronics 2019, 8, 778. [CrossRef]
24. Famoriji, O.; Zhang, Z.; Fadamiro, A.; Zakariyya, R.; Lin, F. Planar Array Diagnostic Tool for Millimeter-Wave
Wireless Communication Systems. Electronics 2018, 7, 383. [CrossRef]
25. Ojaroudi Parchin, N.; Jahanbakhsh Basherlou, H.; Alibakhshikenari, M.; Ojaroudi Parchin, Y.; Al-Yasir, Y.I.A.;
Abd-Alhameed, R.A.; Limiti, E. Mobile-Phone Antenna Array with Diamond-Ring Slot Elements for 5G
Massive MIMO Systems. Electronics 2019, 8, 521. [CrossRef]
26. Alibakhshikenari, M.; Virdee, B.S.; See, C.H.; Abd-Alhameed, R.A.; Falcone, F.; Limiti, E. High-Isolation
Leaky-Wave Array Antenna Based on CRLH-Metamaterial Implemented on SIW with ±30◦ Frequency
Beam-Scanning Capability at Millimetre-Waves. Electronics 2019, 8, 642. [CrossRef]
27. Yang, P.; Xiao, Y.; Xiao, M.; Li, S. 6G Wireless Communications: Vision and Potential Techniques. IEEE Netw.
2019, 33, 70–75. [CrossRef]
28. Bi, Q. Ten Trends in the Cellular Industry and an Outlook on 6G. IEEE Commun. Mag. 2019, 57, 31–36.
29. Zhang, Z.; Xiao, Y.; Ma, Z.; Xiao, M.; Ding, Z.; Lei, X.; Karagiannidis, G.K.; Fan, P. 6G Wireless Networks:
Vision, Requirements, Architecture, and Key Technologies. IEEE Veh. Technol. Mag. 2019, 14, 28–41.

c 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access
article distributed under the terms and conditions of the Creative Commons Attribution
(CC BY) license (http://creativecommons.org/licenses/by/4.0/).

4
electronics
Article
Downlink Spectral Efﬁciency Analysis in Distributed
Massive MIMO with Phase Noise
Qian Lv, Jiamin Li *, Pengcheng Zhu, Dongming Wang and Xiaohu You
National Mobile Communications Research Laboratory, Southeast University, Nanjing 210096, China;
[email protected] (Q.L.); [email protected] (P.Z.); [email protected] (D.W.); [email protected] (X.Y.)
* Correspondence: [email protected]; Tel.: +86-025-5209-1635

Received: 25 October 2018 ; Accepted:10 November 2018 ; Published: 12 November 2018

Abstract: To achieve the advantages provided by massive multiple-input multiple-output (MIMO),

a large number of antennas need to be deployed at the base station. However, for the reason of cost,
inexpensive hardwares are employed in the realistic scenario, which makes the system distorted
by hardware impairments. Hence, in this paper, we analyze the downlink spectral efficiency in
distributed massive MIMO with phase noise and amplified thermal noise. We provide an effective
channel model considering large-scale fading, small-scale fast fading and phase noise. Based on the
model, the estimated channel state information (CSI) is obtained during the pilot phase. Under the
imperfect CSI, the closed-form expressions of downlink achievable rates with maximum ratio
transmission (MRT) and zero-forcing (ZF) precoders in distributed massive MIMO are derived.
Furthermore, we also give the user ultimate achievable rates when the number of antennas tends to
infinity with both precoders. Based on these expressions, we analyze the impacts of phase noise on
the spectral efficiency. It can be concluded that the same limit rate is achieved with both precoders
when phase noise is present, and phase noise limits the spectral efficiency. Numerical results show
that ZF outdoes MRT precoder in spectral efficiency and ZF precoder is more affected by phase noise.

Keywords: distributed massive MIMO; phase noise; ampliﬁed thermal noise; spectral efﬁciency

1. Introduction
Massive multiple-input multiple-output (MIMO) is becoming a promising technology to provide
significant gains [1–6]. Since it was first proposed, massive MIMO has been studied extensively.
The main feature of massive MIMO is that hundreds (or even thousands) of antennas are employed
at each base station, simultaneously serving tens of users in the same time-frequency resource,
which offers big advantages compared to conventional MIMO. Firstly, it can bring unprecedented
spatial degrees-of-freedom, which enables the improvement of spectral efficiency and energy efficiency
even with simple linear receivers or precoders [7]. In addition, user channels in massive MIMO systems
are nearly orthogonal and fast fading, intra-cell interference can be averaged out. Massive MIMO can
be divided into two categories: one is co-located massive MIMO and the other is distributed massive
MIMO [8]. The latter has promising advantages of increasing energy efficiency, system coverage
and spectral efficiency, which results from the increase in macro-diversity gain and the reduction in
access distance [9–15]. Considering these advantages, we analyze the spectral efficiency of distributed
massive MIMO in this paper. Notably, due to the different access distance between each user and
all remote antenna units (RAUs), the channel vectors are non-isotropic, which makes the analysis of
performance in distributed massive MIMO more difficult and more complex.
In practical communication systems, inevitable hardware impairments occur and cannot be
eliminated even after applying calibration and compensation techniques [16,17]. These impairments
can be divided into two categories: multiplicative distortion and additive distortion. Phase noise

Electronics 2018, 7, 317; doi:10.3390/electronics7110317 5 www.mdpi.com/journal/electronics

Electronics 2018, 7, 317

introduced by the local oscillators of transceivers is the multiplicative distortion. It will cause random
rotations of the transmitted data symbols, which degrades the system performance. Furthermore,
phase noise makes the estimated channel state information (CSI) more inaccurate and it introduces
a phenomenon called channel aging which means the estimated CSI obtained during pilot phase
is different from that used for downlink transmission. It is pointed out in [8] that the deployment
cost and circuit power consumption of massive MIMO scale linearly with the number of antennas.
Therefore inexpensive but hardware-constraint hardware may be deployed for the reason of cost,
which makes the hardware impairments more severe in massive MIMO.
Analyzing the spectral efficiency is a fundamental method to evaluate the impacts of phase
noise. The impacts of phase noise for uplink transmission have been studied in [18–21] and for
downlink transmission were investigated in [22–24]. The impacts of phase noise on physical layer
security for downlink massive MIMO were investigate in [22]. The achievable rate was derived in [23]
considering the frequency-selective multipath fading channel. The capacity of downlink transmission
with linear precoders was analyzed in [24] but it assumed that the number of antennas and users was
asymptotically large and it only considered a co-located MIMO system.
Herein, considering a distributed massive MIMO with phase noise and amplified thermal noise,
we analyzed the downlink spectral efficiency for any number of antennas and users. Followings are
the key contributions of this paper:
1. In distributed massive MIMO, the channel vectors are non-isotropic and the correlation between
channel vectors and intended beams for each user are destroyed by phase noise. Hence, we first
obtain the distributions of the desired signal and interference powers, which is challenging
and complex.
2. Considering both zero-forcing (ZF) and maximum ratio transmission (MRT) precoders, we obtain
the closed-form expressions of the downlink ergodic achievable rates with imperfect CSI and
hardware impairments in distributed massive MIMO. These closed-form expressions are accurate
for any number of antennas and users in both distributed massive MIMO and co-locate massive
MIMO. Furthermore, they are derived under imperfect CSI which is more realistic, and they
enable the analysis of performance degradation caused by phase noise.
3. The ultimate achievable rate per user is obtained when the number of antennas per remote
antenna unit (RAU) goes infinity. It can be used to investigate the asymptotic performance of
distributed massive MIMO with hardware impairments.
4. The theoretical results are verified by Monte Carlo simulations, and we have a deep insight into
the impacts of phase noise.
The rest of this paper is organized as follows. System model including system configuration,
a model describing phase noise and an effective channel model is introduced in Section 2. We obtain
the estimated CSI during the uplink pilot training phase and analyze the spectral efficiency with linear
precoders in Section 3. Numerical results are given in Section 4. A conclusion is provided in Section 5.
Notation: Column vectors x and matrices X are denoted by bold letters in lower case and in
upper case, respectively. I N is a N × N identity matrix. (·)H and (·)T are the conjugate transpose and
transpose operator, respectively. Scalars x are denoted by italic letters. | x | represents the absolute
value of x and X denotes the spectral norm of X. E[·] and var(·) represent the expectation operator
and variance operator, respectively. CN (0, σ2 ) represents circularly symmetric complex Gaussian
distribution with mean zero and variance σ2 . Γ(k, θ ) means Gamma distribution with shape parameter
k and scale parameter θ. Similarly, Nakagami(m, Ω) means Nakagami distribution with shape
parameter m and controlling spread parameter Ω.

2. System Model
Considering a distributed massive MIMO system, we ﬁrst describe the system conﬁguration and
give the conventional channel model. Next, we present a model describing phase noise and give an
effective channel model incorporating the impacts of phase noise.

6
Electronics 2018, 7, 317

We consider the downlink transmission of a single-cell multi-user distributed massive MIMO

system comprising M RAUs and K single-antenna users as in Figure 1. Each RAU is equipped with an
array of N antennas. All users and RAUs are randomly distributed in the cell.

$QWHQQD

8VHUN
5$8P
$QWHQQD1

H MMN W

8VHUN

H MIP W

5$8P

Figure 1. System Model.

Frequency-ﬂat fading channels are assumed and the system runs in time-division duplex (TDD)
protocol. The channel vector between all RAUs and the k-th user is given by

Δ
g̃k = g̃k1 · · · g̃kMN = Λ1/2
k hk , (1)
Δ −α
where Λk = E g̃k g̃Hk = diag(λ1,k · · · λ M,k ) ⊗ I N is the covariance matrix, λm,k = cdm,k denotes
the path loss between the m-th RAU and the k-th user, dm,k is the corresponding distance, α is the
path loss exponent, c is the median of the mean path gain at a reference distance dm,k = 1 km,
and hk ∼ CN (0, I MN ) is the small-scale fast fading vector.
In this paper, we consider a more realistic scenario where the antennas deployed at each
RAU are inexpensive and hardware-constrained. Speciﬁcally, each antenna experiences phase noise
which distorts communication. The phase noise means the multiplicative phase drift in the signal,
which comes from the local oscillators (LOs) of the RAUs and users. We assume that the LOs are
free-running without a phase-locked loop (PLL), and then the phase noise is commonly modeled as a
discrete-time independent Wiener process [8,25]. Mathematically, the phase noises at the LOs of the
n-th antenna and the k-th user are denoted as

φn (t) ∼ N φn (t − 1) , σφ,n 2
, (2)

ϕk (t) ∼ N ϕk (t − 1) , σϕ,k 2
, (3)

which equal the previous realization φn (t − 1) and ϕk (t − 1) plus an independent zero-mean Gaussian
random increment with variances σφ,n 2 and σ2 . The variances are dependent on the carrier frequency
ϕ,k
and symbol time [25].
The phase noise can be independent or correlated between antennas of each RAU. In our analysis,
we have assumed that the phase noise correlated between antennas of one RAU and independent
among RAUs. Then by expressing the total phase noise as a multiplicative factor, we can rewrite the
channel vector as
gk (t) =Θk (t) g̃k , (4)
1
Δ
where Θk (t) = diag e jθk (t) , · · · , e jθk (t) = e jϕk (t) Φ (t) ∈ C MN × MN is the total phase noise, wherein
MN

Δ
Φ (t) = diag(e jφ1 (t) , · · · , e jφM (t) ) ⊗ I N is the phase noise induced by all RAUs, and similarly, e jϕk (t)
corresponds to the phase drift pruduced by the k-th user. Notably, because of the presence of phase
noise, the effective channel becomes time-dependent.

7
Electronics 2018, 7, 317

2 = σ2 = 0, ∀ n, k.
Remark 1. The conventional channel model without phase noise is obtained when σφ,n ϕ,k

3. Downlink Spectral Efﬁciency Analysis

In this section, ﬁrstly, based on the effective channel model given above, we assume pilot sequence
aided transmission is employed and give the channel estimation. Next, since the channel vectors
are non-isotropic in distributed massive MIMO and the correlation between channel vectors and
intended beams for each user is destroyed by phase noise, we give the approximated distribution
of desired signal and interference powers. After that, we derive the closed-form expressions of the
ergodic achievable downlink rates with both MRT and ZF precoders.

3.1. Channel Estimation

As mentioned before, the transmission protocol is assumed as TDD. Each coherence block
occupying T channel uses is split into two parts: one for uplink pilot symbols and the other for
downlink data symbols. In order to guarantee that the pilot symbols of K users are orthogonal to
each other, it’s necessary to allocate τ ≥ K symbols for pilot transmission. Then the remaining T − τ
channel uses are used for downlink data transmission.
Δ
During the pilot training phase, the pilot sequence xk = [ xk (1), · · · , xk (τ )]T is assigned to user k.
Incorporating the hardware impairments, the received pilot vector yp at the base station at time
t ∈ [0, τ ] is given as
K
yp ( t ) = ∑ gk (t) xk (t) + nBS (t), (5)
k =1

where nBS (t) ∼ CN 0, ξ BS I MN is the ampliﬁed thermal noise at time slot t, and its variance ξ BS is
larger than the variance σ2 of thermal noise. This is because of the effects of low noise ampliﬁers,
mixers and other components.
T
Δ
Let Ψ = yTp (1), · · · , yTp (τ ) ∈ CτMN ×1 . Motivated by [8,26], the Linear Minimum Mean Square
Error (LMMSE) estimation of the channel of the k-th user obtained by pilot training is given by

ĝk (t) = Λk Hk (t)Σ−1 Ψ, (6)

where

Hk (t) = [Hk,1 (t), · · · , Hk,τ (t)] ,

Δ K
Σ= ∑ j=1 B j + ξ BS IτMN ,
Hk,i (t) = xk∗ (i )Dk,i (t),
σ2 + σ2 2 + σ2
σφ,M
φ,1 ϕ,k ϕ,k
Dk,i (t) = diag e− 2 | t −i |
, · · · , e− 2 | t −i |
⊗ IN ,

σ2 + σ2 2 + σ2
Δ φ,1 ϕ,j
σφ,M
ϕ,j
Bj u,v
= Λ j x j (u) x ∗j (v)diag e− 2 |u−v|
, · · · , e− 2 |u−v|
⊗ IN .

The pilot sequences can be designed in different ways. Without loss of generation, in this paper
we assume that the number of pilot symbols is equal to that of users, i.e., τ = K. More speciﬁcally,
Δ
we assume that the set of orthogonal pilot sequences XP = [x1 , · · · , xK ] is a diagonal matrix and each
√
element of it is ρp , wherein ρp is the average transmit power of pilot symbols. This is equivalent to
the assumption made in [18,20].

8
Electronics 2018, 7, 317

Under these assumptions, we give a deﬁnition of

σ2 +σφ,m
2
|t−k| √ ρ
ϕ,k
e− 2 p λm,k
β m,k (t) = , (7)
ρp λm,k +ξ BS

then we can rewrite the LMMSE estimation ĝk (t) in (6) as

ĝk (t) = Λk Hk (t)Σ−1/2 ĥk (8)

T
= β 1,k (t)ĥT1,k , · · · , β M,k (t)ĥTM,k ,

where β m,k (t) is the equivalent large-scale fading part from user k to RAU m and
ĥk = [ĥT1,k , · · · , ĥTM,k ]T = Σ−1/2 Ψ ∼ CN (0, I MN ) represents the equivalent small-scale fast fading part.
Because of the orthogonality principle of LMMSE estimation theory, the channel vector gk (t) can
be decomposed as
gk (t) = ĝk (t) + ek (t), (9)

where ek (t) is the uncorrelated and statistically independent of ĝk (t) estimation error.
During the pilot transmission phase, we obtain the estimated channel showing in (8). In our
analysis, it is assumed that the beamforming vector is designed by using the estimated CSI once
during the pilot transmission phase and then is applied for the entire duration of the downlink
transmission phase.

3.2. Downlink Signal Model

For downlink transmission, the received signal of user k at time t ∈ [τ + 1, T ] is given as
√ ∗
rk (t) = k Θk (t) x + n
ρdl g̃H UE
(t) , (10)

where ρdl is the downlink transmission power, nUE (t) ∼ CN 0, ξ UE is the ampliﬁed thermal noise of
users at time slot t, ξ UE is the variance of the noise, and x ∈ C MN ×1 is the signal vector transmitted by
all M RAUs. Speciﬁcally, x can be given by

K
x= ∑ l =1 w l s l , (11)

where sl ∼ CN (0, 1) is the transmitted data symbol assigned for user l, and wl is the beamforming
vector designed at time slot τ. MRT and ZF precoders are considered in our analysis. Mathematically,
these two linear precoders can be deﬁned as
⎧
⎨ ĝl (τ )
ĝl (τ )
MRT,
wl = al (τ ) (12)
⎩ ZF,
al (τ )

−1
where al (τ ) is the l-th column of Ĝ (τ ) ĜH (τ ) Ĝ (τ ) , wherein Ĝ (τ ) = [ĝ1 (τ ), · · · , ĝK (τ )].
Considering (4), we can rewrite (10) as
√
rk (t) = k ( τ ) Θ̃k ( t ) x + n
ρdl gH UE
(t) , (13)

where
Θ̃k (t) = diag e− j(θk (t)−θk (τ )) , · · · , e− j(θk (t)−θk (τ )) .
Δ 1 1 MN MN
(14)

9
Electronics 2018, 7, 317

It is assumed that users have the statistical properties of the channel and they don’t carry out
channel estimation. So only statical CSI can be used by downlink users to detect the signal. Motivated
by [27], we rewrite the received data as
√
rk (t) = ρdl E gH
k ( τ ) Θ̃k ( t ) wk sk + n , (15)

where

√ √ K
n = ρdl gH
k ( τ ) Θ̃k ( t ) wk − E gk ( τ ) Θ̃k ( t ) wk
H
sk + ρdl ∑ gHk (τ ) Θ̃k (t) wi si + nUE (t) .
i =k

Suppose E gH k ( τ ) Θ̃k ( t ) wk sk is the only signal needed at user k, and treating n as unrelated
Gaussian distributed additive noise [28,29], the achievable downlink rate of user k is denoted as
2
ρdl E gH
k ( τ ) Θ̃k ( t ) wk

Rk = log2 1+ , (16)
A(t) + B(t) + ξ UE

where

A(t) = ρdl var gHk ( τ ) Θ̃k ( t ) wk ,
2
K
B(t) = ∑i =k ρdl E gH k ( τ ) Θ̃k ( t ) wi .

3.3. Downlink Achievable Rates

From (16) we can ﬁnd that the correlation between gk and wk are destroyed by phase noise and
the powers of non-isotropic channel vectors projected onto the precoder subspace are necessary to
obtain the closed-form expressions. Hence we ﬁrst present some preliminary lemmas which help us to
obtain the approximated and isotropic results.

Lemma 1 ([28]). For an isotropic random vector x ∈ C N ×1 whose elements are independent and all distributed
as CN (0, σ2 ), then the distribution of xH x is Γ( N, σ2 ).

The strength of the estimated channel from user k to all RAUs is

M
k ( t ) ĝk ( t ) =
ĝH ∑m=1 β2m,k (t)ĥHm,k ĥm,k , (17)

According to Lemma 1, β2m,k (t)ĥH

m,k ĥm,k is distributed as Γ ( N, β m,k ( t )). Hence (17) is the sum of M
2

non-identically distributed but independent items. To obtain its distribution, Lemma 2 stated bellow
can be used.

Lemma 2 ([15]). If { xi } are a set of random variables and independent of each other, each term is distributed as
Γ(χi , θi ). Then the distribution of the sum ∑i xi can be approximated as ∑i Xi ∼ Γ(χ, θ ) wherein

( ∑ i χ i θ i )2 ∑ χθ2
χ= , θ= i i i . (18)
χ
∑i i i θ 2 ∑i χi θi

10
Electronics 2018, 7, 317

k ( t ) ĝk ( t ) can be approximated as Γ ( χk ( t ), θk ( t )), wherein

Remark 2. From Lemma 2, the distribution of ĝH
2
=1 β m,k ( t )
M 2
∑m
χk (t) = N , (19)
=1 β m,k ( t )
M 4
∑m

=1 β m,k ( t )
M 4
∑m
θk (t) = . (20)
=1 β m,k ( t )
M 2
∑m

k ( t ) ek ( t ) as Γ ( χe ( t ), θe ( t )), wherein
Similarly, we can also give the distribution of eH
2
=1 ηm,k ( t )
M 2
∑m
χe ( t ) = N , (21)
=1 ηm,k ( t )
M 4
∑m

=1 ηm,k ( t )
M 4
∑m
θe ( t ) = , (22)
=1 ηm,k ( t )
M 2
∑m

2 (t) = λ
where ηm,k m,k − β m,k ( t ).
2

Based on the lemmas and analysis above, we give the following lemma about the projection
principle of non-isotropic vectors.

Lemma 3 ([28]). When we project an MN-dimensional non-isotropic estimated channel vector ĝk ∈ C MN ×1
onto a p-dimensional subspace, we can give the approximated distribution of the projection power as
Γ( pχk /( MN ), θk ).

Remark 3. The dimension p can be given by p = MN with MRT precoder and p = MN − K + 1 with ZF
precoder, respectively, and for any independent beam, we can have p = 1 [30,31].

When MRT and ZF precoders are employed, based on the analysis above, we can give the
distribution of the signal power at user k and the distribution of the interference power at user k
2
H
ĝ (τ ) ĝk (τ ) ∼ Γ (χk (τ ), θk (τ )) , (23)
k ĝk (τ )
2
H
ĝ (τ ) ak (τ ) ∼ Γ MN −K +1 χk (τ ), θk (τ ) , (24)
k ak (τ ) MN
2
H 1
ek (τ )wi ∼ Γ χe ( τ ), θe ( τ ) . (25)
MN

Notably, wi in (25) can be either MRT precoder or ZF precoder and the equation still holds when
i = k, due to the independence of ek (t) and wi .
2
Based on the analysis above, we can give the approximated distribution of ĝH (τ ) Θ̃k (t) wk as
k
2
H
ĝ (τ ) Θ̃k (t) ĝk (τ ) ∼ Γ (χk (τ ), θk (τ )) , (26)
k ĝk (τ )
2
H
ĝ (τ ) Θ̃k (t) ak (t) ∼ Γ MN −K +1 χk (t), θk (t) . (27)
k ak (t) MN

Figure 2 veriﬁes the accuracy of the approximation in (26) and (27). It illustrates the cumulative
2
distribution function (CDF) curves of ĝH
k ( τ ) Θ̃k ( t ) wk with MRT precoder. The phase noise variance
is set as σφ2 = σϕ2 = 10−2 . It can be seen that although the random variable Θ̃k (t) will destroy
the correlation of ĝk and wk , the approximation is exactly accurate when the variance of the phase
noise is 10−2 , and it will be more accurate when the variance of the phase noise is lower 10−2 .

11
Electronics 2018, 7, 317

For ZF precoder, we can get the same conclusion. Meanwhile, the phase noise variance is generally
σφ2 = σϕ2 = 1.58 × 10−4 [8,20,26]. Hence, it’s reasonable to use (26) and (27) to analyze the downlink
spectral efﬁciency.

Empirical CDF
1

0.9

0.8
Cumulative distribution function

0.7

0.6

0.5

0.4

0.3

0.2 simulation,M=5,N=1
Gamma approxiamtion,M=5,N=1
simulation,M=1,N=10
0.1
Gamma approxiamtion,M=1,N=10

0
0 1 2 3 4 5 6 7 8
10-10

Figure 2. Cumulative distribution function of signal power with MRT precoder under different M
and N.

Based on the lemmas above, the spectral efﬁciency for downlink transmission with both MRT and
ZF precoders under hardware impairments is analyzed. The theorems and corollary stated below give
the closed-form expressions of the downlink achievable rates and system asymptotic performance.

Theorem 1. When MRT precoder is used, the closed-form expression of the downlink achievable rate under
hardware impairments is given by

Dmrt (t)
Rmrt (t) = , (28)
Amrt (t) + Bmrt (t) + ξ UE /ρdl

where
⎛ ⎞2
Γ χk (t) + 12
Dmrt (t) = ⎝ ⎠ θ ( t ),
k
Γ χk (t)
2
− σϕ,k2 + σ2
φ,m | t − τ | 2
M
∑m =1 e β m,k ( t )
χk (t) = N ,
− 2 σ 2 + σ2 | t − τ |
M
∑m =1 e
ϕ,k φ,m
β4m,k (t)

−2 σϕ,k 2 + σ2
φ,m | t − τ | 4
M
∑m =1 e β m,k (t)
θk (t) = ,
− σ 2 + σ2
φ,m | t − τ | 2
M
∑m =1 e
ϕ,k
β m,k (t)
M 1 M
Amrt (t) = N ∑m=1 β2m,k (τ ) +
M ∑m=1 m,k
η 2 (τ ) − Dmrt (t),

=1 β m,i ( τ ) λm,k
M 2
K N ∑m
Bmrt (t) = ∑i =k θi (τ )(χi (τ ) − 1)
.

Proof of Theorem 1. Please refer to Appendix A.

12
Electronics 2018, 7, 317

Theorem 2. When ZF precoder is used, the closed-form expression of the downlink achievable rate under
hardware impairments is given by

Dzf (t)
Rzf (t) = , (29)
Azf (t) + Bzf (t) + ξ UE /ρdl

where
⎛ ⎞2
Γ κ (t) + 12
D (t) = ⎝
zf ⎠ θ ( t ),
Γ (κ (t)) k

MN − K +1 M
Azf (t) =
M ∑m=1 β2m,k (τ ) − Dzf (t),
K M
M ∑m=1 m,k
Bzf (t) = η 2 ( τ ),
MN − K + 1
κ (t) = χ k ( t ).
MN

Proof of Theorem 2. Please refer to Appendix B.

Then, in order to study the effects of phase noise further, we investigate a case where the
number of antennas employed at each RAU goes inﬁnity and the number of RAUs and users is
ﬁxed. The asymptotic performance provided in Corollary 1 is obtained based on (28) and (29).

Corollary 1. Let N → ∞, the ultimate rate of user k with both MRT and ZF precoders is given by

− σϕ,k
2 + σ2
φ,m | t − τ | 2
M
∑m =1 e β m,k (τ )
R∞
k (t) =
, (30)
− σ 2 + σ2
φ,m | t − τ | 2
=1 β m,k ( τ ) − ∑m=1 e β m,k (τ )
M 2 M
∑m ϕ,k

Proof. Since the proof is similar for both precoders, we only provide the proof for MRT precoder.
It can be seen that χk (t) → ∞ when the number of antennas N → ∞. Therefore we can have
2
Γ(χk (t)+ 12 )
lim N →∞ − χk (t) = 0 [29]. Then the limiting rate of user k can be obtained directly
Γ(χk (t))

by dividing the denominator and numerator of (28) by N. From Corollary 1 we can see that the
ultimate rate without phase noise will be unlimited when N tends to inﬁnity, which means that phase
noise limits the downlink spectral efﬁciency.

4. Numerical Results
In this section, a series of Monte Carlo simulations is used to verify the theoretical results obtained
in Section 3. A circular single-cell massive MIMO system is considered. All of the RAUs and users are
randomly distributed in the cell and the minimum access distance between RAUs and users is set as
r0 = 30 m. The channels are generated by (4), and other simulation parameters are presented in Table 1.

Table 1. Basic simulation parameters.

Number of RAUs M 5
Number of users K 2
Cell radius R 1 km
Path loss exponent α 3.7
Power of uplink pilot symbol ρp 1 Watt
downlink transmit power ρdl 10 Watts
variance of thermal noise is assumed as σ2 −174 dBm

13
Electronics 2018, 7, 317

Figure 3 illustrates the theoretical and simulated spectral efficiency with MRT and ZF precoders
versus the number of antennas per RAU. The spectral efficiency is the average rate between users.
We assume that the variances of phase noise and amplified thermal noise are σφ,m 2 = σϕ,k
2 =
1.58 × 10−4 ∀m, k and ξ UE = ξ BS = 1.58σ2 . t is set as τ + 1. It can be seen that the closed-form
expressions in (28), (29) and the simulation results in (16) match well with each other. For both
precoders, the spectral efficiency increases and gets more and more close to the limiting average
rate with the increasing of N. When N = 100, the system achieves 80% of the ultimate rate with ZF
precoder and 76% with MRT precoder. Furthermore, it can be seen that ZF precoder achieves better
performance than MRT.
12

10
Spectral efficiency (bit/s/Hz)

Simulation-MRT
6 Theoretical-MRT
Simulation-ZF
Theoretical-ZF
5 Ultimate rate-MRT&ZF

4
10 20 30 40 50 60 70 80 90 100
M

Figure 3. Spectral efﬁciency against M with MRT and ZF precoders.

Next, we investigate the effects of phase noise. Figure 4 illustrates the theoretical spectral efficiency
with MRT and ZF precoders against the variance of phase noise. Notably, the variance of phase noise
reflects the strength of phase noise. It is assumed that the number of antennas N = 50 and other system
parameters have the same value as Figure 3. Figure 4 reveals that the spectral efficiency decreases
monotonically with the variance of phase noise increasing. In addition, phase noise have a greater
impact on ZF precoder. This results form the fact that ZF precoder is more sensitive to CSI. It can be
noted that as the variance increases, the performance gap between MRT and ZF precoders becomes
smaller. This is because when the phase noise is severe, the loss caused by unknown CSI at user side
dominates rather than the interference between users.
7.5
MRT ZF

7
Spectral efficiency (bit/s/Hz)

6.5

5.5

5
0 0.001 0.002 0.003 0.004 0.005 0.006 0.007 0.008 0.009 0.01

Figure 4. Spectral efﬁciency against σφ2 = σϕ2 using MRT and ZF precoders with σφ,m
2 = σϕ,k
2 = σ2 =
φ
σϕ2 , ∀m, k.

14
Electronics 2018, 7, 317

Figure 5 illustrates the instantaneous spectral efﬁciency during the downlink transmission phase.
The number of antennas employed at each RAU is assumed as N = 40 and the variance of phase noise
is set as σφ,m
2 2 = 1.58 × 10−4 ∀ m, k. In addition, the coherence time of channel is set as T = 200.
= σϕ,k
As shown in Figure 5, the spectral efﬁciency degrades as t increases. This is because the uncertainty
of the phase drift between downlink transmission phase and pilot training phase increases with the
growing of t. Figure 5 reveals that it’s improper to use the estimated CSI obtained during the pilot
phase for the whole data transmission phase.

7.5

7 ZF MRT

6.5
Spectral Efficiency (bit/s/Hz)

5.5

4.5

3.5
0 20 40 60 80 100 120 140 160 180 200

Figure 5. Spectral efﬁciency against t with MRT and ZF precoders.

5. Conclusions
In this paper, we analyzed downlink spectral efficiency with hardware impairments in distributed
massive MIMO. Initially, employing pilot symbol assisted transmission, we obtained the estimated
CSI in a more realistic scenario where transmission is distorted by phase noise and amplified thermal
noise. Next, we used the imperfect CSI to derive the closed-form expressions for downlink achievable
rates with MRT and ZF precoders. In addition, we obtained the ultimate rate when N → ∞. It can be
seen that the rate performance was limited by phase noise. Then, numerical results proved that the
theoretical analysis was accurate. Furthermore, they also revealed that ZF can achieve larger spectral
efficiency than MRT precoder, and hardware impairments had a greater impact on ZF precoder.
Finally, spectral efficiency degraded with the increasing of the variance of phase noise and downlink
transmission time.
In the future work, we intend to extend our research considering a more effective phase noise
model which could lead finer precoding strategies to improve the theoretical rates.

Author Contributions: Formal analysis, Q.L.; Supervision, J.L., P.Z., D.W. and X.Y.; Validation, Q.L.;
Writing—original draft, Q.L.; Writing—review & editing, Q.L. and J.L.
Funding: This work was supported in part by National Natural Science Foundation of China (NSFC)
(Grant NO. 61501113, 61571120, 61871122), Jiangsu Provincial Natural Science Foundation (Grant
No.BK20150630, BK20180011), six talent peaks project in Jiangsu province, and National Key Special Program
No.2018ZX03001008-002.
Conﬂicts of Interest: The authors declare no conﬂict of interest.

Appendix A. Proof of Theorem 1

2
When MRT precoder is chosen, the following three terms E gH
k ( τ ) Θ̃k ( t ) wk
, A(t) and B(t)
need to be calculated showing in (16).

15
Electronics 2018, 7, 317

2
For the term E gH
k ( τ ) Θ̃k ( t ) wk
, we can obtain

H
E g (τ ) Θ̃k (t) wk 2
k
( a) 2
= E gH k ( τ ) E Θ̃k ( t ) wk

(b) H 2 (A1)

= E |ĝk (τ ) Dk,τ (t)wk |
2
(c) Γ(χk (τ )+ 12 )
= θ k ( τ ),
Γ(χk (τ ))

where ( a) is obtained because Θ̃k (t) is independent of gk (τ ) and wk . By exploiting the fact
σ2 +σφ,m
2

that E e− j(θk (t)−θk (τ )) = e−

m m ϕ,k
2 |t−τ | and e ( t ) is independent of w , we can get ( b ),
k k
and (c) results from Lemma 2, Lemma 3 and the relationship between Gamma distribution and
Nakagami distribution.
For the term A(t), we obtain

var gH k ( τ ) Θ̃k ( t ) wk
2 2
H
=E ĝH k ( τ ) Θ̃ k ( t ) w k + E e k ( τ ) Θ̃ k ( t ) w k (A2)
2

− E gH
k ( τ ) Θ̃k ( t ) wk .

The ﬁrst term of (A2) can be calculated as

ĝk (τ ) 2 ( a)
E ĝH
M
( τ ) Θ̃ ( t ) = N ∑m=1 β2m,k (τ ), (A3)
k k
ĝk (τ )

where ( a) results from (26).

Next the second term can be given by

ĝk (τ ) 2 ( a) 1
E eH
M
k ( τ ) Θ̃ k ( t )
ĝk (τ )
=
M ∑m=1 ηm,k
2
( τ ), (A4)

where ( a) can be obtained by exploiting the fact that eH

k ( τ ) Θ̃k ( t ) ∼ Γ ( χe ( τ ), θe ( τ )) and (25).
2

For the last term B(t), we ﬁrst calculate

∞
1 ( a) 1 χi −1 e− x/θi
E = x χi dx
ĝi 2 0 x θi Γ ( χi )
1 χ −1
= χi θ i Γ ( χ i − 1) (A5)
θi Γ ( χi ) i
1
= ,
θ i ( χ i − 1)

16
Electronics 2018, 7, 317

where ( a) results from Remark 2 and we omit (t) in (A5). Based on (A5), we have
2

E gH k ( τ ) Θ̃k ( t ) wi
2
( a) 1
E gH ( τ ) Θ̃ k ( t ) ĝ i ( τ ) E
k
ĝi (τ )2

(A6)
E ∑m M
=1 β m,i ( τ ) λm,k ĥm,k ĥm,k
2 H
=
θi (τ )(χi (τ ) − 1)
=1 β m,i ( τ ) λm,k
M 2
N ∑m
= ,
θi (τ )(χi (τ ) − 1)

where x y means lim N →∞ ( x − y) = 0, ( a) results from Lemma 4 (ii) of [32].

Finally, combining (A1)–(A6) concludes the proof.

Appendix B. Proof of the Theorem 2

To derive the closed-form expression of (16) with ZF precoder, the following three terms
H
E g (τ ) Θ̃k (t) wk 2 , A(t) and B(t) need to be calculated, wherein wk = ak (τ ) .
k ak (τ )
H 2
For the term E gk (τ ) Θ̃k (t) wk , we obtain
2

E gH k ( τ ) Θ̃k ( t ) wk

( a)
2
= E gH k ( τ ) E Θ̃k ( t ) wk

(b) ak (τ ) 2
= E ĝH ( τ ) D ( t ) (A7)
k k,τ
ak (τ )
⎛ ⎞2
(c) Γ κ (t) + 12
=⎝ ⎠ θ ( t ),
Γ (κ (t)) k

where ( a) is obtained because Θ̃k (t) is independent of g(τ ) and wk . (b) is obtained due to the
independence of ek (τ ) and wk . (c) results from Lemma 2, Lemma 3 and the relationship between
Gamma distribution and Nakagami distribution.
Similar to the proof of Theorem 1, to get the closed-form expression of A(t), we need to calculate
the following two terms

H ak (τ ) 2

E ĝk (τ ) Θ̃k (t) , (A8)
ak (τ )

ak (τ ) 2
E eH
k ( τ ) Θ̃k ( t ) . (A9)
a (τ )
k

The ﬁrst term (A8) can be given by

H ak (τ ) 2 MN − K +1 M

E ĝk (τ ) Θ̃k (t) = ∑m=1 β2m,k (τ ), (A10)
ak (τ ) M

which results from (27).

17
Electronics 2018, 7, 317

Next the second term can be given by

ak (τ ) 2 1
E eH
M
k ( τ ) Θ̃ k ( t )
ak (τ )
=
M ∑m=1 ηm,k
2
( τ ), (A11)

which results from (25) and the fact that eH

k ( τ ) Θ̃k ( t ) ∼ Γ ( χe ( τ ), θe ( τ )).
2

For the term B(t), we can have

2

E gH k ( τ ) Θ̃k ( t ) wi
2
( a)
= E eH k ( τ ) Θ̃k ( t ) wi
(A12)

1 M
=
M ∑m=1 ηm,k
2
( τ ),

where ( a) results from the property of ZF precoder.

Substituting (A7) and (A10)–(A12) into (16) completes the proof.

References
1. Wang, J.; Zhu, H.; Gomes, N.J.; Wang, J. Frequency Reuse of Beam Allocation for Multiuser Massive MIMO
Systems. IEEE Trans. Wirel. Commun. 2018, 17, 2346–2359. [CrossRef]
2. Wang, C.X.; Haider, F.; Gao, X.; You, X.H.; Yang, Y.; Yuan, D.; Aggoune, H.; Haas, H.; Fletcher, S.; Hepsaydir, E.
Cellular architecture and key technologies for 5G wireless communication networks. IEEE Commun. Mag.
2014, 52, 122–130. [CrossRef]
3. Jiang, F.; Chen, J.; Swindlehurst, A.L.; López-Salcedo, J.A. Massive MIMO for Wireless Sensing with a Coherent
Multiple Access Channel. IEEE Trans. Signal Process. 2015, 63, 3005–3017. [CrossRef]
4. Ciuonzo, D.; Rossi, P.S.; Dey, S. Massive MIMO Channel-Aware Decision Fusion. IEEE Trans. Signal Process.
2015, 63, 604–619. [CrossRef]
5. Ciuonzo, D.; Rossi, P.S.; Dey, S. Massive MIMO meets decision fusion: Decode-and-fuse vs. decode-then-fuse.
In Proceedings of the 2014 IEEE 8th Sensor Array and Multichannel Signal Processing Workshop (SAM),
A Coruna, Spain, 22–25 June 2014; pp. 265–268. [CrossRef]
6. Shirazinia, A.; Dey, S.; Ciuonzo, D.; Rossi, P.S. Massive MIMO for Decentralized Estimation of a Correlated
Source. IEEE Trans. Signal Process. 2016, 64, 2499–2512. [CrossRef]
7. Cao, J.; Wang, D.; Li, J.; Sun, Q.; Hu, Y. Uplink spectral efﬁciency analysis of multi-cell multi-user massive
MIMO over correlated Ricean channel. Sci. China Inf. Sci. 2018, 61, 082305. [CrossRef]
8. Björnson, E.; Matthaiou, M.; Debbah, M. Massive MIMO with non-ideal arbitrary arrays: Hardware scaling
laws and circuit-aware design. IEEE Trans. Wirel. Commun. 2015, 14, 4353–4368. [CrossRef]
9. Dai, L. An Uplink Capacity Analysis of the Distributed Antenna System (DAS): From Cellular DAS to DAS
with Virtual Cells. IEEE Trans. Wirel. Commun. 2014, 13, 2717–2731.
10. Liu, Z.; Dai, L. On the Scaling Behavior of Average Ergodic Capacity of Distributed MIMO Systems.
In Proceedings of the 2015 IEEE Global Communications Conference (GLOBECOM), San Diego, CA, USA,
6–10 December 2015; pp. 1–6. [CrossRef]
11. Wang, J.; Dai, L. Asymptotic Rate Analysis of Downlink Multi-User Systems with Co-Located and Distributed
Antennas. IEEE Trans. Wirel. Commun. 2015, 14, 3046–3058. [CrossRef]
12. Wang, J.; Dai, L. Downlink Rate Analysis for Virtual-Cell Based Large-Scale Distributed Antenna Systems.
IEEE Trans. Wirel. Commun. 2016, 15, 1998–2011. [CrossRef]
13. Khadka, A.; Adachi, K.; Sun, S.; Wang, J.; Zhu, H.; Wang, J. Cooperative Transmission Strategy over Users’
Mobility for Downlink Distributed Antenna Systems. In Proceedings of the GLOBECOM 2017–2017 IEEE
Global Communications Conference, Singapore, 4–8 December 2017; pp. 1–6. [CrossRef]
14. Zhu, H.; Wang, J. Radio Resource Allocation in Multiuser Distributed Antenna Systems. IEEE J. Sel.
Areas Commun. 2013, 31, 2058–2066. [CrossRef]

18
Electronics 2018, 7, 317

15. Lv, Q.; Li, J.; Zhu, P.; You, X. Spectral Efficiency Analysis for Bidirectional Dynamic Network with Massive
MIMO Under Imperfect CSI. IEEE Access 2018, 6, 43660–43671. [CrossRef]
16. Studer, C.; Wenk, M.; Burg, A. MIMO transmission with residual transmit-RF impairments. In Proceedings
of the 2010 International ITG Workshop on Smart Antennas (WSA), Bremen, Germany, 23–24 February 2010;
pp. 189–196. [CrossRef]
17. Papazafeiropoulos, A.; Clerckx, B.; Ratnarajah, T. Mitigation of phase noise in massive MIMO systems:
A rate-splitting approach. In Proceedings of the 2017 IEEE International Conference on Communications
(ICC), Paris, France, 21–25 May 2017; pp. 1–7. [CrossRef]
18. Pitarokoilis, A.; Mohammed, S.K.; Larsson, E.G. Uplink Performance of Time-Reversal MRC in Massive
MIMO Systems Subject to Phase Noise. IEEE Trans. Wirel. Commun. 2015, 14, 711–723. [CrossRef]
19. Pitarokoilis, A.; Mohammed, S.K.; Larsson, E.G. Achievable rates of ZF receivers in massive MIMO
with phase noise impairments. In Proceedings of the 2013 Asilomar Conference on Signals, Systems
and Computers, Pacific Grove, CA, USA, 3–6 November 2013; pp. 1004–1008. [CrossRef]
20. Li, J.; Wang, D.; Zhu, P.; You, X. Uplink Spectral Efficiency Analysis of Distributed Massive MIMO with
Channel Impairments. IEEE Access 2017, 5, 5020–5030. [CrossRef]
21. Fang, Y.; Li, X.; Qiu, L. Asymptotic Equivalent Performance of Uplink Massive MIMO Systems with Phase
Noise. In Proceedings of the 2018 IEEE International Conference on Communications (ICC), Kansas City,
MO, USA, 20–24 May 2018; pp. 1–6. [CrossRef]
22. Zhu, J.; Schober, R.; Bhargava, V.K. Physical layer security for massive MIMO systems impaired by phase
noise. In Proceedings of the 2016 IEEE 17th International Workshop on Signal Processing Advances in
Wireless Communications (SPAWC), Edinburgh, UK, 3–6 July 2016; pp. 1–5. [CrossRef]
23. Corvaja, R.; Armada, A.G. Phase Noise Degradation in Massive MIMO Downlink with Zero-Forcing and
Maximum Ratio Transmission Precoding. IEEE Trans. Veh. Technol. 2016, 65, 8052–8059. [CrossRef]
24. Krishnan, R.; Khanzadi, M.R.; Krishnan, N.; Wu, Y.; i Amat, A.G.; Eriksson, T.; Schober, R. Linear Massive
MIMO Precoders in the Presence of Phase Noise - A Large-Scale Analysis. IEEE Trans. Veh. Technol. 2016,
65, 3057–3071. [CrossRef]
25. Petrovic, D.; Rave, W.; Fettweis, G. Effects of Phase Noise on OFDM Systems with and without PLL:
Characterization and Compensation. IEEE Trans. Commun. 2007, 55, 1607–1616. [CrossRef]
26. Papazafeiropoulos, A.; Clerckx, B.; Ratnarajah, T. Rate-splitting to mitigate residual transceiver hardware
impairments in massive MIMO systems. IEEE Trans. Veh. Technol. 2017, 66, 8196–8211. [CrossRef]
27. Li, J.; Wang, D.; Zhu, P.; You, X. Spectral efficiency analysis of single-cell multi-user large-scale distributed
antenna system. IET Commun. 2014, 8, 2213–2221. [CrossRef]
28. Li, J.; Wang, D.; Zhu, P.; Wang, J.; You, X. Downlink Spectral Efficiency of Distributed Massive MIMO
Systems with Linear Beamforming Under Pilot Contamination. IEEE Trans. Veh. Technol. 2018, 67, 1130–1145.
[CrossRef]
29. Jose, J.; Ashikhmin, A.; Marzetta, T.L.; Vishwanath, S. Pilot contamination and precoding in multi-cell TDD
systems. IEEE Trans. Wirel. Commun. 2011, 10, 2640–2651. [CrossRef]
30. Hosseini, K.; Yu, W.; Adve, R.S. Large-scale MIMO versus network MIMO for multicell interference
mitigation. In Proceedings of the 2014 IEEE 15th International Workshop on Signal Processing Advances in
Wireless Communications (SPAWC), Toronto, ON, Canada, 22–25 June 2014; pp. 70–74. [CrossRef]
31. Li, J.; Wang, D.; Zhu, P.; You, X. Benefits of Beamforming Training Scheme in Distributed Large-Scale MIMO
Systems. IEEE Access 2018, 6, 7432–7444. [CrossRef]
32. Hoydis, J.; ten Brink, S.; Debbah, M. Massive MIMO in the UL/DL of Cellular Networks: How Many
Antennas Do We Need? IEEE J. Sel. Areas Commun. 2013, 31, 160–171. [CrossRef]

c 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access
article distributed under the terms and conditions of the Creative Commons Attribution
(CC BY) license (http://creativecommons.org/licenses/by/4.0/).

19
electronics
Article
Spectral and Energy Efﬁciency of Distributed Massive
MIMO with Low-Resolution ADC
Jiamin Li, Qian Lv *, Jing Yang, Pengcheng Zhu and Xiaohu You
National Mobile Communications Research Laboratory, Southeast University, Nanjing 210096, China;
[email protected] (J.L.); [email protected] (J.Y.); [email protected] (P.Z.); [email protected] (X.Y.)
* Correspondence: [email protected]; Tel.: +25-5209-1635

Received: 15 November 2018; Accepted: 3 December 2018; Published: 4 December 2018

Abstract: In this paper, considering a more realistic case where the low-resolution analog-to-digital
convertors (ADCs) are employed at receiver antennas, we investigate the spectral and energy
efficiency in multi-cell multi-user distributed massive multi-input multi-output (MIMO) systems with
two linear receivers. An additive quantization noise model is provided first to study the effects of
quantization noise. Using the model provided, the closed-form expressions for the uplink achievable
rates with a zero-forcing (ZF) receiver and a maximum ratio combination (MRC) receiver under
quantization noise and pilot contamination are derived. Furthermore, the asymptotic achievable rates
are also given when the number of quantization bits, the per user transmit power, and the number of
antennas per remote antenna unit (RAU) go to infinity, respectively. Numerical results prove that
the theoretical analysis is accurate and show that quantization noise degrades the performance in
spectral efficiency, but the growth in the number of antennas can compensate for the degradation.
Furthermore, low-resolution ADCs with 3 or 4 bits outperform perfect ADCs in energy efficiency.
Numerical results imply that it is preferable to use low-resolution ADCs in distributed massive
MIMO systems.

Keywords: distributed massive MIMO; energy efﬁciency; spectral efﬁciency; pilot contamination;
quantization noise

1. Introduction
Massive multi-input multi-output (MIMO) systems are an essential technology for the fifth
generation (5G) mobile networks because they can significantly improve spectral efficiency and energy
efficiency [1–6]. In massive MIMO systems, a relatively small number of users are served by hundreds
or thousands of antennas employed at base stations in the same time-frequency resource. The huge
number of antennas provides a high number of degrees-of-freedom, which favors low-complexity
receivers, such as maximum ratio combination (MRC) and zero-forcing (ZF), and beamforming, such
as ZF and maximum ratio transmission (MRT) [2,7,8]. Therefore, we consider MRC and ZF receivers
for uplink transmission in this paper. There are two categories for massive MIMO: co-located massive
MIMO and distributed massive MIMO [9]. Compared to co-located massive MIMO, distributed
massive MIMO has advantages of increasing spectral efficiency, energy efficiency, and system coverage
due to the reduced access distance [8,10–12]. Hence, a distributed massive MIMO system is considered
in this paper.
Although massive MIMO systems have significant performance gains, they also face new
challenges: high total power consumption, expensive hardware, and mass data processing [13].
Specifically, each antenna is equipped with a radio frequency (RF) chain, including an analog-to-digital
converter (ADC) unit in massive MIMO systems. However, with the increase in the antenna number,
the hardware complexity and the power consumption of ADCs increase exponentially with the number

Electronics 2018, 7, 391; doi:10.3390/electronics7120391 21 www.mdpi.com/journal/electronics

Electronics 2018, 7, 391

of quantization bits [14]. Therefore, one promising solution is to employ low-resolution ADCs in
massive MIMO systems. The study of low-resolution ADC in MIMO or massive MIMO systems has
caused widespread concern.
Spectral and energy efficiency are two fundamental metrics to analyze the impacts of
low-resolution ADCs. Spectral efficiency was investigated in [15–17]. The performance of 1-bit
resolution ADC in MIMO systems was studied in [15] considering the nonlinear characteristics of a
quantizer. In massive MIMO systems with low-resolution ADCs, the uplink achievable rate using
the common MRC receivers has been investigated in [16], and the uplink achievable rate using the
common ZF receivers was studied in [17]. However, these two papers made an assumption that
the base station had perfect channel state information (CSI), and they only considered a single-cell
massive MIMO system. In fact, the CSI is not available at the base station. On the other hand, energy
efficiency was studied in [13,18–20]. The optimal number of quantized bits and antenna selection were
considered to maximize the energy efficiency of general MIMO with low-resolution ADCs in [13].
It was pointed out in [18] that very low bit resolution is not preferable from the perspective of energy
efficiency. A function about energy efficiency and the number of quantized bits was obtained in [19].
The previous papers mainly studied a single-cell system, made an assumption that the base
station had perfect CSI, and did not analyze spectral efficiency and energy efficiency simultaneously.
Hence, in this paper, a multi-cell multi-user massive MIMO system with low-resolution ADCs is
considered, and we assume that the base stations estimate CSI during the uplink pilot transmission
phase. Furthermore, the uplink spectral and energy efficiency are both analyzed. Here are the key
contributions of this paper:

1. A joint uplink signal model is provided and it enables us to study the effects of pilot contamination
and quantization noise simultaneously.
2. Under imperfect CSI and considering MRC and ZF receivers, we derive the closed-form
expressions for the uplink achievable rates. The asymptotic performance with quantization bits,
the number of antennas per RAU, and per user transmit power are also obtained.
3. The theoretical results are verified by performing Monte Carlo simulations, and we obtain
deep insight into the impacts of quantization noise on the uplink spectral efficiency and energy
efficiency in distributed massive MIMO systems.

2. System Model
We consider a distributed massive MIMO system. There are L adjacent cells, and each cell consists
of M remote antennas units (RAUs) and K single-antenna users. Each RAU is equipped with an
array of N antennas. Each antenna is equipped with a low-resolution ADC, which means system
performance will be degraded by quantization noise. RAUs in the same cell transmit or receive signals
simultaneously while the beamforming design and signal processing are performed in a baseband
processing unit. An example is given in Figure 1. There are L = 7 adjacent cells, and in cell-1, there are
K = 6 users and M = 6 RAUs.

22
Electronics 2018, 7, 391

&HOO

&HOO &HOO

&HOO

&HOO &HOO

&HOO
Baseband LLow-resolution
processing unit User ADCs RAU

Figure 1. System conﬁguration.

2.1. Quantization Noise Model

For uplink transmission, the signal vector received by all RAUs in cell l can be given by

√ L
yl = pu ∑i=1 Gl,i xi + nl , (1)

where xi is the K × 1 signal vector transmitted by the K users in cell i, pu is the uplink transmitted
power, and nl ∼ CN (0, I MN ) is the additive white Gaussian noise, Gl,i = [gl,i,1 , ..., gl,i,K ] is the MN × K
channel matrix from M RAUs in cell l to K users in cell i, wherein
T
gl,i,k = λl,1,i,k hTl,1,i,k , · · · , λl,M,i,k hTl,M,i,k (2)

where λl,m,i,k is the path loss between the k-th user in the i-th cell and the m-th RAU in the l-th
cell, which is dependent on the corresponding distance, and hl,m,i,k ∼ CN (0, I N ) denotes the small
scale fading.
This paper assumes that the CSI is unknown to the base station, and pilot training is performed.
Motivated by [21], based on the minimum mean square error (MMSE) channel estimation, the
equivalent estimated channel can be given by
T
ĝi,l,k = β i,1,l,k ĥTi,k,1 , · · · , β i,M,l,k ĥTi,k,M (3)

where
λ2i,m,l,k
β i,m,l,k = . (4)
∑ Lj=1 λi,m,j,k + 1/(τ pu )

τ denotes the length of pilot sequences, β i,m,l,k denotes the equivalent path loss between the k-th user in
Δ
the l-th cell and the m-th RAU in the i-th cell, and ĥi,k = [ĥTi,k,1 , · · · , ĥTi,k,M ]T ∼ CN (0, I MN ) represents
the equivalent small scale fading part of the estimated channel. Because of the orthogonality principle
of MMSE estimation theory, gi,l,k can be decomposed as

gi,l,k = ĝi,l,k + g̃i,l,k (5)

where g̃i,l,k ∼ CN 0, diag ηi,1,l,k , · · · , ηi,M,l,k ⊗ I N is the uncorrelated and statistically independent
Δ
of ĝi,l,k estimation error, and ηi,m,l,k = λi,m,l,k − β i,m,l,k .
After the received analog signals pass through the low-resolution ADCs, the quantized digital
signal vector can be obtained as
√
L
yl,q = Q(yl ) = Q pu ∑i=1 Gl,i xi + nl (6)

23
Electronics 2018, 7, 391

where Q(.) represents the quantization function. Assuming that the gain of automatic gain control is
appropriately set, the additive quantization noise model (AQNM) can be employed to reformulate the
quantized signal vector as
√ L
yl,q = αyl + nl,q = α pu ∑i=1 Gl,i xi + αnl + nl,q (7)

where α = 1 − ρ, ρ is the inverse of the signal-to-quantization-noise ratio, and nl,q denotes the
additive uncorrelated quantization noise vector, which is Gaussian-distributed. The parameter ρ is a
constant dependent on the number of quantization bits b. According to [16], the covariance matrix of
quantization noise nl,q for a ﬁxed channel realization can be denoted as

L
Rnl,q = α(1 − α)diag pu ∑i=1 Gl,i GH
l,i + I . (8)

2.2. The Energy Efﬁciency Model

(1) Achievable uplink rates: In the uplink transmission phase, the quantized signal processed by
the linear detector of user k in cell l is presented as

rl,k = aH
l,k yl,q
√ L K √ L K
= α pu ∑ i =1 ∑ j =1 aH
l,k ĝl,i,j xi,j + α pu ∑i =1 ∑ j=1 al,k g̃l,i,j xi,j + αal,k nl + αal,k nl,q
H H H
(9)

where al,k is the linear receiver vector in cell l for user k, and xi,j ∼ CN (0, 1) is the j-th column of
xi . In this paper, we focus on two linear receivers, namely MRC and ZF. Mathematically, al,k can be
given by
ĝl,l,k , for MRC
al,k = (10)
fl,l,k , for ZF
−1
where fl,l,k is the k-th column of Ĝl,l ĜH
l,l Ĝl,l , and Ĝl,l = [ĝl,l,1 , · · · , ĝl,l,K ].
Motivated by [22,23], treating the interference as worst-case unrelated additive noise, the lower
bound of the achievable uplink rate of the k-th user in the l-th cell can be given by

pu α2 | aH
l,k ĝl,l,k |
2
Rl,k (p) = E log2 1+ (11)
E aH
l,k pu α ∑(i,j) =(l,k) ĝl,i,j ĝl,i,j + pu α ∑(i,j) g̃l,i,j g̃l,i,j + α + Rnl,q al,k | Ĝl,l
2 H 2 H 2

( a) pu α2 | aH
l,k ĝl,l,k |
2
= E log2 1 + (12)
E Il,k + pu α2 ∑i =l |aHl,k ĝl,i,k | + α al,k
2 2 H 2

where p is the transmitted power vector of K users. Since the denominator of Equation (11) is a
conditional expectation operator and the estimated error vector and estimated channel vector are
independent, Il,k is given as

L
Il,k = pu α2 ∑i=1 ∑ j =k aH
l,k E ĝl,i,j ĝl,i,j al,k + pu α ∑(i,j) al,k E g̃l,i,j g̃l,i,j al,k + al,k E Rnl,q al,k
H 2 H H H
(13)

24
Electronics 2018, 7, 391

wherein

E ĝl,i,j ĝH
l,i,j = diag β l,1,i,j I N , · · · , β l,M,i,j I N

E g̃l,i,j g̃H
l,i,j = diag ηl,1,i,j I N , · · · , ηl,M,i,j I N

E Rnl,q = diag pu Ĝl,l ĜH l,l + DR (14)
K K K K
DR = diag pu ∑ ηl,1,l,j + pu ∑ ∑ λl,1,i,j + 1 IN , · · · , pu ∑ ηl,M,l,j + pu ∑ ∑ λl,M,i,j + 1 IN .
j =1 i = l j =1 j =1 i = l j =1

ηl,m,i,j = ηl,m,i,j − β l.m,i,j .

(2) Power consumption model: According to [24–26], for cell l, the total power consumption
model can be given by

Pl = PTC + PLP + PT + PBH . (15)

The ﬁrst term PTC is the power consumption of transceiver chains, which can be given by

PTC = M( NPBS + ρPSYN ) + (1 − ρ) PSYN + KPUE + MPADC (16)

where PBS and PUE are the power consumption of running the circuit components employed at the
base station and users, PSYN are the power consumed by the local oscillator, and PADC = a0 N2b + a1
are the power consumed by ADC, wherein a0 and a1 are constant parameters, ρ = 1 for the distributed
antenna system (DAS), and ρ = 0 for the co-located antenna system (CAS). This results from the
assumption that antennas at the same RAU are connected to a common oscillator, while oscillators
at different RAUs are different in the DAS, and all antennas are connected to a single oscillator in
the CAS.
The second term PLP is the power consumption of the MRC/ZF receiver at the base station, which
can be given by

B 3MNK 3
MNK (3K +1)

PLP = B T −
T
τ 2MNK
L + LBS (1 − d ) + d 3LBS +
K
LBS (17)
BS T
where d = 0 for MRC while d = 1 for ZF, B is the bandwidth, T denotes the symbols for uplink
transmission, and LBS is the computational efﬁciency of arithmetic complex-valued operations for
a Joule.
The third term PT is transmit power, which can be represented as

T −τ K
PT = T ξ pu (18)

where ξ is the ampliﬁed efﬁciency.

For the last term, PBH is the power consumed of backhaul in the DAS, while it can be neglected in
the CAS. Speciﬁcally, PBH in the DAS can be given by

K
PBH = M P0 + BPBT ∑k=1 Rl,k (p) (19)

where P0 and PBT are the fixed and traffic-dependent power consumption at each backhaul, respectively.
(3) Global energy efficiency model: Based on the above analysis, the total power consumption for
all cells can be given by

T −τ L K
PTotal (p) = LPIND + ξT LKpu + PBT MB ∑l =1 ∑k=1 Rl,k (p) (20)

25
Electronics 2018, 7, 391

where PIND is the power consumption independent of p and can be given by

PIND = PTC + PLP + MP0 . (21)

According to [24], the global energy efficiency is defined as the ratio of the achievable sum rate to
the total power consumption in Watts. Mathematically, it can be defined as

B ∑lL=1 ∑kK=1 Rl,k (p)

ϕ(p) = PTotal . (22)

3. Energy Efﬁciency Analysis

From Equations (22) and (A4), we can see that it is difficult to directly calculate Equation (A4)
to analyze the energy efficiency. Therefore, we first derive the closed-form expressions of uplink
achievable rates. The results are shown in the following theorems.

Theorem 1. Using MRC receiver with low-resolution ADCs and pilot contamination, the closed-form
expression for the uplink achievable rate of the k-th user in the l-th cell is given by
2

pu α ( N ∑mM=1 β l,m,l,k ) + N ∑m
M 2
=1 β l,m,l,k
l,k = log2
Rmrc 1+
pu αΩl,k + pu αΞl,k +(1−α)Φl,k +α ∑m
M (23)
=1 β l,m,l,k
2

where
M L K
Ωl,k = N ∑m=1 ∑i=1 ∑ j=1 β l,m,l,k ηl,m,i,j
M L M 2
Ξl,k = N ∑ β l,m,l,k ∑ ∑ β l,m,i,j + ∑ β l,m,i,k +∑ N ∑ β1/2 1/2
l,m,l,k β l,m,i,k
m =1 i =1 j = k i =l i =l m =1
⎛ ⎞
M 2 M M
Φl,k = pu ⎝ N ∑ β l,m,l,k +N ∑ β2l,m,l,k + N ∑∑ β l,m,l,k β l,m,l,j ⎠ + Υl,k
m =1 m =1 j = k m =1

M K K
Υl,k = N ∑m=1 β l,m,l,k 1 + pu ∑ j=1 ηl,m,l,j + pu ∑i =l ∑ j=1 λl,m,i,j . (24)

Proof. The proof is given in Appendix B.

Theorem 2. Using ZF receiver with low-resolution ADCs and pilot contamination, the closed-form expression
for the uplink achievable rate of user k in the l-th cell is given by

⎛ ⎞
⎜ M
pu αζ ∑m ⎟
=1 β l,m,l,k
Rl,k = log2 ⎜ ⎟
zf
⎝1 + M L K

M
⎠ (25)
αpu ∑ ∑ ∑ β l,m,i,j + ∑ ∑ ηl,m,i,j +ζ ∑ β l,m,i,k +(1−α) ζ pu ∑ β l,m,l,k +Δl,k +αM
m =1 i =l j =k i =1 j =1 i =l m =1

where ζ = MN − K + 1, and

M K K
Δl,k = ∑ m =1 1 + pu ∑ j=1 ηl,m,l,j + pu ∑i =l ∑ j=1 λl,m,i,j . (26)

Proof. The proof is given in Appendix C.

From Equations (23) and (25), it can be concluded that the quantization noise inﬂuences both the
numerator and the denominator of Equation (A4). This means that the quantization noise is unlike the
additive noise, which only affects the denominator.

26
Electronics 2018, 7, 391

Based on the theorems above, we analyze the asymptotic performance with quantization
bits, per user transmit power, and the number of antennas per RAU, respectively. The results are
given below.
Case 1: With a ﬁxed transmitted power per user pu and a total number of antennas per cell MN,
when the number of quantization bits b → ∞, the inverse of the signal-to-quantization-noise ratio
ρ tends toward zero, which means that α in Equations (23) and (25) tends toward 1. The following
results can then be obtained in this case by replacing the α in Equations (23) and (25) with 1
2

pu ( N ∑mM=1 β l,m,l,k ) + N ∑m
M 2
=1 β l,m,l,k
R̃mrc
l,k = log2 1 + (27)
pu Ωl,k + pu Ξl,k +∑m
M
=1 β l,m,l,k
2

⎛ ⎞
⎜ M
pu ζ ∑ m ⎟
⎜ =1 β l,m,l,k ⎟.
l,k = log2 ⎝1 +
R̃zf M L K ⎠ (28)
pu ∑ ∑ ∑ β l,m,i,j + ∑ ∑ ηl,m,i,j +ζ ∑ β l,m,i,k + M
m =1 i =l j =k i =1 j =1 i =l

Case 1 shows the achievable uplink rates without considering the quantization noise caused by
ADC. It can be seen that the spectral efficiency is only limited by pilot contamination and channel
estimation error. Moreover, since the power consumption of an ADC PADC = a0 N2b + a1 is an
exponential function of b, PADC tends toward infinity when b → ∞. As shown in Equation (20), the
total power consumption also goes to infinity. Hence, the limited achievable rates and unlimited power
consumption lead to the fact that the global energy efficiency tends toward zero, that is φ(p) → 0
when b → ∞, while pu and MN are fixed.
Case 2: With a fixed number of quantization bits b and antennas per cell MN, when pu → ∞,
the ultimate rates of user k in cell l with both receivers can be directly obtained by dividing the
dominators and numerators of Equations (23) and (25) by pu , which are given by

2

α ( N ∑mM=1 β l,m,l,k ) + N ∑m
M 2
=1 β l,m,l,k
l,k = log2
R̃mrc 1+ αΩl,k +αΞl,k +(1−α)Φl,k
(29)
⎛ ⎞
⎜ M
αζ ∑m ⎟
⎜ =1 β l,m,l,k
l,k = log2 ⎝1 +
R̃zf ⎟
⎠ (30)
M L K M
α ∑ ∑ ∑ β l,m,i,j + ∑ ∑ ηl,m,i,j +ζ ∑ β l,m,i,k +(1−α) ζ ∑ β l,m,l,k +Δl,k
m =1 i =l j =k i =1 j =1 i =l m =1

where
⎛ ⎞
M 2 M M
Φl,k = ⎝ N ∑ β l,m,l,k +N ∑ β2l,m,l,k + N ∑∑ β l,m,l,k β l,m,l,j ⎠ + Υl,k
m =1 m =1 j = k m =1

M K K
Υl,k = N ∑m=1 β l,m,l,k ∑ j=1 ηl,m,l,j + ∑i =l ∑ j=1 λl,m,i,j

M K K
Δl,k = ∑m=1 ∑ j=1 ηl,m,l,j + ∑i =l ∑ j=1 λl,m,i,j .

Case 2 indicates that, as pu grows indefinitely, the achievable uplink rates approach certain values
dependent on the resolution of ADC. This observation shows that the performance degradation due to
low-resolution ADCs cannot be compensated by increasing the transmit power. Furthermore, it can
be seen that, as the transmit power increases, the system energy efficiency tends toward zero. This is
because the total power consumption presented in Equation (20) tends toward infinity as pu increases
indefinitely, but the unlimited growth in the transmit power cannot improve the achievable uplink
rates indefinitely.
Case 3: With a fixed number of quantization bits b, the number of RAUs per cell M and the
transmitted power per user pu , when N → ∞, the limiting rates of user k in cell l with both receivers

27
Electronics 2018, 7, 391

can be directly obtained by dividing the dominators and numerators of Equations (23) and (25) by N 2 ,
which are given by

2
α(∑m
M
=1 β l,m,l,k )
l,k = log2
Rmrc 1+ 2 2
(31)
M
α ∑i =l ∑ m =1 β1/2 1/2
l,m,l,k β l,m,i,k+(1−α)(∑m
M
=1 β l,m,l,k )

M
α ∑m =1 β l,m,l,k
l,k = log2 1 +
Rzf M
α ∑i =l ∑ m β l,m,i,k +(1−α) ∑m M . (32)
=1 =1 β l,m,l,k

Case 3 shows that, when the number of antennas per RAU grows without bound, the impacts of
quantization noise vanish. However, the achievable rates with both receivers tend toward certain and
limited values as N goes infinity. This results from the presence of pilot contamination. Furthermore,
since the power consumption of transceiver chains and linear processing at the base station are
proportional to N, they tend toward infinity when N → ∞. As shown in Equation (20), the total
power consumption also goes to infinity. Hence, the limited achievable rates and unlimited power
consumption lead to the fact that the global energy efficiency tends toward zero, that is φ(p) → 0
when N → ∞ while b, M, and pu are fixed.

4. Numerical Results
In this section, we verify the accuracy of the theoretical results in Section 3 by a series of Monte
Carlo simulations. A multi-cell distributed massive MIMO system is considered, which consists
of L = 7 cells, M = 7 RAUs per cell, K = 6 users per cell, and the cell radius D is normalized
to 1. In each cell, all users are uniformly
√ distributed, while RAUs have ﬁxed locations with radiuses
r1 = 0, r2 = · · · = r7 = (3 − 3)/2 and angles θ1 = 0, θ2 = π/6, θ3 = π/2, θ4 = 5π/6, θ5 =
7π/6, θ6 = 3π/2, and θ7 = 11π/6. The path loss between the k-th user in the i-th cell and the m-th
RAU in the l-th cell λl,m,i,k is modeled as λl,m,i,k = d− ι
l,m,i,k , where dl,m,i,k is the corresponding distance,
and ι assumed as ι = 3.7 is the path loss exponent. Moreover, the length of pilot sequences is τ = K.
The coherence time of the channel is assumed as T = 196 symbols, and the power consumption
parameters are given in Table 1.

Table 1. Power consumption parameters.

Parameters Values
Transmitted power per user pu 0.02 Watts
Power consumption per antenna at base station PBS 0.1 Watts
Power consumption per antenna at users PUE 0.1 Watts
Power consumption per local oscillator PSYN 1 Watts
Computational efficiency at base stations LBS 12.8 Gflops/Watt
Power amplified efficiency ξ 0.4
Fixed power consumption per backhaul P0 0.825 Watts
Traffic-dependent backhaul power PBT 0.25 Watts/(Gbit/s)
Parameters of ADC a0 &a1 10−3 & 0.02

We first prove the accuracy of the theoretical results given in Theorems 1 and 2. Figure 2 illustrates
the uplink spectral efficiency per cell versus the number of quantization bits with different numbers
of antennas per RAU. It can be seen that the closed-form expressions and simulation results match
well with each other using both MRC and ZF receivers. As the number of antennas per RAU increases,
the uplink spectral efficiency grows obviously for both receivers. Furthermore, for both receivers, the
uplink spectral efficiency increases rapidly with the increase in quantization bits b when b is small,
while the growth of b cannot improve the spectral efficiency further when b is large. It can be concluded
that low-resolution ADCs are acceptable in massive MIMO systems, and employing a large number

28
Electronics 2018, 7, 391

of antennas at each RAU can compensate for the performance degradation. In the following, the
closed-form expressions will be used for numerical work.

24
N=20

22
Spectral efficiency (bit/s/Hz)

18
N=10

14
N=5
12
ZF,Simulation
10 ZF,Theoretical
MRC,Simulation
MRC,Theoretical
8
1 2 3 4 5 6 7 8
Number of quantization bits

Figure 2. Spectral efﬁciency versus the number of quantization bits with different numbers of antennas
per RAU.

Next, Figure 3 illustrates the energy efficiency versus spectral efficiency with different numbers
of quantization bits and of antennas per RAU. It can be seen that, as the number of antennas and
the number of quantization bits increase, energy efficiency increases first and then decreases. This is
because the power consumption and spectral efficiency both increase with the increase in antennas
and quantization bits, but the improvement of spectral efficiency dominates first, and the power
consumption then dominates. The results illustrate that we cannot improve the spectral efficiency and
energy efficiency simultaneously without bound, and there needs to be a tradeoff between them, which
was investigated in [27–29]. Moreover, it can be seen in Figure 3b that b = 3 or b = 4 are preferable
under the system configuration mentioned above. If b increases further, the spectral efficiency can be
slightly improved while the energy efficiency decreases rapidly. It should be noted that the optimal
number of bits is dependent on system configuration and system parameters. Figure 3 also indicates
that low-resolution ADCs (b = 3 or 4 bits in our simulation results) are preferable in distributed massive
MIMO systems.
Finally, the energy efficiency against spectral efficiency with different numbers of quantization bits
and transmitted power per user is presented in Figure 4. The same conclusion about the relationship
between energy efficiency and spectral efficiency with different b can be obtained from Figure 4. As for
the transmitted power per user, it can be seen that, with its increase, the energy efficiency increases
first and then decreases. This results from the fact that the power consumption linearly increases with
the growth of transmitted power, but the spectral efficiency increases first and then tends toward a
certain value.

29
Electronics 2018, 7, 391

b=1 bits
1 b=2 bits
b=3 bits
b=6 bits
b=8 bits
b=9 bits
Energy efficiency 0.8 b=11 bits

0.6

0.4

0.2

0
0 5 10 15 20 25 30 35 40
Spectral Efficiency

(a)

N=4
0.9 N=6
N=8
N=12
0.8
N=16
N=22
0.7 N=30
Energy efficiency

0.6

0.5

0.4

0.3

0.2

0.1

0
5 10 15 20 25 30
Spectral efficiency

(b)

Figure 3. Energy efﬁciency versus spectral efﬁciency with different numbers of quantization bits b
and different numbers of antennas N per RAU. (a) Each line corresponds to different numbers of
quantization bits with b = [1, 2, 3, 6, 8, 9, 11] and the points on each line correspond to different numbers
of antennas per RAU with N= [1:1:6, 8, 10, 12, 15, 20, 30, 40, 50, 80, 100]. (b) Each line corresponds to
different numbers of antennas per RAU with N= [4, 6, 8, 12, 16, 22, 30] and the points on each line
correspond to different numbers of quantization bits with b = [1:1:12].

30
Electronics 2018, 7, 391

1
b=1 bits
b=2 bits
0.95 b=3 bits
b=4 bits
b=5 bits
0.9 b=6 bits

0.85
Energy efficiency

0.8

0.75

0.7

0.65

0.6

0.55
18 20 22 24 26 28 30 32
Spectral efficiency

Figure 4. Energy efﬁciency versus spectral efﬁciency with the number of quantization bits b = [1:1:6]
bits and transmitted power per user pu = [0.01, 0.02, 0.05, 0.08, 0.1, 0.15, 0.2, 0.3, 0.4, 0.6, 0.8, 1] W, where
each line corresponds to different b and the points on each line correspond to different pu .

5. Conclusions
In this paper, we analyzed the uplink spectral and energy efficiency simultaneously in distributed
massive MIMO systems with low-resolution ADCs. Furthermore, this paper considered a more
realistic scenario where the base station did not have CSI and it obtained the estimated CSI during
the pilot phase. In this case, the pilot contamination presents and degrades the system performance.
We first gave an additive quantization noise model and got the estimated CSI with pilot contamination.
Under the imperfect CSI, we derived the closed-form expressions for achievable uplink rates using
MRC and ZF receivers. Furthermore, we obtained the asymptotic performance with the number of
quantization bits, the per user transmit power, and the per RAU antenna number, respectively. The
theorems are verified by simulation. It can be noted that the increase in antennas can compensate for
the spectral efficiency degradation caused by quantization noise. Furthermore, the energy efficiency
with low-resolution ADCs are better than that with perfect ADCs. Numerical results imply that it is
preferable to use low-resolution ADCs in distributed massive MIMO systems.
We intend to extend our research considering the tradeoff between spectral efficiency and energy
efficiency, which involves multi-objective optimization. Furthermore, in order to make the system
more energy-efficient, we plan to extend our research considering RAU selection.

Author Contributions: Formal analysis, J.L.; Methodology, J.L.; Supervision, P.Z. and X.Y.; Validation, J.L. and
J.Y.; Writing original draft, J.L. and Q.L.; Writing review & editing, J.L. and Q.L.
Funding: This work was supported in part by National Natural Science Foundation of China (NSFC) (Grant No.
61501113, 61571120) and the Jiangsu Provincial Natural Science Foundation (Grant No. BK20150630, BK20180011).
Conﬂicts of Interest: The authors declare no conﬂict of interest.

Appendix A. Lemmas for Proof

In order to derive the closed-form expressions with both receivers, we provide the following
preliminary lemmas ﬁrst.

31
Electronics 2018, 7, 391

Lemma A1 ([22]). Suppose { Xi } are independent Gamma distributed random variables, i.e., { Xi } ∼ Γ(k i , θi ).
Then the ﬁrst two moments of the sum ∑i Xi can be given by

E ∑ i Xi = ∑i k i θi , (A1)
2 2
E ∑ i Xi = ∑i k i θi2 + ∑i k i θi . (A2)

Lemma A2 ([21]). For the p-dimensional non-isotropic channel vector x whose strength is distributed as
xH x ∼ Γ(k, θ ), when projected onto a s-dimensional subspace, the distribution of the projection power can be
approximated as Γ(sk/p, θ ).

Lemma A3 ([30]). If x is an N × 1 isotropic random vector and A is a constant matrix. Then we can have
tr(A)
Ex xH Ax = . (A3)
N

Appendix B. Proof of Theorem 1

From Lemma 4 of [31], we can obtain the approximation of Equation (11) as follows:

pu α2 E [ | aH
l,k ĝl,l,k | ]
2
Rl,k (p) ≈ log2 1+ . (A4)
E Il,k + pu α2 ∑i =l |aH l,k ĝl,i,k | + α al,k
2 2 H 2

Consider the MRC receiver, it can be seen from Equation (A4) that the following terms need to
be simpliﬁed:

E ĝl,l,k 4 = k̂ l,l,k θ̂l,l,k
2
+ (k̂ l,l,k θ̂l,l,k )2 (A5)

where
M
=1 β l,m,l,k
2
N (∑m
M
=1 β l,m,l,k )
2 ∑m
k̂ l,l,k = M
θ̂l,l,k = M
. (A6)
∑m =1 β2l,m,l,k ∑m =1 β l,m,l,k

l,l,k ĝl,l,k ∼ Γ ( k̂ l,l,k , θ̂l,l,k ) and Lemma A1.

This can be obtained by exploiting the fact that ĝH
Due to the independence between ĝl,l,k and ĝl,i,j when j = k, we have
( a)
M
l,l,k E ĝl,i,j ĝl,i,j ĝl,l,k = N ∑m=1 β l,m,l,k β l,m,i,j
E ĝH H (A7)

where ( a) results from the fact that the channel strength is Gamma-distributed.
Because of the pilot contamination, ĝl,l,k and ĝl,i,k are dependent, we have
2
2 ( a) M M
E |ĝH
l,l,k ĝl,i,k | = N ∑m=1 β1/2 1/2
l,m,l,k β l,m,i,k + N ∑m=1 β l,m,l,k β l,m,i,k (A8)

where (a) is obtained due to the fact that the channel strength is Gamma-distributed and due to
Lemma A1.
Using the fact that ĝl,l,k and g̃l,i,j are independent, we have

M L K
∑(i,j) E l,l,k E g̃l,i,j g̃l,i,j ĝl,l,k = N ∑m=1 ∑i =1 ∑ j=1 β l,m,l,k ηl,m,i,j .
ĝH H
(A9)

32
Electronics 2018, 7, 391

For the last term, we ﬁrst calculate

E ĝH H
l,l,k diag pu Ĝl,l Ĝl,l ĝl,l,k

= pu E ĝH l,l,k ĝl,l,k ĝl,l,k ĝl,l,k + pu ∑ j =k E ĝl,l,k ĝl,l,j ĝl,l,j ĝl,l,k
H H H
(A10)

M
= pu k̂ l,l,k θ̂l,l,k
2
+ (k̂ l,l,k θ̂l,l,k )2 + N ∑ j =k ∑m=1 β l,m,l,k β l,m,l,j .

Then we can obtain

L
E ĝH
l,l,k E diag pu ∑ Gl,i GHl,i + I ĝl,l,k
i =1

MN
ĝl,n,l,k 2 DR (A11)
=E ĝH
l,l,k diag pu Ĝl,l ĜH
l,l ĝl,l,k + E ∑
n =1

M
= pu k̂ l,l,k θ̂l,l,k
2
+ (k̂ l,l,k θ̂l,l,k )2 + N ∑ j =k ∑m=1 β l,m,l,k β l,m,l,j + Υl,k .

Substituting Equations (A5), (A7)–(A9), and (A11) into Equation (A4) yields the closed-form
expression expressed by Equation (23). This completes the proof.

Appendix C. Proof of the Theorem 2

Consider a ZF receiver, similar to the proof of Theorem 1. The following terms need to
be calculated.
For the term E aH1 2 , we have
l,k

H
1 a
= aHl,k2 ĝl,l,k ∼ Γ MNMN
−K +1 k̂
l,l,k , θ̂l,l,k , (A12)
aH
l,k
2 l,k

which results form Lemma A2 and from the fact that, from the perspective of each user, an intended
beam lies in a subspace of dimension s = MN − K + 1 with ZF receivers. Thus,

MN −K +1
E aH
1
= k̂ l,l,k θ̂l,l,k . (A13)
l,k
2 MN

Next, due to the independence between al,k and ĝl,i,j , we have

∑i =l ∑ j =k E aH l,k E ĝl,i,j ĝl,i,j al,k
H

( a)

= ∑ i = l ∑ j = k E E aH H
l,k ĝl,i,j ĝl,i,j al,k
(b) 1

= E ĝHl,i,j ĝl,i,j
MN
(c) 1 M
= M ∑i =l ∑ j=k ∑m=1 βl,m,i,j . (A14)

where (a) results from the fact that al,k and ĝl,i,j are independent, (b) results from Lemma A3, and (c)
l,i,j ĝl,i,j ∼ Γ ( k̂ l,i,j , θ̂l,i,j ).
results from the fact that ĝH
Similarly, we have

aH L K M
∑(i,j) E ∑i=1 ∑ j=1 ∑m=1 ηl,m,i,j .
al,k
2E g̃l,i,j g̃H =
l,k 1
aH
(A15)
l,k aH
l,k
l,i,j 2 M

33
Electronics 2018, 7, 391

Due to the pilot contamination, al,k and ĝl,i,k are dependent, we have
2
aHl,k
∑i =l aHl,k l,i,k =
MN −K +1 M
E ĝ M ∑i =l ∑m=1 βl,m,i,k . (A16)

For the last term, we ﬁrst calculate

H
a al,k
E aHl,k diag pu Ĝl,l ĜH l,l aH
l,k l,k
H
al,k al,k
= pu E aH ĝl,l,k ĝl,l,k aH
H
l,k l,k
MN −K +1
= MN pu k̂ l,l,k θ̂l,l,k . (A17)

Then we can obtain

L
aH
∑
al,k
E l,k
aH
E diag pu Gl,i GH +I
l,k aH
l,k
l,i
i =1
H
aH al,k al,k al,k (A18)
=E l,k
aH
diag p u Ĝ Ĝ H
l,l l,l aH + E aH
D R aH
l,k l,k l,k l,k

( a) MN −K +1 1
= pu k̂ l,l,k θ̂l,l,k + Δ
MN M l,k

where ( a) results form Lemma A3.

Substituting Equations (A13)–(A16), and (A18) into Equation (A4) yields the closed-form
expression expressed by Equation (25). This completes the proof.

References
1. Wang, J.; Zhu, H.; Gomes, N.J.; Wang, J. Frequency Reuse of Beam Allocation for Multiuser Massive MIMO
Systems. IEEE Trans. Wirel. Commun. 2018, 17, 2346–2359. [CrossRef]
2. Marzetta, T.L. Noncooperative Cellular Wireless with Unlimited Numbers of Base Station Antennas.
IEEE Trans. Wirel. Commun. 2010, 9, 3590–3600. [CrossRef]
3. Larsson, E.G.; Edfors, O.; Tufvesson, F.; Marzetta, T.L. Massive MIMO for next generation wireless systems.
IEEE Commun. Mag. 2014, 52, 186–195. [CrossRef]
4. Lu, L.; Li, G.Y.; Swindlehurst, A.L.; Ashikhmin, A.; Zhang, R. An Overview of Massive MIMO: Beneﬁts and
Challenges. IEEE J. Sel. Top. Signal Process. 2014, 8, 742–758. [CrossRef]
5. Wang, D.; Zhang, Y.; Wei, H.; You, X.; Gao, X.; Wang, J. An overview of transmission theory and techniques of
large-scale antenna systems for 5G wireless communications. Sci. China Inf. Sci. 2016, 59, 081301. [CrossRef]
6. Zhang, J.; Wen, C.; Jin, S.; Gao, X.; Wong, K. On Capacity of Large-Scale MIMO Multiple Access Channels
with Distributed Sets of Correlated Antennas. IEEE J. Sel. Areas Commun. 2013, 31, 133–148. [CrossRef]
7. Rusek, F.; Persson, D.; Lau, B.K.; Larsson, E.G.; Marzetta, T.L.; Edfors, O.; Tufvesson, F. Scaling Up MIMO:
Opportunities and Challenges with Very Large Arrays. IEEE Signal Process. Mag. 2013, 30, 40–60. [CrossRef]
8. Li, J.; Wang, D.; Zhu, P.; Wang, J.; You, X. Downlink Spectral Efﬁciency of Distributed Massive MIMO
Systems With Linear Beamforming Under Pilot Contamination. IEEE Trans. Veh. Technol. 2018, 67, 1130–1145.
[CrossRef]
9. Björnson, E.; Matthaiou, M.; Debbah, M. Massive MIMO with non-ideal arbitrary arrays: Hardware scaling
laws and circuit-aware design. IEEE Trans. Wirel. Commun. 2015, 14, 4353–4368. [CrossRef]
10. Zhu, H. Performance Comparison Between Distributed Antenna and Microcellular Systems. IEEE J. Sel.
Areas Commun. 2011, 29, 1151–1163. [CrossRef]
11. You, X.H.; Wang, D.M.; Sheng, B.; Gao, X.Q.; Zhao, X.S.; Chen, M. Cooperative distributed antenna systems
for mobile communications. IEEE Wirel. Commun. 2010, 17, 35. [CrossRef]
12. Dai, L. A Comparative Study on Uplink Sum Capacity with Co-Located and Distributed Antennas. IEEE J.
Sel. Areas Commun. 2011, 29, 1200–1213. [CrossRef]

34
Electronics 2018, 7, 391

13. Bai, Q.; Nossek, J.A. Energy efficiency maximization for 5G multi-antenna receivers. Trans. Emerg.
Telecommun. Technol. 2015, 26, 3–14. [CrossRef]
14. Walden, R.H. Analog-to-digital converter survey and analysis. IEEE J. Sel. Areas Commun. 1999, 17, 539–550.
[CrossRef]
15. Singh, J.; Dabeer, O.; Madhow, U. On the limits of communication with low-precision analog-to-digital
conversion at the receiver. IEEE Trans. Commun. 2009, 57, 3629–3639. [CrossRef]
16. Fan, L.; Jin, S.; Wen, C.; Zhang, H. Uplink Achievable Rate for Massive MIMO Systems With
Low-Resolution ADC. IEEE Commun. Lett. 2015, 19, 2186–2189. [CrossRef]
17. Qiao, D.; Tan, W.; Zhao, Y.; Wen, C.; Jin, S. Spectral efficiency for massive MIMO zero-forcing receiver with
low-resolution ADC. In Proceedings of the 2016 8th International Conference on Wireless Communications
Signal Processing (WCSP), Yangzhou, China, 13–15 October 2016; pp. 1–6. [CrossRef]
18. Verenzuela, D.; Bjrnson, E.; Matthaiou, M. Hardware design and optimal ADC resolution for uplink
massive MIMO systems. In Proceedings of the 2016 IEEE Sensor Array and Multichannel Signal Processing
Workshop (SAM), Rio de Janerio, Brazil, 10–13 July 2016; pp. 1–5. [CrossRef]
19. Sarajlić, M.; Liu, L.; Edfors, O. When Are Low Resolution ADCs Energy Efficient in Massive MIMO?
IEEE Access 2017, 5, 14837–14853. [CrossRef]
20. Roth, K.; Nossek, J.A. Achievable Rate and Energy Efficiency of Hybrid and Digital Beamforming Receivers
With Low Resolution ADC. IEEE J. Sel. Areas Commun. 2017, 35, 2056–2068. [CrossRef]
21. Li, J.; Wang, D.; Zhu, P.; You, X. Uplink Spectral Efficiency Analysis of Distributed Massive MIMO With
Channel Impairments. IEEE Access 2017, 5, 5020–5030. [CrossRef]
22. Li, J.; Wang, D.; Zhu, P.; You, X. Spectral efficiency analysis of single-cell multi-user large-scale distributed
antenna system. IET Commun. 2014, 8, 2213–2221. [CrossRef]
23. Ngo, H.Q.; Larsson, E.G.; Marzetta, T.L. Energy and Spectral Efficiency of Very Large Multiuser MIMO
Systems. IEEE Trans. Commun. 2013, 61, 1436–1449. [CrossRef]
24. Zuo, J.; Zhang, J.; Yuen, C.; Jiang, W.; Luo, W. Energy-Efficient Downlink Transmission for Multicell Massive
DAS With Pilot Contamination. IEEE Trans. Veh. Technol. 2017, 66, 1209–1221. [CrossRef]
25. Ngo, H.Q.; Tran, L.; Duong, T.Q.; Matthaiou, M.; Larsson, E.G. On the Total Energy Efficiency of Cell-Free
Massive MIMO. IEEE Trans. Green Commun. Netw. 2018, 2, 25–39. [CrossRef]
26. Björnson, E.; Sanguinetti, L.; Hoydis, J.; Debbah, M. Optimal Design of Energy-Efficient Multi-User MIMO
Systems: Is Massive MIMO the Answer? IEEE Trans. Wirel. Commun. 2015, 14, 3059–3075. [CrossRef]
27. Pham, Q.; Hwang, W. Fairness-Aware Spectral and Energy Efficiency in Spectrum-Sharing Wireless
Networks. IEEE Trans. Veh. Technol. 2017, 66, 10207–10219. [CrossRef]
28. Deng, L.; Rui, Y.; Cheng, P.; Zhang, J.; Zhang, Q.T.; Li, M. A Unified Energy Efficiency and Spectral Efficiency
Tradeoff Metric in Wireless Networks. IEEE Commun. Lett. 2013, 17, 55–58. [CrossRef]
29. Xu, L.; Yu, G.; Jiang, Y. Energy-Efficient Resource Allocation in Single-Cell OFDMA Systems: Multi-Objective
Approach. IEEE Trans. Wirel. Commun. 2015, 14, 5848–5858. [CrossRef]
30. Heath, W.H., Jr.; Wu, T.; Kwon, Y.H.; Soong, A.C.K. Multiuser MIMO in Distributed Antenna Systems With
Out-of-Cell Interference. IEEE Trans. Signal Process. 2011, 59, 4885–4899. [CrossRef]
31. Lim, Y.; Chae, C.; Caire, G. Performance Analysis of Massive MIMO for Cell-Boundary Users. IEEE Trans.
Wirel. Commun. 2015, 14, 6827–6842. [CrossRef]

35
electronics
Article
Channel Sounding for Multi-User Massive MIMO in
Distributed Antenna System Environment
Seoyoung Yu † and Jeong Woo Lee *,†
School of Electrical and Electronics Engineering, Chung-Ang University, Seoul 06974, Korea;
[email protected]
* Correspondence: [email protected]; Tel.: +82-2-820-5734
† Current address: 84 Heukseok-ro, Dongjak-gu, Seoul 06974, Korea.

Received: 20 November 2018; Accepted: 21 December 2018; Published: 1 January 2019

Abstract: We propose a generation scheme for a sounding reference signal (SRS) suitable for
supporting a large number of users in massive multi-input multi-output (MIMO) system with
a distributed antenna system (DAS) environment. The proposed SRS can alleviate the pilot
contamination problem which occurs inherently in the multi-user system due to the limited number
of orthogonal sequences. The proposed SRS sequence is generated by applying a well-chosen phase
rotation to the conventional LTE/LTE-A SRS sequences without requiring an increased amount
of resource usage. We also propose using the correlation-aided channel estimation algorithm as a
supplemental scheme to obtain more reliable and reﬁned channel estimation. It is shown that the
proposed SRS sequence and the supplemental channel estimation scheme improve signiﬁcantly the
channel estimation performance in multi-user massive MIMO systems.

Keywords: massive multi-input multi-output (MIMO); distributed antenna systems (DAS); sounding
reference signal (SRS); channel estimation

1. Introduction
It is expected that the amount of mobile wireless traffic in 2020 will be 1000 times higher than that of
2010 [1–3]. Along with the dramatic growth in the demand for wireless communications, performance
requirements for data rate, spectral efficiency and energy efficiency are also getting higher [4–6].
To meet these growing demand and performance requirements for wireless communications, massive
multi-input multi-output (m-MIMO) technology was proposed as one of the key technologies for the
next generation cellular networks, known as fifth generation (5G) systems [7–10]. It is known that
m-MIMO systems, whose transmitter or receiver is equipped with massive number of antennas, can
improve the spectral efficiency and save energy in wireless communication systems [10–12]. As a result,
m-MIMO systems have recently attracted many researchers and engineers in many aspects. Multi-user
m-MIMO technology, in which a base station (BS) uses a large number of antennas to serve many pieces
of user equipment (UE) simultaneously on the same time-frequency resource, is one example that is
actively studied to be practically adopted in 5G systems [13–15]. The distributed antenna system (DAS)
has also been considered a key technology for feasible deployment of 5G systems [16–18]. In DAS
configuration, there exist a digital unit (DU) and multiple radio units (RUs) in a cell, where each RU is
connected to DU via fiber optic links. The DU manages RUs in a centralized manner, by which RUs
can transmit and receive signals in a cooperative manner.
The key requirement for enjoying the benefit of m-MIMO technology is to obtain the accurate
channel state information (CSI) for each link at the BS, or at RUs in DAS environment. In the
frequency-division duplex (FDD) approach, UEs estimate downlink (DL) channels by using DL
pilot signals, or sounding reference signals (SRS), transmitted from the BS [19]. The required number
of DL pilots in an FDD based approach is proportional to the number of BS antennas multiplied by the

Electronics 2019, 8, 36; doi:10.3390/electronics8010036 37 www.mdpi.com/journal/electronics

Electronics 2019, 8, 36

number of served UEs, which complicates the adoption of such a DL channel estimation in massive
MIMO environments. Thus, the m-MIMO systems typically employ the time-division duplex (TDD)
approach to estimate the DL channel. In the TDD approach, UEs estimate DL channels by sending
mutually orthogonal uplink (UL) pilot signals (SRS) to BS based on the DL and UL channel reciprocity
within the channel coherence interval [9]. The total number of UL pilots required in such a TDD based
approach is proportional to the number of served UEs irrespective of the number of BS antennas [7,9].
For a given length of sequence, say M, we may generate at most M orthogonal sequences.
Conventionally, pilot sequences are mutually orthogonal, so the maximum number of pilot sequences
is limited by M. If the number of UEs in simultaneous service exceeds M, we need to reuse some or
all of the already generated orthogonal sequences. This results in the so-called pilot contamination
problem [20] due to the violation of orthogonality between pilot sequences. The pilot contamination
mainly limits the performance improvement of m-MIMO systems [7,15]. Most of the prior works
that proposed to resolve this problem have considered the use of mutually orthogonal SRS sequences.
They include pilot signal coordination [21], blind channel estimation with data samples [22] and
cooperative multi-cell precoding in m-MIMO systems [23]. The number of mutually orthogonal SRS
sequences is mainly limited by the length of the base sequence. In Long Term Evolution (LTE)/Long
Term Evolution-Advanced (LTE-A) systems, the number of orthogonal SRS sequences is 16, which is
not suitable for serving a large number of users in m-MIMO environments [24,25]. Using long pilot
sequences may reduce the pilot contamination, but they replace data sequences and reduce channel
spectral efﬁciency and throughput. If pilot sequences are made too long, they may occupy even the
restricted band, which must be prevented. Thus, it may be desirable to generate a larger number of
SRS sequences without increasing the sequence length.
For this purpose, we propose a mechanism for generating SRS sequences with a lower level of
pilot contamination which is suitable for serving a large number of UEs. In the proposed mechanism,
a phase rotation is applied to the base sequence without increasing the sequence length. The resultant
SRS sequences may be mutually correlated and thus still incur a pilot contamination as the number
of UEs grows and exceeds the length of base sequence. This results in high channel estimation error
when a linear estimation based on the orthogonality of SRS sequences is used. To resolve this problem,
we propose a two-step channel estimation algorithm by which the least square (LS) estimation [26] is
ﬁrst applied and the minimum mean squared error (MMSE) estimation [27] is additionally applied
only to the group of UEs using mutually correlated SRS sequences. The proposed correlation-aided
channel estimation shows the improved performance in the channel estimation. It is shown that the
proposed SRS combined with the supplemental channel estimation algorithm guarantees lower mean
squared error (MSE) in channel estimation, which alleviates the pilot contamination problem.
The rest of the paper is organized as follows. In Section 2, we introduce the system model in
consideration. In Section 3, a brief review of SRS sequences in conventional LTE/LTE-A systems is
provided. We introduce the generation of proposed SRS sequences and the analysis on the resultant
correlation in Section 4. In Section 5, we propose the two-step channel estimation algorithm composed
of an LS estimation followed by a supplemental MMSE estimation. We also formulate and analyze the
MSE obtained for SRS sequences with an LS estimation. We verify the performances in various aspects
by computer simulations in Section 6 and conclude this paper in Section 7.

2. System Model
Consider a cell having a DU and R RUs each of which serves K UEs as shown in Figure 1,
where each RU has NT transmit and receive antennas. We index RU by r ∈ {0, · · · , R − 1}, and index
the UE served by RU r as r (k ), k ∈ {0, · · · , K − 1}. We consider orthogonal frequency division
multiplexing (OFDM) communications with Nc subcarriers between UE and RU. We suppose the
channel reciprocity, by which the DL channel from RU to UE can be estimated by using the UL pilots
sent from UEs under the constraint that the time delay from the UL channel estimation to the DL
transmission is less than the coherence time of the channel [7]. Then, the UL channel estimated

38
Electronics 2019, 8, 36

by sending SRS sequence from each UE to RU is used as the DL channel estimation. Let sr(k) =
[sr(k) [0] · · · sr(k) [ M − 1]] T denote the SRS sequence of UE r (k), where the superscript T denotes a
transpose of a vector. We also let hr(k),r [m] ∈ C NT ×1 denote the channel gain between UE r (k ) and
RU r over the m-th subcarrier. Then, the channel gains corresponding to M subcarriers, hr(k),r [m],
m = 0, · · · , M − 1, are estimated by using an SRS sequence.
The signal received by RU r over the subcarrier m is denoted by yr [m] ∈ C NT ×1 and obtained by

K −1 R −1 K −1
yr [ m ] = ∑ hr(k),r [m]sr(k) [m] + ∑ ∑ hr (k),r [m]sr (k) [m] + nr [m], (1)
k =0 r =0,r =r k =0

where nr [m] ∈ C NT ×1 is the zero-mean additive white Gaussian noise vector with covariance matrix
σn2 I NT × NT . Note that hr(k),r [m] = β r(k),r gr(k),r [m], where β r(k),r represents the large scale fading
while each entry of gr(k),r [m] ∈ C NT ×1 denotes the small scale fading represented by an independent
and identically distributed (i.i.d.) zero mean complex Gaussian random variable with unit variance.
Note that the large scale fading factor β2r(k),r = dr−(ka),r , where dr(k),r is a distance between UE r (k ) and
RU r , and a is an attenuation factor.

&HOO

58U

KU VU N
N U
8(U KU 58Uc
N Uc

UN
U

8(Uc
Uc N Uc

Figure 1. Multi-user m-MIMO conﬁguration in DAS environments.

3. Conventional Channel Sounding Reference Signal in LTE/LTE-A Systems

In the following, we brieﬂy introduce the generation of SRS in conventional LTE/LTE-A
systems [24,25]. The structure of SRS symbol is illustrated in Figure 2. Basically, the SRS sequence is
generated by a cyclic shift of a base sequence, which is obtained from Zadoff–Chu sequence [24,25]
as presented below. Let Nsc RB be the number of subcarriers per RB, where N RB = 12 in LTE/LTE-A
sc
systems, and L be the number of assigned subcarriers for SRS or sounding bandwidth. Let D be the
decimation factor which is the number of SRS sequences sharing the allocated sounding bandwidth,
where the length of SRS sequence is M = L/D. Note that L is the multiple of Nsc RB , i.e., L = n · N RB ,
sc
1 ≤ n ≤ NRB
UL , where N UL is the uplink system bandwidth in terms of RBs. Let L denote the length of
RB z
Zadoff–Chu sequence used to generate the base sequence of length M, where Lz is given by the largest
prime number such that Lz < M.

39
Electronics 2019, 8, 36

Let xq [m] denote the q-th root Zadoff–Chu sequence deﬁned by

πqm(m + 1)
xq [m] = exp − j , 0 ≤ m < Lz , (2)
Lz
where

q = q̄ + 0.5 + v(−1)2q̄ ,

(3)
q̄ = Lz (u + 1)/31,

with u ∈ {0, 1, · · · , 29} and v = 0 if M = nNsc

RB , n ≤ 5 and v = 0, 1 if M = nN RB , n ≥ 6. The base
sc
sequence x̄ [m] is obtained by

x̄ [m] = xq [m mod Lz ], 0 ≤ m < M. (4)

Note that base sequences are divided into groups, where u is the group number associated with
the physical cell ID and the length of the SRS sequence, and v is the base sequence number within the
group. The SRS sequence x (α) [m] of length M is deﬁned by applying a cyclic shift α, α ∈ {0, 1 · · · , 7},
to the base sequence x̄ [m] as
α
x (α) [m] = e j2π Lc m x̄ [m], 0 ≤ m < M, (5)
j2π Lαc
where Lc > 7 to obtain distinct values of e for different α ∈ {0, 1 · · · , 7}. For any αi and α j chosen
from {0, · · · , 7}, two sequences x(αi ) = [ x (αi ) [0] · · · x (αi ) [ M − 1]] T and x(α j ) = [ x (α j ) [0] · · · x (α j ) [ M − 1]] T
1 ( αi ) H ( α j )
are orthogonal if Mx x = δi− j , where δi− j = 1 if i = j and 0 otherwise, and the superscript H
M −1 j L c ( α j − α i ) m 2π
denotes a conjugate transpose of a vector. This condition reduces to M 1
∑m =0 e = δi− j by
using Equation (5) and the property | x̄ [m]| = 1 for all m, which is clear from Equations (2)–(4).
2

This condition is satisﬁed only if (α j − αi ) LMc is an even integer, or, in other words, Lc is a factor
of (α j − αi ) M
2 , for αi = α j ∈ {0, · · · , 7}. Consequently, Lc must be an integer that is greater than 7 and
is a factor of 12 M.

VXEFDUULHU

1 VF5%
/
IUHTXHQF\

WLPH 656V\PERO

Figure 2. SRS symbol structure.

Multiple SRS sequences are deﬁned from a single base sequence by using different values of
α and the decimation factor. In LTE/LTE-A, the decimation factor of two is used and the signal
occupies every second subcarrier within the allocated sounding bandwidth. By using distinct SRS
sequences obtained with different values of α and by using distinct sets of subcarriers as a result of
decimation, multiple UEs can estimate their channel gains and can be served by RU simultaneously.

40
Electronics 2019, 8, 36

In conventional LTE/LTE-A systems using α ∈ {0, · · · , 7} and the decimation factor of two, we can
obtain only 16 orthogonal sequences. In the multi-user systems with higher number of UEs than 16,
we need to design a larger set of SRS sequences having low cross-correlation.

4. Proposed Channel Sounding Reference Signal for Multi-User Systems

The lack of orthogonal SRS sequences may cause the pilot contamination problem. The easy way
to alleviate this is using longer SRS sequences or wider sounding bandwidth. However, this approach
may result in the degraded performance of channel estimation in the frequency selective environment
and the lower spectral efﬁciency and throughput because SRS sequences replace data sequences.
Moreover, if the sounding bandwidth is too wide, SRS sequences may occupy the restricted band,
which must be prohibited. Thus, we aim to generate a new set of SRS sequences showing reduced
pilot contamination without increasing the sounding bandwidth or sequence length. We apply phase
rotation to the LTE/LTE-A SRS sequences, which is introduced in Section 3 to generate a new SRS
sequence as

+ Lαc m
p p
M + Lp
j2π L p m (α) j2π s
x (α,s,p) [m] e j2π M m e
s
x [m] = e x̄ [m], 0 ≤ m < M, (6)

where 0 ≤ s < LMc and 0 ≤ p < L p , and the last equality comes from Equation (5). Note that Lc is an
integer greater than 7 which can divide 12 M as introduced in Section 3 and L p is a prime number which
is smaller than Lc . For a given SRS sequence length M, we may generate up to M orthogonal sequences.
However, by using Equation (5), we can generate only eight orthogonal sequences with varying
s α
α = 0, · · · , 7. Thus, we use the phase rotation e j2π M m , 0 ≤ s < LMc , together with e j2π Lc m , 0 ≤ α < Lc ,
to generate M orthogonal SRS sequences without incurring pilot contamination. If the number of
UEs exceeds M, we need to generate extra SRS sequences instead of reusing already generated ones.
p
j2π m
For this purpose, we apply additional phase rotation e L p , 0 ≤ p < L p , where L p needs to be
coprime with M and thus be also coprime with Lc in order to make the resultant sequences distinct
from already generated M sequences. We empirically found that a prime number L p smaller than Lc
results in a good performance.
By Equations (2)–(4), we can rewrite Equation (6) as
⎧ ! "
⎨exp j2π p α qm(m+1)
(α,s,p)
s
+ + Lc m − , for 0 ≤ m < Lz ,
x [m] = ! M Lp 2Lz " (7)
⎩exp j2π p α q(m− Lz )(m− Lz +1)
M + + Lc m − for Lz ≤ m < M.
s
Lp 2Lz ,

The correlation of SRS sequences sr(k) and sr ( j) is deﬁned and expanded as

M −1 M−1 − j2π s−s +
p− p
+ α− α
1 1 1 m
∑ sr∗(k) [m]sr ( j) [m] ∑
M Lp Lc
Cr(k),r ( j) sr ( k ) H sr ( j ) = = e , (8)
M M m =0
M m =0

where sr(k) [m] = x (α,s,p) [m], sr ( j) [m] = x (α ,s ,p ) [m] and the superscript ∗ represents the complex
conjugate of a complex variable. The detailed derivation of Equation (8) is provided in Appendix A.
It is clear that Cr(k),r ( j) obtained with s = s , p = p and α = α corresponds to the auto-correlation of
M −1 j ·0
sr(k) because sr ( j) = sr(k) , where Cr(k),r(k) = M
1
∑m =0 e = 1. The cross-correlation Cr(k),r ( j) = 0 if
p = p and either s = s or α = α , while Cr(k),r ( j) = 0 if p = p by the following reason.
M−1 j2πφm
Consider f (φ) = M 1
∑m =0 e , where f (φ) = 0 if φM is a nonzero integer, and f (φ) = 0,
−s
otherwise. If p = p , the rightmost side of Equation (8) becomes f s M + α L−c α . If s = s or α = α ,
s −s α −α
M + Lc M is a nonzero integer because M is a multiple of Lc as introduced in Section 3, and thus
−s −s
Cr(k),r ( j) = f s M + α L−c α = 0. On the other hand, if p = p , s M + p L−p p + α L−c α M cannot be a

41
Electronics 2019, 8, 36

s −s p −p
nonzero integer because ( p − p) M is not divisible by L p , and, thus Cr(k),r ( j) = f M + Lp + α L−c α =
0, where L p and M are coprime and | p − p | < L p .
Consequently, for each p, we can generate a set of M orthogonal SRS sequences, where 0 ≤ s < LMc
and 0 ≤ α < Lc enables the generation of M orthogonal sequences. Applying the phase rotation
p
j2π m
e L p , 0 ≤ p < L p , in Equation (7) enables to obtain L p sets of M orthogonal sequences. Any two
sequences obtained with different values of p are mutually correlated.
Distinct SRS sequences generated with different α, s and p by Equation (7) are assigned to different
UEs. The ﬁrst set of M orthogonal sequences generated with p = 0 are assigned to the ﬁrst M UEs.
Then, the next set of M orthogonal sequences generated with p = 1 are assigned to the next M UEs.
This procedure is repeated by increasing p until all UEs are assigned SRS sequences.

5. Channel Estimation

5.1. Least Square Channel Estimation

Let ĥr(k),r [m] denote the estimation of the channel gain hr(k),r [m]. We apply the LS estimation
algorithm [26] to the received signal of SRS sequence for the channel estimation. In this process,
we assume a block fading with length M, by which the channel is considered invariant over consecutive
M subcarriers. Then, the estimation of channel gain at the m-th subcarrier in a fading block, ĥr(k),r [m],
can also be denoted by ĥr(k),r and obtained as

M −1
1
ĥr(k),r [m] = ĥr(k),r = ∑ sr∗(k) [i ]yr [i ]. (9)
M i =0

By using Equation (1), we can rewrite Equation (9) as

#
1 M −1 ∗ K −1 R −1 K −1

M i∑ ∑ hr( j),r [i]sr( j) [i] + ∑ ∑ hr ( j),r [i]sr ( j) [i] + nr [i]

ĥr(k),r = sr ( k ) [i ]
=0 j =0 r =0,r =r j=0
# (10)
M −1 K −1 R −1 K −1
1
= ∑ hr(k),r [i ] + ∑ hr( j),r [i ]sr∗(k) [i ]sr( j) [i ] + ∑ ∑ hr ( j),r [i]sr∗(k) [i]sr ( j) [i] + sr∗(k) [i]nr [i] ,
M i =0 j=0,j =k r =0,r =r j=0

where sr∗(k) [i ]sr(k) [i ] = |sr(k) [i ]|2 = 1 is used. If the channel is actually block faded with length M,
the channel gain hr(k),r [m] is identical for all m = 0, · · · , M − 1 and thus, we can represent the channel
gain by hr(k),r . Then, Equation (10) can be rewritten as

K −1 R −1 K −1 M −1
1
ĥr(k),r = hr(k),r + ∑ hr( j),r Cr(k),r( j) + ∑ ∑ hr ( j),r Cr(k),r ( j) + M ∑ sr∗(k) [i ]nr [i ]. (11)
j=0,j =k r =0,r =r j=0 i =0

In case that all SRS sequences of UEs in the cell are mutually orthogonal, we have zero
cross-correlation between any two SRS sequences so that Equation (11) is simpliﬁed as

M −1
1
ĥr(k),r = hr(k),r + ∑ sr∗(k) [i ]nr [i ]. (12)
M i =0

We deﬁne the normalized mean squared error (MSE) of the channel estimation between UE r (k )
and RU r at subcarrier m as

Eĥr(k),r − hr(k),r [m]2

(k) [ m ] =
2
σMSE,r . (13)
β2r(k),r NT

Then, we deﬁne the average MSE by

42
Electronics 2019, 8, 36

K −1 R −1 M −1
1
2
σ̄MSE =
KRM ∑ ∑ ∑ 2
σMSE,r (k) [ m ] (14)
k =0 r =0 m =0

and use this as the performance metric of channel estimation. Note that, if the channel is actually
block faded with length M, the normalized MSE can be denoted by σMSE,r
2
(k) and Equation (14) can be
simpliﬁed as σ̄MSE
2 = KR
1
∑kK=−01 ∑rR=−01 σMSE,r
2
(k) . Under the assumption of block fading with length M,
the average MSE can be obtained from Equations (11), (13) and (14) as

K −1 R −1 Eĥr(k),r − hr(k),r 2
1
2
σ̄MSE =
KR ∑ ∑ β2r(k),r NT
k =0 r =0
# (15)
K −1 R −1 K −1 R −1 K −1
1 1 1 2
=
KR ∑ ∑ β2r(k),r
∑ β2r( j),r |Cr(k),r( j) |2 + ∑ ∑ β2r ( j),r |Cr(k),r ( j) |2 + σ
M n
,
k =0 r =0 j=0,j =k r =0,r =r j=0

where the detailed derivation is given in Appendix B.

Let us predict analytically the average MSE under the block fading environment. The large scale
fading factors are assumed to be β r(k),r = β 1 and β r (k),r = β 2 for all k, r and r , r = r, which means
that the large scale fading between UE and serving RU is represented by β 1 while the large scale fading
between UE and other neighboring RUs are represented by β 2 . We suppose β 2 < β 1 because UE is
usually served by a nearly located RU. Then, Equation (15) can be rewritten as
#
1 K −1 R −1 K −1 β22 R−1 K −1 σn2
KR k∑ ∑ ∑ r(k),r( j) ∑ ∑ r(k),r ( j)
2
σ̄MSE = | C | 2
+ | C | 2
+ . (16)
=0 r =0 j=0,j =k β21 r =0,r =r j=0 β21 M

We consider the reuse of M orthogonal SRS sequences to UEs repeatedly without applying the
p
j2π m
phase rotation e L p in Equation (7). We suppose that each RU in the cell serves the equal number
of UEs and distinct set of M
R orthogonal sequences are assigned repeatedly to UEs in each RU. Then,
we obtain a correlation matrix C ∈ C KR×KR of SRS sequences whose (i, j)-th entry is deﬁned by

1, if |i − j| = kM, k = 0, · · · , KR/M,
Cij = (17)
0, else.

Every other M R column and row of C correspond to UEs in the same RU. Due to the reuse of
M
R orthogonal sequences for each RU, SRS sequences used for different RUs are always mutually
orthogonal. It follows that Cr(k),r ( j) = 0 for all k, j and r = r and Equation (16) becomes

K −1 R −1 K −1
1 σn2
2
σ̄MSE =
KR ∑ ∑ ∑ |Cr(k),r( j) |2 +
β21 M
. (18)
k =0 r =0 j=0,j =k

From Equation (17) and the allocation rule of orthogonal sequences to UEs introduced above,
we obtain
⎧
1 K −1 R −1 K −1 ⎨0, if KR ≤ M,
∑ ∑ ∑
KR k=0 r=0 j=0,j =k
|Cr(k),r( j) | =
2
⎩ 1 ∑ M 2(KR − iM), if KR > M.
KR (19)
KR i =1

Consequently, we can rewrite Equation (18) as

⎧
⎨ σn2
,
β21 M
if KR ≤ M,
2
σ̄MSE = (20)
⎩ 2 − M KR σn2
M +1 + if KR > M.
KR
M KR β21 M
,

43
Electronics 2019, 8, 36

Note that σ̄MSE

2 is independent of β22 while it depends on β21 . Recall that β22 is determined by
the distance between a UE and its neighboring RU. Thus, the performance of channel estimation
obtained by reusing orthogonal sequences repeatedly is not affected by how far neighboring RUs
are located from UE. It is also clear from Equation (20) that the average MSE decreases as M grows.
Since the number of orthogonal sequences is limited for a given sequence length, the number of UEs
served simultaneously in a cell is limited. In the multi-user systems, a larger number of UEs must be
accommodated, so we may need to use non-orthogonal sequences for channel sounding.

5.2. Supplemental Correlation-Aided Channel Estimation

As the number of UEs grows in the cell, it is impossible to assign orthogonal SRS sequences to all
UEs even by using the proposed SRS sequences because the number of orthogonal sequences is limited
to M. Thus, we propose using the supplemental channel estimation scheme to enhance the channel
estimation performance even with correlated SRS sequences. Suppose DU knows SRS sequences and
their cross-correlations. We rewrite Equation (11) as

R −1 K −1 M −1
1
ĥr(k),r = ∑ ∑ hr ( j),r Cr(k),r ( j) + M ∑ sr∗(k) [i ]nr [i ], (21)
r =0 j =0 i =0

where 0 ≤ r < R and 0 ≤ k < K. This can be expressed by using matrix forms as

Ĥ = HC + N, (22)

where Ĥ ∈ C NT ×KR , H ∈ C NT ×KR and N ∈ C NT ×KR are matrices whose columns are ĥr(k),r , hr(k),r
and nr(k),r , respectively, 0 ≤ r < R and 0 ≤ k < K. We let C ∈ C KR×KR denote a correlation matrix
whose entries are Cr(k),r ( j) , where 0 ≤ r, r < R and 0 ≤ k, j < K. We define two classes of UEs.
The first one is the set of UEs that are assigned mutually orthogonal SRS sequences. UEs assigned
mutually correlated SRS sequences compose the second class.
We reorder Ĥ, H, C and N and partition
I 0
them as Ĥ = Ĥu Ĥc , H = Hu Hc , C = and N = Nu Nc , where submatrices
0 A
with subscripts u and c correspond to the first and the second class of UEs, respectively, and A is a
non-diagonal matrix. Then, Equation (22) can be rewritten in partitioned forms as

I 0
Ĥu Ĥc = Hu Hc + Nu Nc , (23)
0 A

which implies

Ĥu = Hu + Nu , (24)

Ĥc = Hc A + Nc . (25)

It is clear from Equation (24) that ĥr(k),r = hr(k),r + nr(k),r for the first class of UEs. So, ĥr(k),r
is considered to estimate sufficiently the actual channel hr(k),r of UE r (k ) in the first class. On the
other hand, as can be seen from Equation (25), for the second class of UEs, ĥr(k),r includes the linear
combination of other UEs’ channels as well. Thus, we need a supplemental procedure to obtain
the more reliable channel estimations for UEs in the second class. For this purpose, we propose the
correlation-aided channel estimation which applies the MMSE algorithm [27] to the estimated channels
for the second class of UEs as the supplemental procedure. We obtain the refined channel estimation
˜ for UEs in the second class by multiplying the MMSE nulling matrix W to Ĥ as
Ĥ c c

˜ = Ĥ W,
Ĥ (26)
c c

44
Electronics 2019, 8, 36

where −1
W = A H AA H + B−1 I (27)

and B is a diagonal matrix whose diagonal entries are β2r(k),r corresponding to UEs in the second
class. Then, after ˜
refinement of Ĥc to Ĥc by Equation (26), the channel estimation Ĥ is replaced by
˜
Ĥ = Ĥu Ĥc .
In summary, first, we apply the LS estimation to obtain ĥr(k),r for all UEs. Next, we define the
first and the second classes of UEs based on the correlation matrix C. The first class is defined by UEs
whose SRS sequences are mutually orthogonal. The remaining UEs form the second class. Then, for the
second class of UEs, we apply additionally the MMSE algorithm to the channel estimation obtained by
LS scheme as the supplemental procedure. Finally, we use the result of LS estimation for the first class
and the result of supplement estimation for the second class as the channel estimation. If the number
of UEs is less than or equal to the number of mutually orthogonal SRS sequences, we only need to
perform a LS estimation as introduced in Equation (9). Otherwise, we need to apply the proposed
correlation-aided supplemental estimation after performing the LS estimation.

6. Numerical Results
We evaluate the performances of the proposed SRS and the proposed supplemental channel
estimation algorithm in terms of MSE in the multi-user m-MIMO DAS environment. For comparison,
the MSE obtained by using conventional LTE/LTE-A SRS scheme is also shown. As the proposed
scheme, we consider the following three types:

(a) using repeatedly M orthogonal sequences generated by varying α and s without applying the
p
j2π L p m
phase rotation e in Equation (7),
p
j2π m
(b) using the phase rotation e L p together with varying α and s for SRS generation, but not using
the supplemental correlation-aided channel estimation,
p
j2π m
(c) using the phase rotation e L p together with varying α and s for SRS generation, and the
supplemental correlation-aided channel estimation as well.

We consider the large scale fading factor between UEs and serving RU as β 1 and the large scale
fading factor between UEs and other neighboring RUs as β 2 , i.e., β r(k),r = β 1 and β r(k),r = β 2 for all k,
r and r = r. We let β 2 /β 1 = 0.7692, which comes from the assumption that the distance of UE from
serving RU is 1.3 times shorter than the distance from other neighboring RUs, where the attenuation
factor is assumed to be a = 2. We let each RU serve the same number of UEs. We choose Lc = 8 in
p
j2π m
Equations (5) and (7), and L p = 3 when applying the phase rotation e L p in types (b) and (c) of
the proposed scheme. We consider a block fading channel, International Telecommunication Union
Radiocommunication Sector (ITU-R) Ped A and ITU-R Veh A channels [28], where ITU-R Ped A [29]
and ITU-R Veh A channels are examples of frequency selective channels. The simulation parameters
used for performance evaluation are listed in Table 1.
We depict the average MSE of channel estimation with respect to the total number of UEs
in the cell obtained for block fading channel, ITU-R Ped A channel and ITU-R Veh A channel in
Figures 3–5, respectively. In each ﬁgure, we plot the average MSE obtained by using conventional SRS
in LTE/LTE-A systems and the proposed scheme of three types.
We also plot the average MSE obtained by applying supplemental MMSE estimation to all UEs
instead of applying MMSE selectively only to UEs belonging to the second class, where the phase
p
j2π m
rotation e L p is applied and the corresponding curves are marked with ‘nonselective supplemental
MMSE.’ In Figure 3, we include the average MSE predicted analytically by Equation (20) for the
proposed scheme of type (a) over block fading channel.

45
Electronics 2019, 8, 36

Table 1. Parameters used in performance evaluation.

Parameters Values
number of subcarriers (Nc ) 1024
RB )
number of subcarriers per RB (Nsc 12
48 (proposed scheme)
SRS length (M)
24 (LTE/LTE-A with a decimation factor of 2)
sampling time 0.1 μs
average SNR (β21 /σn2 ) 0 dB
channel model block fading, ITU-R Ped A, ITU-R Veh A
number of RUs (R) 4
number of RU antennas (NT ) 8

block fading channel

0
average MSE [dB]

-5

-10

LTE/LTE-A
proposed, type (a)
-15
proposed, type (b)
proposed, type (c)
analytic prediction for type (a)
nonselective supplemental MMSE
-20
0 20 40 60 80 100 120
total number of UEs

Figure 3. Average MSE of channel estimation with respect to the total number of UEs over block fading channel.

ITU-R Ped A channel

0
average MSE [dB]

-5

-10

LTE/LTE-A
-15
proposed, type (a)
proposed, type (b)
proposed, type (c)
nonselective supplemental MMSE
-20
0 20 40 60 80 100 120
total number of UEs

Figure 4. Average MSE of channel estimation with respect to the total number of UEs over ITU-R Ped A channel.

46
Electronics 2019, 8, 36

ITU-R Veh A channel

2
average MSE [dB]

-2

-4

-6

-8 LTE/LTE-A
proposed, type (a)
proposed, type (b)
-10 proposed, type (c)
nonselective supplemental MMSE
-12
0 20 40 60 80 100 120
total number of UEs

Figure 5. Average MSE of channel estimation with respect to the total number of UEs over ITU-R Veh
A channel.

It is observed that the channel estimation MSE obtained by using any SRS scheme rapidly
increases when the number of UEs in service exceeds the number of orthogonal SRS sequences, where
the conventional LTE/LTE-A scheme and the proposed scheme generate 16 and M (=48) orthogonal
SRS sequences, respectively. For more than 48 UEs, the proposed scheme also provides much lower
channel estimation MSE than LTE/LTE-A scheme. Even reusing repeatedly M (=48) orthogonal SRS
sequences generated in type (a) can lower signiﬁcantly the channel estimation MSE for any number of
p
j2π m
UEs. It is observed that applying phase rotation e L p to SRS generation further lowers the channel
estimation MSE, and the use of supplemental selective MMSE estimation can even further improve
the channel estimation performance at the cost of increasing complexity. Note that the computational
complexity of supplemental selective MMSE estimation is, in general, O(n3 ) because it requires the
matrix inversion, where n is the dimension of a square matrix A. Considering, as a reference, the case
that M orthogonal sequences are repeatedly used to generate SRS sequences, the dimension of A is
determined from Equation (17) as n = min(KR, 2 max(0, KR − M)). Applying supplemental MMSE
estimation nonselectively to all UEs degrades the performance when the number of UEs is not high
enough. In Figure 3, the analytic prediction of MSE for block fading channel is observed to match
the numerical result very well. It is clear that the proposed scheme of type (c) shows the best channel
estimation performance in terms of MSE for all numbers of UEs. The MSE gain of the proposed scheme
of type (c) over LTE/LTE-A system is about 6 dB in block fading and ITU-R Ped A channels, and about
8 dB in ITU-R Veh A channel when serving 120 UEs through four RUs. The performance gain achieved
by using the proposed scheme can be observed in block fading channel as well as frequency selective
channel.
Conventional LTE/LTE-A systems may not effectively employ m-MIMO transmission for
the service of a large number of UEs mainly due to the pilot contamination problem. However,
the proposed SRS and the supplemental channel estimation can relieve the pilot contamination
problem in a signiﬁcant level. It is expected that the proposed scheme can be effectively adopted in
multi-user m-MIMO systems.

7. Conclusions
In this paper, we proposed a generation of SRS sequences resulting in improved channel estimation
performance without increasing the sequence length. The proposed SRS can easily be generated

47
Electronics 2019, 8, 36

by imposing a phase rotation to the base sequence. Even though the proposed SRS sequences
are mutually correlated when the number of UEs is higher than the sequence length, it lowers
signiﬁcantly the MSE of channel estimation and thus alleviates the pilot contamination problem.
We also proposed a supplemental correlation-aided channel estimation scheme to further improve
the channel estimation performance of multi-user m-MIMO technology in DAS environments. It is
expected that the proposed SRS scheme and the supplemental channel estimation scheme can be
effectively adopted in m-MIMO systems.

Author Contributions: Writing, J.W.L.; Software, S.Y.; Conceptualization, J.W.L.; Supervision, J.W.L.
Funding: This research was supported by the Chung-Ang University Graduate Research Scholarship in 2017
and by the Basic Science Research Program through the National Research Foundation of Korea funded by the
Ministry of Education (NRF-2016R1D1A1B03933174).
Conﬂicts of Interest: The authors declare no conﬂict of interest.

Appendix A. Derivation of Equation (8)

With sr(k) [m] = x (α,s,p) [m] and sr ( j) [m] = x (α ,s ,p ) [m], we obtain Equation (8) by using
Equation (7) as

M −1
1 1
Cr(k),r ( j) s Hs = ∑ sr∗(k) [m]sr ( j) [m]
M r (k) r ( j) M m =0

L z −1 qm(m+1)
+ Lαc m− 2Lz
qm(m+1) p
M + Lp
s
+ Lαc m− 2Lz
p
1 − j2π M + Lp
s j2π
=
M ∑ e e
m =0

M −1 q(m− Lz )(m− Lz +1) p
+ Lαc m−
q(m− Lz )(m− Lz +1)
M + Lp
s
+ Lαc m−
p
1 − j2π M + Lp
s j2π
∑
2Lz
+ e 2Lz
e
M m= Lz

M−1 − j2π s−s +
p− p
+ α− α
1 m
∑
M Lp Lc
= e .
M m =0

Appendix B. Derivation of Equation (15)

By using Equations (11), (13) and (14), we obtain Equation (15) as

K −1 R −1 Eĥr(k),r − hr(k),r 2
1
2
σ̄MSE =
KR ∑ ∑ β2r(k),r NT
k =0 r =0
$ $2
1 K −1 R −1 1 $ K −1 R −1 K −1
1 M −1 ∗ $
$ $
= ∑ ∑
KRNT k=0 r=0 β r(k),r
2
E $
$ ∑ h C
r ( j),r r (k ),r ( j) + ∑ ∑ h C
r ( j),r r (k ),r ( j) + ∑
M i =0
s r ( k ) [ i ] n r [ i ] $
$
j=0,j =k r =0,r =r j=0

1 K −1 R −1 1 K −1 R −1 K −1
= ∑ ∑
KRNT k=0 r=0 β r(k),r j=0,j =k
2 ∑ Ehr( j),r 2 |Cr(k),r( j) |2 + ∑ ∑ Ehr ( j),r 2 |Cr(k),r ( j) |2
r =0,r =r j=0
#
1 M −1
+ 2 ∑ Enr [i ] 2
M i =0
#
1 K −1 R −1 1 K −1 R −1 K −1
1 M −1
KRNT k∑ ∑ 2 ∑ NT βr( j),r |Cr(k),r( j) | + ∑ ∑ NT βr ( j),r |Cr(k),r ( j) | + M2 ∑ NT σn
= 2 2 2 2 2
=0 r =0 β r (k),r j=0,j =k r =0,r =r j=0 i =0
#
K −1 R −1 K −1 R −1 K −1
1 1 1
KR k∑ ∑ 2 ∑ β2r( j),r |Cr(k),r( j) |2 + ∑ ∑ β2r ( j),r |Cr(k),r ( j) |2 + M σn2 .
=
=0 r =0 β r (k),r j=0,j =k r =0,r =r j=0

48
Electronics 2019, 8, 36

References
1. Ge, X.; Tu, S.; Mao, G.; Wang, C.-X.; Han, T. 5G ultra-dense cellular networks. IEEE Wirel. Commun. 2016,
23, 72–79. [CrossRef]
2. Ge, X.; Cheng, H.; Guizani, M.; Han, T. 5G wireless backhaul networks: Challenges and research advances.
IEEE Netw. 2014, 28, 6–11. [CrossRef]
3. Nakamura, T.; Nagata, S.; Benjebbour, A.; Kishiyama, Y.; Hai, T.; Xiaodong, S.; Ning, Y.; Nan, L. Trends in
small cell enhancements in LTE Advanced. IEEE Commun. Mag. 2013, 51, 98–105. [CrossRef]
4. Chin, W.H.; Fan, Z.; Haines, R. Emerging technologies and research challenges for 5G wireless networks.
IEEE Commun. Mag. 2014, 21, 106–112. [CrossRef]
5. Angelis, C.T.; Chronopoulos, S.K. System performance of an LTE MIMO downlink in various fading
environments. In Proceedings of the Ambient Media and Systems, Berlin, Germany, 27–28 January 2011;
pp. 36–43._5. [CrossRef]
6. Votis, C.; Christoﬁlakis, V.; Raptis, V.; Tatsis, G.; Chronopoulos, S.K.; Kostarakis, P. Design and analysis of a
multiple-output transmitter based on DDS architecture for modern wireless communications. AIP Conf. Proc.
2010, 1203, 421–426. [CrossRef]
7. Marzetta, T. Noncooperative cellular wireless with unlimited numbers of base station antennas. IEEE Trans.
Wirel. Comm. 2010, 9, 3590–3600. [CrossRef]
8. Rusek, F.; Persson, D.; Lau, B.; Larsson, E.; Marzetta, T.; Edfors, O.; Tufvesson, F. Scaling up MIMO:
Opportunities and challenges with very large arrays. IEEE Signal Process. Mag. 2013, 30, 40–60. [CrossRef]
9. Larsson, E.G.; Tufvesson, F.; Edfors, O.; Marzetta, T.L. Massive MIMO for next generation wireless systems.
IEEE Commun. Mag. 2014, 52, 186–195. [CrossRef]
10. Pitarokoilis, A.; Mohammed, S.K.; Larsson, E.G. On the optimality of single-carrier transmission in large-scale
antenna systems. IEEE Wirel. Commun. Lett. 2012. 1, 276–279. [CrossRef]
11. Lu, L.; Li, G.Y.; Swindlehurst, A.L.; Ashikhmin, A.; Zhang, R. An overview of massive MIMO: Beneﬁts and
challenges. IEEE J. Sel. Top. Signal Process. 2013, 8, 742–758. [CrossRef]
12. Bjornson, E.; Larsson, E.G.; Marzetta, T.L. Massive MIMO: Ten myths and one critical question.
IEEE Commun. Mag. 2016, 54, 114–123. [CrossRef]
13. Zuo, J.; Zhang, J.; Yuen, C.; Jiang, W.; Luo, W. Multicell multiuser massive MIMO transmission with downlink
training and pilot contamination precoding. IEEE Trans. Veh. Technol. 2016, 65, 6301–6314. [CrossRef]
14. Ge, X.; Zi, R.; Wang, H.; Zhang, J.; Jo, M. Multi-user massive MIMO communication systems based on
irregular antenna arrays. IEEE Trans. Wirel. Commun. 2016, 15, 5287–5301. [CrossRef]
15. Gopalakrishnan, B.; Jindal, N. An analysis of pilot contamination on multi-user MIMO cellular systems with
many antennas. In Proceedings of the Signal Processing Advances in Wireless Communications (SPAWC),
San Francisco, CA, USA, 26–29 June 2011, doi:10.1109/SPAWC.2011.5990435.
16. Heath, R.; Peters, S.; Wang, Y.; Zhang, J. A current perspective on distributed antenna systems for the
downlink of cellular systems. IEEE Commun. Mag. 2013, 51, 161-167. [CrossRef]
17. Chen, X.; Zhang, Z.; Chen, H.-H. On distributed antenna systems with limited feedback precoding:
opportunities and challenges. IEEE Wirel. Commun. 2010, 17, 80-88. [CrossRef]
18. Yang, C.; Han, S.; Hou, X.; Molisch, A. How do we design CoMP to achieve its promised potential?
IEEE Wirel. Commun. 2013, 20, 67–74. [CrossRef]
19. Kobayashi, M.; Jindal, N.; Caire, G. Training and feedback optimization for multiuser MIMO downlink.
IEEE Trans. Commun. 2011, 59, 2228–2240. [CrossRef]
20. Elijah, O.; Leow, C.Y.; Abdul Rahman, T.; Nunoo, S.; Zakwoi Iliya, S. A comprehensive survey of pilot
contamination in massive MIMO-5G system. IEEE Commun. Surv. Tuts. 2016, 18, 905–923. [CrossRef]
21. Yin, H.; Gesbert, D.; Filippou, M.; Liu, Y. A coordinated approach to channel estimation in large-scale
multiple-antenna systems. IEEE J. Sel. Areas Commun. 2013, 31, 264–273. [CrossRef]
22. Ngo, H.Q.; Larsson, E.G. EVD-based channel estimations for multicell multiuser MIMO with very large
antenna arrays. In Proceedings of the 2012 IEEE International Conference on Acoustics, Speech and Signal
Processing (ICASSP), Kyoto, Japan, 25–30 March 2012; doi:10.1109/ICASSP.2012.6288608.
23. Jose, J.; Ashikhmin, A.; Marzetta, T.L.; Vishwanath, S. Pilot contamination and precoding in multi-cell TDD
systems. IEEE Trans. Wirel. Commun. 2011, 10, 2640–2651. [CrossRef]

49
Electronics 2019, 8, 36

24. 3GPP Technical Specification 36.211. Evolved Universal Terrestrial Radio Access (E-UTRA) Physical Channels
and Modulation. Available online: www.3gpp.org (accessed on 20 November 2018).
25. 3GPP Technical Specification 36.213. Evolved Universal Terrestrial Radio Access (E-UTRA) Physical Layer
Procedures. Available online: www.3gpp.org (accessed on 20 November 2018).
26. Coleri, S.; Ergen, M.; Puri, A.; Bahai, A. Channel estimation techniques based on pilot arrangement in OFDM
systems. IEEE Trans. Broadcast. 2002, 48, 223–229. [CrossRef]
27. Jafarkhani, H. Space-Time Coding: Theory and Practice; Cambridge University Press: Cambridge, UK, 2005;
pp. 229–230, ISBN 0-521-84291-3.
28. Recommendation ITR-R M.1225. Guidelines for Evaluation of Radio Transmission Technologies for IMT-2000;
Int. Telecommun. Union (ITU): Geneva, Switzerland, 1997.
29. Chronopoulos, S.K.; Christofilakis, V.; Tatsis, G.; Kostarakis, P. Performance of turbo coded OFDM under the
presence of various noise types. Wirel. Pers. Commun. 2016, 87, 1319–1336. [CrossRef]

c 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access
article distributed under the terms and conditions of the Creative Commons Attribution
(CC BY) license (http://creativecommons.org/licenses/by/4.0/).

50
electronics
Article
Efﬁcient Pilot Decontamination Schemes in 5G
Massive MIMO Systems
Omar A. Saraereh 1 , Imran Khan 2 , Byung Moo Lee 3, * and Ashraf Tahat 1
1 Communication Engineering Department, King Abdullah II School of Engineering, Princess Sumaya
University for Technology PSUT, Amman 11941, Jordan; [email protected] (O.A.S.);
[email protected] (A.T.)
2 Department of Electrical Engineering, University of Engineering & Technology, Peshawar 814, Pakistan;
[email protected]
3 School of Intelligent Mechatronics Engineering, Sejong University, Seoul 05006, Korea
* Correspondence: [email protected]

Received: 26 November 2018; Accepted: 27 December 2018; Published: 3 January 2019

Abstract: Massive Multiple-input Multiple-output (MIMO) is an emerging technology for the 5G

wireless communication systems which has the potential to provide high spectral efficient and
improved link reliability and accommodate large number of users. Aiming at the problem of
pilot contamination in massive MIMO systems, this paper proposes two algorithms to mitigate
it. The first algorithm is depending on the idea of Path Loss to perform User Grouping (PLUG)
which divide the users into the center and edge user groups depending on different levels of pilot
contamination. It assigns the same pilot sequences to the center users which slightly suffer from
pilot contamination and assign orthogonal pilot sequences to the edge users which severely suffer
from pilot contamination. It is assumed that the number of users at the edge of each cell is the
same. Therefore, to overcome such limitations of PLUG algorithm, we propose an improved PLUG
(IPLUG) algorithm which provides the decision parameters for user grouping and selects the number
of central and edge users in each cell in a dynamic manner. Thus, the algorithm prevents the wrong
division of users in good channel conditions being considered as an edge user which causes large
pilot overhead, and also identifies the users with worst channel conditions and prevents the wrong
division of such users from the center user group. The second algorithm for pilot decontamination
utilizes the idea of pseudo-random codes in which orthogonal pilot are assigned to different cells.
Such codes are deployed to get a transmission pilot by scrambling the user pilot in the cell. Since the
pilot contamination is generated because different cells multiplex the same set of orthogonal pilots
and the pseudo-random sequences have good cross-correlation characteristics, this paper uses this
feature to improve the orthogonality of pilots between different cells. Simulation results show that
the proposed algorithms can effectively improve channel estimation performance and achievable rate
as compared with other schemes.

Keywords: Massive MIMO; pilot decontamination; MSE; dynamic user scheduling; dynamic
pilot allocation

1. Introduction
With the advent of the era of big data and increasing demand by the explosion of growing
numbers of subscribers, the demand for communication networks has exploded, and the existing
mobile communication networks (4G) are increasingly unable to meet the needs of users for the
network [1]. Massive MIMO is a key technology for 5G wireless communications to increase the
spectral efﬁciency [2–4]. It has the ability to be deployed in various communications paradigms such as
multicarrier communication, Orthogonal Frequency Division Multiplexing (OFDM) and cooperative

Electronics 2019, 8, 55; doi:10.3390/electronics8010055 51 www.mdpi.com/journal/electronics

Electronics 2019, 8, 55

communications [5–12]. As massive MIMO utilizes different frequencies in a Frequency Division

Duplex (FDD) system. Therefore, the current research on massive MIMO is generally depending on
a Time Division Duplex (TDD) system [13], that is, using channel reciprocity to obtain the required
channel state information (CSI) [14], but the limited coherence interval limits the number of orthogonal
pilots allocated to the user, which inevitably exists. The use of the same pilot by different cell users
results in the inability of the Base Station (BS) to distinguish between pilot contamination [15].
In view of the fact that pilot contamination is caused by different cells multiplexing the same pilot,
this paper proposes two algorithms for pilot decontamination in massive MIMO systems. The first
algorithm is depending on the path loss for performing user grouping (PLUG) while the second
algorithm is depending on pseudo-random codes [16]. In PLUG algorithm, the users are divided
into a central user group and an edger user group depending on the distance between the users and
the BS, and then classified according to the corresponding principle. In the case of a slight loss in
the performance of the central user, the communication outage probability of the cell edge user is
significantly reduced, and the quality of service (QoS) is significantly improved. On the basis of the
PLUG strategy, the improved PLUG (IPLUG) is further proposed, and the decision parameters are
dynamically selected to select the number of edge users to realize the dynamic division of the central
users and edge users and improve the flexibility of the strategy. The second algorithm is depending on
pseudo-random code that assigns different delays to each cell as a code sequence that distinguishes
different cells and uses these pseudo-random codes to correspond to the cells. The user pilot performs
synchronous scrambling to obtain new user pilots to enhance the orthogonality of user pilots between
different cells. At the same time, the mean square error (MSE) of the expected channel estimation
under the pilot design scheme is derived and analyzed. The strategy of this paper does not require the
large-scale cooperation of BSs. It only needs to know the distance and decision parameters between
users and BSs in the current cell and compete for a dynamic grouping by using the division principle.
The proposed model has low complexity and requires fewer parameters, which makes it suitable for
deploying in massive MIMO systems practical scenarios. It is proved that the proposed pilot design
scheme can effectively improve the performance of channel estimation. The numerical simulation
results also verify that the proposed pilot design algorithms depending on PLUG and pseudo-random
code can greatly improve the performance of channel estimation and effectively reduce the pilot
contamination of the system.

2. State-of-the-Art Pilot Decontamination Algorithms

The authors in [8] pointed out that in multi-cell massive MIMO systems, when the number of
BS antennas tends to infinity, the performance of the system is mainly limited by pilot contamination,
and thereafter the research on pilot contamination in massive MIMO systems has never stopped.
The authors in [17] analyzed the influence of pilots on the performance of massive multi-cell
MIMO systems and proposed a minimum mean square error (MMSE) multi-cell precoding method.
As compared with the traditional zero-forcing (ZF) precoding, the MMSE scheme can significantly
improve the gain of the system without pilot contamination, but the pilot contamination is relatively
serious. The increase in system gain is limited. In [18], a pilot contamination precoding (PCP) scheme
is proposed in which a precoding matrix is designed with ZF to obtain an infinite signal-to-interference
plus noise ratio (SINR), but the effect is not ideal when the number of antennas is limited. In [19],
the optimal algorithm for finding the optimal pilot contamination precoding matrix and the simple
suboptimal algorithm is proposed depending on [18], and the two algorithms are proved to be
limited when the number of BS antennas is limited. Compared with traditional MIMO, the gain
of the system can be greatly improved. In [20], an intelligent pilot allocation scheme is proposed,
which can maximize the uplink SINR of all users in the target cell under the channel of large-scale
fading characteristics. The authors in [21,22] proposed a pilot power control method, which was
successfully used in the classified cell, which effectively reduced the pilot contamination and improved

52
Electronics 2019, 8, 55

the downlink reachability and rate of the entire system. Pilot contamination is a problem inherent in a
massive MIMO system. The method of solving pilot contamination in different situations is not unique.
Ideally, massive MIMO should use Fully Orthogonal Pilot Scheduling (FOPS) to assign orthogonal
pilots to each user, but the length of the pilot sequence and the pilot set size are limited by the
channel coherence time. In a typical scenario, the maximum number of orthogonal pilot sequences
in a 1ms coherence time is about 200 [23]. Therefore, a massive MIMO system generally adopts
Fully Reused Pilot Scheduling (FRPS). Since the pilot between users is non-orthogonal or identical,
pilot contamination is unavoidable [16]. When the number of BS antennas tends to be infinite and
there is no cooperation, the main factor affecting the system performance is inter-cell interference (ICI)
caused by pilot contamination [23], so pilot contamination is critical to the performance improvement
of massive MIMO systems.
At present, massive MIMO pilot decontamination is mainly carried out from three aspects:
channel estimation method [24–28], matrix precoding method case [29,30], and pilot allocation
strategy [21]. The authors in [25] utilize a diagonal Jacket matrix for pilot reduction which has
low complexity, excellent eigenvector and a constant diagonal treatment and an energy harvest.
The drawback of such a method is that it assumes perfect multipath fast channel estimation. In [26],
the authors propose a pilot mitigation algorithm using a low-rate coordination between cells during
the channel estimation phase itself. The coordination makes use of the additional second-order
statistical information about the user channels, which are shown to offer a power way of discriminating
across interfering user with even strongly correlated pilot sequences. The authors in [27] propose
a highly efficient Discrete Fourier Transform (DFT) based approximation of the Linear Minimum
Mean Square Error (LMMSE) estimator for reducing the pilot contamination problem. The authors
in [28] propose an eigenvalue-decomposition-based approach to channel estimation, that estimates
the channel blindly from the received data. The approach exploits the asymptotic orthogonality
of the channel vectors in massive MIMO systems. It also deploys a short training sequence to
resolve the multiplicative ambiguity of the received signals covariance. In [29], the authors propose
a zero forcing (ZF) time-shifted pilot scheme, which was known to mitigate the pilot contamination
effectively using conjugate beamforming. The authors in [30] proposed interreference-cancellation (IC)
precoding scheme for pilot contamination mitigation in massive MIMO systems. They investigated the
quality-of-service (QoS) guaranteed user-scheduling which is improved by deploying their proposed
scheme. The authors in [31] pilot contamination reduction scheme which is dependent on complex
exponential basis expansions. The Linear Time-Varying (LTV) channel is estimated and the optimal
pilot symbols are derived following the minimum mean square error (MMSE) criterion and it is shown
that the optimal pilot strategy is to group consecutive pilot tones together as a pilot cluster and to
distribute uniformly all pilot clusters in frequency-domain. Depending on the pilot allocation strategy,
the research found that: In [30], under the principle of maximizing Signal Leakage Noise Ratio (SLNR)
precoding, the same guidance is adopted for users with small ICI. Users with high frequency and
mutual interference user orthogonal pilots to improve the overall performance of the system in the
case of pilot contamination. However, the pilot scheduling scheme requires large-scale cooperation
between BSs and needs to know the large-scale fading factor of each user. Such a factor is very
difficult in a massive MIMO system with its own complex structure. The authors in [32] proposed
a pilot allocation strategy depending on power control, which makes the pilot transmit time slots
between cells with relatively large crossover gains staggered, but it is not easy to ensure pilot dynamic
synchronization of several cells. The choice of mechanism will directly affect the performance of
the strategy. The authors in [33] proposed a pilot coordinated allocation scheme, which allocates
pilot sequences by identifying pilot usage conditions and selects user multiplexed pilots that are least
affected by pilot contaminations, thereby reducing pilot contamination, but inter-cell cooperation to
system brings additional burdens and expenses. The authors in [34] proposed an improved strategy
depending on soft pilot multiplexing. On the basis of soft pilot multiplexing technology, packet
parameters are introduced for secondary grouping, but the path loss factor, shadow fading, and the size

53
Electronics 2019, 8, 55

of each user need to be known. The parameters such as the scale fading factor are more complicated,
and the computational complexity of implementing the secondary grouping is higher.

3. System Model

3.1. The System Model

Figure 1 shows the model for massive MIMO system. The channel propagation matrix of all users
in the cell to the cell site is:
Hij = Gij Dij (1)

where: Gij = gij1 gij2... gijK , gijk ∈ C M×1 is a small-scale fading vector, each vector is independent
of each other and obeys a zero mean complex Gaussian distribution with a variance of I M , that is

gijk ∼ CN (0, I M ); Dij = diag β ij1 β ij2... β ijK is a K × K order diagonal matrix, used to describe the
α
jth cell large-scale fading of each user to the ith cell BS, β ijk = zijk / rijk /R represents the large-scale
fading factor of user k to ith BS in the jth cell. Wherein, the shadow fading zijk obeys a lognormal

distribution, that is, 10lg zijk obeys a zero mean, the standard deviation is a Gaussian distribution of
σshadow , R is defined as the cell radius, and rijk represents the distance from the user k to the ith BS in
the jth cell, α represents the path loss factor, and zijk , rijk , α are independent of each other. At the same
time, it is assumed that the antenna arrays in the same BS are sufficiently compact in arrangement,
the large-scale fading of specific users is equal in all propagation paths, but the large-scale fading of
different users is independent of each other, and the channel is reciprocity, that is, uplink propagation
is assumed to be the conjugate transpose of the matrix in the downlink propagation matrix [22]. For a
multi-cell multi-user massive MIMO system, the number of antenna arrays installed by the BS is large,
and the channel satisfies progressive orthogonality, namely:

HijH Hij GijH Gij

= Dij Dij ≈ Dij (2)
M M
MK

Figure 1. Massive MIMO multi-cell multi-user TDD system model.

3.2. Causes of Pilot Contamination

In a massive MIMO multi-cell multi-user TDD system, the BS is estimating the uplink pilot
signal by the user in each coherent time and complete signal detection and downlink precoding.

54
Electronics 2019, 8, 55

At the beginning of each coherence time, all users in all cells simultaneously transmit pilot sequences.
At the beginning of each coherence time, all users in all cells simultaneously transmit pilot sequences.
Suppose ψi = (ψ1i , ψ2i , . . . , ψKi ) T is the K × τ dimension pilot sequence matrix of all users in ith
cell (τ is the sequence length), which satisﬁes ψi ψiH = IK , where IK is the unit matrix of order K × K.
Under the FRPS policy, the pilot matrix received by the uplink cell BS is:

L
√
∑ Hij ψi
p p
yi = pp + ni (3)
j =1

p
where, p p is the pilot signal transmission power and ni is the additive white Gaussian noise matrix
of order M × τ. After receiving the pilot signal, the BS starts uplink channel estimation. The channel
estimation value Ĥii of the target cell is obtained by using the LS channel estimation introduced earlier
which is expressed as:
1 p 1 p
Ĥii = √ yi ψiH = Hii + ∑ Hij + √ ni ψiH (4)
pp j =i
pp

It can be seen from Equation (4) that the ith cell channel estimation value Ĥii includes the
superposition of the channel propagation matrix of other cells to the target cell in addition to the
inﬂuence of the target channel and noise, which is pilot contamination.
After the user sends the pilot, all users send data signals to the BS and use the same time-frequency
resource. The signal received by the ith cell BS is deﬁned as:

√ L K
yiu = pu ∑ ∑ hijk xujk + niu (5)
j =1 k =1

where x ujk is the transmitted data symbol of user k in jth cell; pu is the uplink user data symbol average
transmit power; hijk represents the channel transmission vector of user k to ith cellBS in jth cell, which
is the kth column of Hij ; niu is the additive white Gaussian noise vector of order M × 1. The BS uses the
channel estimation value Ĥii of the Equation (4) and the received signal vector yiu , and the MF detects
the original data symbol x̂ ujk transmitted by the user k in the ith cell is expressed as:

H
L
√ L K
x̂ ujk = ĥiik yiu = ∑ hijk
H
+ vik
H
pu ∑ ∑ hijk xujk + niu (6)
j =1 j =1 k =1

p √
where vik is the column vector of the matrix ni ψiH / p p . When the number of BS antennas M
approaches positive inﬁnity, it is easy to know from Equation (2) that the channel of the massive MIMO
system exhibits progressive orthogonality, and the progressive expression of Equation (6) is:

√
u
x̂ik ≈ M pu β iik x ujk + ∑ β ijk x ujk (7)
j =i

It can be known from Equation (7) that when the BS antenna M → ∞ , the data symbol
u will not be affected by the small-scale fading factor and noise. Therefore, when M → ∞ ,
x̂ik
the signal-to-interference and noise ratio (SINR) of the user k uplink received signal in the ith cell can
be deﬁned as:
β2iik
u
SI NRik = (8)
∑ j =i β2ijk

It can be seen from Equation (8) that due to the existence of pilot contamination, the uplink SINR
is limited by the large-scale fading factor of the same pilot user in the interfering cell. According to
Equation (8), the uplink achievable rate of user k in the target ith cell is given by:

55
Electronics 2019, 8, 55

u
Cik = (1 − μ0 ) E{lb(1 + SI NRik
u
)} (9)

where μ0 is the pilot overhead coefﬁcient for full multiplexing, indicating the spectral efﬁciency loss
caused by the pilot sequence used for channel estimation, when other pilot allocation algorithms are
used, the adjustment in Equation (9) is to be appropriately made.

3.3. IPLUG Algorithm

The PLUG algorithm significantly improves the QoS of the edge user but lacks certain flexibility.
This is because the number of edge users selected by each cell in the PLUG policy is the same, and only
compared with the distance between other users in the cell and the BS. The user’s own specific
environment is not considered. Therefore, we proposed an improved pilot scheduling algorithm
depending on path loss to perform user grouping (IPLUG) and dynamically select the number of
edge users per cell by introducing decision parameters. Figure 2 is a schematic diagram of the IPLUG
algorithm decision parameters. The improved IPLUG algorithm is depending on the PLUG algorithm
and is designed to improve the accuracy and legitimacy of performing user groupings. In the IPLUG
algorithm, when the decision parameter λ is selected, the BS can complete the dynamic grouping
according to the user distance d and λ × R. The specific division principle is:

? Yes → Edge User
d λ×R → (10)
≥ No → Center User

Let the central user set be Uc , the edge user set be Ue , the central user pilot set be ψc , and the edge
T
user pilot set be ψe , the pilot set of jth cell is ψj = ψ1j , ψ2j , . . . , ψKj , j = 1, 2, . . . L, then:

1 → ψki ⊆ ψc and ψkj ⊆ ψc
H
ψki ψkj = (11)
0 → otherwise

Then, the pilot vector received by the target ith cell BS is:

L K L
√
∑ ∑ ∑
p p
yi = pp hijk ψkj + hijk ψkj + ni (12)
j =1 k =1, k ∈Uc k =1, k ∈Ue

For the target ith cell user k, the Least Square (LS) channel estimation is used to obtain the target
channel estimation value:
1 p
ĥiik = √ yi ψkiH (13)
pp
Combining Equations (12) and (13), when the number of antennas M → ∞ the SINR of the central
user under the IPLUG algorithm is:
2
H
hiik hiik
uc
SI NRik = 2 (14)
H
∑ j =i, ψki ⊆ψc , ψkj ⊆ψc h jik h jik

The edge user is not affected by the pilot contamination, so when M → ∞ , the edger user
ue → ∞ .
SI NRik
Figure 3 illustrates the ﬂowchart for the IPLUG algorithm.

56
Electronics 2019, 8, 55

Figure 2. IPLUG policy decision parameters for user grouping.

Start

Initialize parameters

Determine the path loss of

users

Determine the pilots associated

with users

d ≥ λR

No
Yes

Add the user to Edger user Add the user to Center user
group group

End

Figure 3. Proposed IPLUG algorithm ﬂowchart.

57
Electronics 2019, 8, 55

3.4. Pseudo-Random Code-Based Pilot Scheduling

3.4.1. Uplink
Assume that all users in any cell send a pilot sequence of length τ, if the pilot transmitted by K

users in the jth cell is ψj = φj1 φj2 . . . φjK , where φjk ∈ Cτ ×1 is the pilot transmitted by the kth user in
H φ = 1), when the average transmit power of the user is ρ , the pilot signal received by the
the cell (φjk jk r
lth cell BS is:
√ L
Yl = ρr τ ∑ Gjl ψjT + nl (15)
j =1

where Gjl = Hjl D jl is the channel between all users of the jth cell and the lth cell BS. Let β jlk
denote the large-scale fading coefﬁcient of the user k of the jth cell to the lth cell BS, then D jl can be

expressed as a diagonal matrix, and the diagonal element is β jl = β jl1 , β jl2 , . . . , β jlK , nl denotes
the additive white Gaussian noise of the lth cell. The term Hjl can be determined from the following
matrix expression:
⎡ ⎤
h j1 l1 · · · h j1 l M
⎢ ⎥
Hjl = ⎣ ... ···
..
. ⎦ (16)
h jK l1 · · · h jK l M
The elements of Hjl are independently and identically distributed (i.i.d), and h jk lm represents a small-
scale fading coefficient between the kth user in the jth cell and the mth antenna of the BS in the lth cell.
The BS estimates the channel using the received pilot signal, which is known from the standard
results in the estimation theory [24]. When the number of BS antennas tends to inﬁnity, the MMSE
estimate of the channel is:
−1
√ L
Ĝjl = ρr τYl I + ρr τ ∑ ψi∗ Dil ψiT ψ∗j D jl (17)
i =1

3.4.2. Downlink
Considering the BS of the lth cell, it is assumed that the information symbol transmitted by the lth

cell to the user is Sl = [Sl1 Sl2 . . . SlK ] T , and Al = f Ĝll is a linear precoding matrix of order M × K,
where f (·) represents a speciﬁc linear precoding technique at the BS. After precoding, the signal
matrix sent by the BS can be represented as Al Sl , and the BS satisﬁes the average power constraint,
+ ,
that is, tr E Al Sl SlH AlH ≤ P, then the data information received by the user of the jth cell can be
expressed as:
L
-
X j = ρ f ∑ GjlT Al Sl + n j (18)
l =1

where ρ f is the downlink transmit power and n j is the AWGN noise of the corresponding cell.
Assuming a simple MF precoding technique at the BS, i.e., Al = Ĝll∗ , Equation (25) can be expressed as:
⎛ ⎞∗
- L
- L L
Xj = ρf ∑ GjlT Ĝll∗ Sl + nj = ρ f ρr τ ∑ GjlT ⎝ρr τ ∑ Gj l ψjT + n l ⎠ Z ∗ Sl + n j (19)
l =1 l =1 j =1

−1
where Z = I + ρr τ ∑iL=1 ψi∗ Dil ψiT ψ∗j D jl . When the number of BS antennas increases and tends to
inﬁnity then:

−1
1 √ L L
√ X j → ρr τ ∑ D jl ψlH ψj + ρr τ ∑ ψlH ψi Dil∗ ψiH ψj Dll∗ Sl (20)
M ρ f ρr τ l =1 i =1

58
Electronics 2019, 8, 55

It can be obtained from Equation (20) that when all cells multiplex the same set of orthogonal
pilots, the right side of Equation (20) can be equated to:

L L −1

∑ Djl I + ρr τ ∑ Dil∗ Dll∗ Sl (21)

l =1 i =1

At this time, the signal received by the jth cell will be interfered by the signals transmitted by
other cells, especially when the large-scale fading coefﬁcient is large; this phenomenon is called pilot
contamination. When all cells use orthogonal pilots, the right side of Equation (20) can be equated to:
−1
D jj I + ρr τD ∗jj D ∗jj Sl (22)

In this case, the signal received by the user is a scaling of the signal transmitted by the BS of the cell,
and there is no pilot contamination. However, in actual communication, because the coherence interval
is limited, it cannot guarantee that all cells can adopt an orthogonal pilot. Therefore, pilot contamination
has become an important factor limiting the performance of massive MIMO systems.

3.4.3. Pseudo-Random Code Scheme Description

The pseudo-random code has the similarity of noise sequence. It is a seemingly random but
actually regular periodic binary sequence, including m sequence, a Gold sequence, M sequence and
combined sequence [35]. Pseudo-random codes have good autocorrelation and cross-correlation
properties and can be used as address codes in CDMA communication technology. In addition, thanks
to its pseudo-random characteristics, pseudo-random codes are also widely used in information
encryption [36]. The pseudo-random sequence is generated by the linear feedback shift register
shown in Figure 4, where ci = 0, indicates that the line is off; ci = 1 indicates that the line is on.
The characteristic polynomial of generating a pseudo-random sequence is:
n
f (x) = ∑ c i x i = c0 + c1 x + . . . + cn x n (23)
i =0

where xi indicates the ith shift register; c0 = c1 = 1, for any i = 0 and i = n, ci ∈ {0, 1}.

Figure 4. Pseudo-random code linear feedback shifter register.

Since the pilot contamination is generated because different cells multiplex the same set of
orthogonal pilots and the pseudo-random sequences have good cross-correlation characteristics,
this paper uses this feature to improve the orthogonality of pilots between different cells. Depending
on the above considerations, this paper proposes a pilot design scheme depending on pseudo-random
code: in the case that all cells multiplex the same set of orthogonal pilots, the pseudo-random code is
used as the code sequence for distinguishing different cells, for each cell allocates a pseudo-random
code with different delays, and synchronously scrambles the pseudo-random code in each cell to the
user pilot of the corresponding cell to obtain a new pilot.

59
Electronics 2019, 8, 55

The pilot design scheme depending on the pseudo-random code is shown in Figure 5. Since the
ﬂow of the pilot design of different cells is the same, in order to facilitate the representation without
loss of generality, the pilot design scheme is speciﬁcally described here. Assume that the pilot of the
kth user in a cell is φk = [φk1 , φk2 , . . . , φkτ ] T , and the pseudo-random sequence generated by the linear
feedback shifter register in the corresponding cell is m = [m1 , m2 , . . . , m N ]. The scrambler in Figure 5
indicates that the pseudo-random matrix is multiplied by the matrix of the corresponding user’s pilot
matrix. In general, a pseudo-random sequence is a 0–1 bit stream whose length is much larger than the
length of the pilot sequence.

Figure 5. Pseudo-random code pilot design scheme.

In order to ensure that the generated pseudo-random sequence can be used for scrambling,
it needs to process as follows:

1. The pseudo-random sequence generated by the linear feedback shift register is truncated, and the
method is to generate a rectangle depending on the pilot length τ and using a rectangular window
for pseudo-random sequences to make truncation.
2. Performing BPSK modulation on the truncated pseudo-random sequence.
3. The pseudo-random sequence is diagonalized after modulation, that is, τ pseudo-random
numbers are generated as pseudo-random matrices on the diagonals of the diagonal matrix.

The pseudo-random matrix used for scrambling after the above processing can be expressed as:
⎡ ⎤
m1 0 ··· 0
⎢ ⎥
⎢ 0 m2 ··· 0 ⎥
P=⎢
⎢ .. .. .. .. ⎥
⎥ (24)
⎣ . . . . ⎦
0 0 ··· mτ

After the pseudo-random matrix is scrambled by the scrambler to the user pilot, the pilot of the
kth user outputted by the scrambler is obtained as:

60
Electronics 2019, 8, 55

Ψk = Pφk = [m1 m2 φk2 . . . mτ φkτ ] T (25)

The scrambling method used in this paper is to use the same pseudo-random sequence to
synchronously scramble the pilots of all users in the cell, and the pilot of K users in the cell can be
obtained after the pilot design as:
⎡ ⎤
m1 φ11 ··· m1 φK1
⎢ .. .. ⎥
Ψ=⎣ . ··· . ⎦ (26)
mτ φ1τ ··· mτ φKτ

The designed pilot is transmitted to the BS through the channel, and the BS can obtain the required
Channel State Information (CSI) by using channel estimation.
When the pilot design is performed on users of other cells, considering the pilot contamination
is related to the distance between users, when the distance between two users is large enough,
the inﬂuence of pilot contamination is neglected. Therefore, the proposed pseudo-random code pilot
scheduling scheme only considers the pilot contamination of the target cell and ignores the pilot
contamination of the farther cell. Under the above assumptions, only the pseudo-random sequence
needs to be allocated to the target cell and the neighboring cell, and the pseudo-random sequence
can be reused for the farther cell, which greatly reduces the usage and implementation complexity
of the pseudo-random sequence and makes it easy to design a better pilot sequence. Depending on
the above considerations, the pilot design of all cells can be designed by using the proposed pilot
design scheme. The only difference is that the pseudo-random sequence used by the target cell and the
neighboring cell cannot be the same, which involves a pseudo-random sequence selection problem. Since
the cross-correlation between different pseudo-random sequences is different, if the cross-correlation
value between the selected pseudo-random sequences is large, even if the user pilot is designed, the pilot
contamination between the cells is still serious, so that the pilot design seems to be meaningless, so it is
necessary to follow the certain criteria when selecting the pseudo-random sequence as the code sequence
to distinguish difference cells. Theoretically, the cross-correlation values between pseudo-random
sequences are connected. Nearly 0, the effect of the pilot design is more obvious, but the complexity
of acquiring these pseudo-random sequences will become very high. In order to obtain the desired
effect within a certain complexity range, this paper adopts the following criteria to obtain the required
pseudo-random sequence. Assuming that the number of cells requiring different pseudo-random
sequences is M, the corresponding M pseudo-random sequences need to be satisfied:

ρ mi , m j < γi , j = 1, 2, . . . , M (27)
i=j

where ρ mi , m j = n1 ∑nk=1 mik m jk , is a function that represents a normalized cross-correlation value
between two pseudo-random sequences. γ ∈ [0, 1] is a constant values indicating the upper limit of
the normalized cross-correlation value between any two selected pseudo-random sequences.
It can be seen from Equation (27) that in the case where the pseudo-random sequence used is
determined, the setting of the value is an important factor affecting the impact of the pilot design
scheme. The smaller the value, the smaller the cross-correlation value between pseudo-random
sequences. After the pilot design, the orthogonality of user pilots between different cells is stronger,
and the effect of reducing pilot contamination is more obvious. Otherwise, it cannot effectively reduce
the pilot contamination. The BS and the user in each cell share a pseudo-random sequence, and the BS
can distinguish pilots from different cells by using detection techniques. At the same time, since each
cell has the same pseudo-random sequence to scramble a set of orthogonal pilots of all users in the
cell, after pilot design, the user pilots in each cell remain orthogonal, and in addition, no additional
intra-cell interference will be introduced. Moreover, increasing the pilot length is beneﬁcial to improve
the efﬁciency of the pilot design. This is because the length of the pseudo-random sequence is the same

61
Electronics 2019, 8, 55

as the length of the pilot, and the longer the pseudo-random sequence, the better the orthogonality
between them, so increasing the pilot length is equivalent to improving the orthogonality between the
pseudo-random sequences. Thereby, the orthogonality of user pilots between different cells after the
pilot design is improved.

3.4.4. Mean Square Error (MSE) Performance Analysis of Expected Channel Estimation
This section will use the new pilot to derive and analyze the MSE of the channel estimate and
explore the pilot design method described in Section 3.3. It can be seen from [15] that when the
pilot contamination is relatively serious, the channel estimation performance of the system drops
sharply. In this case, even if a complex multi-cell MMSE precoding scheme is applied, the performance
improvement of the massive MIMO system is limited. In order to facilitate derivation and analysis,
this section only studies simple single-cell ZF precoding and its precoding matrix is deﬁned as:
−1
Ĝll Ĝ H Ĝll
Al = . ll (28)
−1
tr ĜllH Ĝll

It can be found from (28) that a single-cell ZF precoding matrix can be designed as long as the
information of the desired channel is obtained. Therefore, the subsequent derivation and analysis in
this section are depending on the estimated expected channel.
As can be seen from Section 3.4, the user pilot of the jth cell is Ψj = Pj Ψ, then Equation (15) can be
expressed as:
√ L
√ L
Yl = ρr τ ∑ Gjl ΨjT + nl = ρr τ ∑ Gjl ψ T PjT + nl (29)
j =1 j =1

In the case where the number of BS antennas is limited, the MMSE estimation of the desired
channel is:
−1
√ L
Ĝll = ρr τ Yl Cn + ρr τ ∑ Pi∗ ψ∗ Cil ψ T PiT Pi∗ ψ∗ Cil (30)
i =1
+ ,
where Cn = E , is the autocorrelation matrix, representing the received noise nl ; Cjl =
nlH nl
! "
E GjlH Gjl is the autocorrelation matrix representing the channel transfer matrix Gjl .
The MMSE deﬁning the expected channel estimation of the lth cell BS is:
! 2 "
Mmse E Ĝll − Gll F (31)

√ −1
Let R = ρr τ Cn + ρr τ ∑iL=1 Pi∗ ψ∗ Cil ψ T PiT Pi∗ ψ∗ Cil , then Ĝll = Yl R, substituting into (38) we
get: ! ! ""
Mmse = E tr (Yl R − Gll ) H (Yl R − Gll )
+ + , + , + , + ,, (32)
= tr R H E YlH Yl R − R H E YlH Gll − E GllH Yl R + E GllH Gll
Using the model shown in Equation (29), there is:

! " L
E YlH Yl = Cn + ρr τ ∑ Pi∗ ψ∗ Cil ψ T PiT (33)
i =1
! " √
E YlH Gll = ρr τPi∗ ψ∗ Cll (34)
! " √
E GllH Yl = ρr τCll ψ T PlT (35)

62
Electronics 2019, 8, 55

Substituting Equations (32)–(34) into Equation (35), we can rewrite Mmse as:

L √ √
Mmse = tr R H Cn + ρr τ ∑ Pi∗ ψ∗ Cil ψ T PiT R − ρr τR H Pi∗ ψ∗ Cll − ρr τCll ψ T PlT R + Cll
+ i =1 ,
√
= tr Cll − ρr τCll ψ T PlT R #
−1
L
= tr Cll − ρr τ Cll ψ T PlT Cn + ρr τ ∑ Pi∗ ψ∗ Cil ψ T PiT Pl∗ ψ∗ Cll
i =1
(36)
Since the matrix Pi is a diagonal matrix composed of diagonal elements 1 and −1, and ψ H ψ = I,
so ( Pi ψ) H ( Pi ψ) = I can be obtained according to the properties of the matrix conjugate transpose.
Using the matrix inversion principle, we can express Equation (36) as:
⎧ ⎫
⎨ L −1 ⎬
M mse
= tr Cll − ρr τ Cll ΨlT Cn Ψl∗ + ρr τ ∑ ΨlT Ψi∗ Cil ΨiT Ψi∗ Cll (37)
⎩ i =1
⎭

where Ψi = Pi ψ, i = 1, 2, . . . , L. When M → ∞ then:

! " - ! "-
Cll = E GllH Gll = Dll E HllH Hll Dll = MDll (38)
! "
Cn = E nlH nl = MIτ (39)

Substituting Equations (38) and (39) into Equation (37), we get:

⎧ ⎫
⎨ L −1 ⎬
1 mse
M = tr Dll − ρr τ Dll I + ρr τ ∑ ΨlT Ψi∗ Dil ΨiT Ψi∗ Dll (40)
M ⎩ i =1
⎭

When the number of users in each cell is K = 1, ΨlT Ψi∗ = ( Pl ψ) T ( Pi ψ)∗ = γli , where γli
is a normalized cross-correlation value between the pseudo-random code in the lth cell and the
pseudo-random code in the ith cell, which can be obtained from equation (27). Assuming that the
large-scale fading coefﬁcient Dil = [ β il1 ] at this time, then Equation (40) can be simpliﬁed as:

1 mse ρr τβ2ll1
M = β ll1 − (41)
M 1 + ρr τβ ll1 + ρr τ ∑i =l γli2 β li1

It can be seen from the results of Equation (41) that when the large-scale fading coefﬁcient and the
uplink transmit power are constant and the pilot length is constant, the MSE performance of the channel
estimation that can be obtained by applying the pilot scheme depending on the pseudo-random code
and it is mainly limited by the cross-correlation performance between the pseudo-random sequences
used: when the normalized cross-correlation value between the pseudo-random sequences used is
large (|γli | is close to 1), the MSE performance of the channel estimation is relatively poor. When
the normalized cross-correlation value between the pseudo-random sequences used is small (|γli |
is close to 0), the MSE performance of the channel estimation is better. It can be concluded that if a
suitable pseudo-random sequence is selected, the above method is adopted. The proposed pilot design
scheme can effectively improve the performance of channel estimation, thus achieving the purpose
of reducing pilot contamination. Figure 6 shows the ﬂowchart of the proposed pseudo-random pilot
code algorithm.

63
Electronics 2019, 8, 55

Start

Initialize parameters

Determine the pilot sequence

for the kth user

Determine the pseudo-random

sequence in the corresponding No
cell

Scramble the pilot sequence of

the kth user with the generated
pseudo-random sequence

PNlength >> Pilot

Yes

Perform channel estimation

End

Figure 6. Proposed pseudo-random algorithm ﬂowchart.

4. Simulation Results

4.1. Simulation Scenario and Parameters

Taking the MATLAB software (R2017b, The MathWorks, Natick, MA, USA) as the simulation
platform, it is assumed that the massive MIMO multi-cell multi-user TDD system includes 7 regular
hexagonal cells, as shown in Figure 7.
Table 1 shows the details of simulation parameters which are used for simulation analysis of the
proposed pilot decontamination schemes. It is assumed that the antenna spacing is 12 times the carrier
wavelength, that is, there is a correlation between the antennas, but the correlation is not considered in
this paper, and it is assumed that all the antennas are omnidirectional antennas. The number of users
in each cell is K = 10, the user is uniformly distributed in the cell, the pilot length is 128, the path loss
factor is 3, the pilot overhead coefﬁcient is μ0 = 0.05, the cell radius R = 500 m, and the concentrated
antennas is 16 ≤ M ≤ 1024, and the lognormal shadow fading is 4 dB (only the last normal logarithmic
normal shadow fading is set to 2 dB), and the number of simulations is 2000.

64
Electronics 2019, 8, 55

Figure 7. Simulation scenario of massive MIMO system of seven cells.

Table 1. Simulation Parameters.

S. No Parameter Name Value

1 Number of BS antennas (M) 16~1024
2 Number of users per cell (K) 10
1
3 Antenna Spacing 2 λc
4 Pilot Length (τ) 128
5 Path Loss Factor (α) 3
6 Pilot overhead coefﬁcient (μ0 ) 0.05
7 Cell Radius (R) 1000 m
8 Lognormal Shadow Fading 4 dB
9 Number of simulations 5000
10 Average transmit power at BS (ρ f ) 20 dB
11 Average transmit power at User (ρr ) 10 dB
12 The upper limit of normalized cross-correlation (γ) 0.3
13 Cross-gain (β jlk ) 0.7

According to the above simulation parameters, the Monte Carlo method is used to simulate
the uplink SINR, and the normalized channel estimation MSE and the target cell achievable rate,
thereby comparing the performance between the PLUG algorithm, the IPLUG, and the FRPS
algorithm respectively.

4.2. Analysis of PLUG Algorithm

Figures 8 and 9 respectively show the performance of the channel estimation normalized MSE
and uplink SINR of the central user and the edge user. The performance of the uplink SINR varies
with the number of antennas.

65
Electronics 2019, 8, 55

Figure 8. NMSE comparison of FRPS and PLUG Algorithms.

Figure 9. NMSE comparison of FRPS and PLUG Algorithms.

It can be seen from Figures 8 and 9 that the channel estimation normalized MSE of the edge user
and the central user under both algorithms has a decreasing trend with the increase in the number
of antennas. However, the MSE and SINR curves of the central users in both algorithms overlap,
indicating that the center user multiplexes the pilots and causes the same pilot contamination. The edge
user allocates orthogonal pilots under the PLUG algorithm, so there is no pilot contamination. When
the number of antennas is 256, the NMSE performance of the edge users is increased by 10.88 dB,
and the SINR performance is improved by 3.23 dB. Figure 10 is a graph showing the average achievable
sum-rate performance of the center user and the edge user as the number of antennas increases when

66
Electronics 2019, 8, 55

the number of edge users is one. Figure 12 shows the performance variation of the uplink achievable
rate of the target cell of the central user and the edge user when the number of edge users is 1. It can
be seen from Figure 10 that the average achievable rate of the central user in the PLUG algorithm is
slightly lost. This is because the orthogonal pilot set is added under such algorithm, and the pilot
overhead is increased, resulting in a decrease in spectral efﬁciency (SE), but the performance of the
edge user is very good. With a large boost, the average achievable is increased by 0.42 bps/Hs when
the number of antennas is 256. It can be seen from Figure 12 that the uplink achievable rate curves
of the target cell are basically coincident under the two algorithms, which indicates that the PLUG
algorithm does not bring loss to the overall performance of the target cell, but only increases the
fairness of the central user and the edge use. It is also obvious from the results that the proposed PLUG
algorithm reduces the probability of edge user communication interruption. Figure 13 compares the
uplink target cell achievable rate in the case where the number of edge users of the FRPS and PLUG
algorithms is different and for the number of antennas of 256. As can be seen from Figure 13, when
the number of edge users increases, the achievable rate of the PLUG algorithm gradually decreases.
This is because when the number of edge users is greater than one, the cost of pilot overhead exceeds
the performance gains of the PLUG algorithm.

Figure 10. Comparison achievable rate of target cell of FRPS and PLUG algorithms.

4.3. Analysis of IPLUG Algorithm

Figure 11 compares the NMSE of the central user and the edge user against the number of
antennas for the FRPS, PLUG, and IPLUG algorithms, wherein the IPLUG algorithm corresponds
to a decision parameter 0.9, and the PLUG algorithm corresponds to an edge user of 1. As can be
seen from Figure 11, when the number of antennas is 256, the NMSE performance of the edge users
under the IPLUG algorithm is 2.95 dB higher than that of PLUG algorithm, and the central user NMSE
is improved by 4.68 dB. The improvement of channel estimation performance greatly improves the
performance of uplink signal detection and downlink coding. This is due to the ﬂexibility brought by
the IPLUG algorithm.

67
Electronics 2019, 8, 55

Figure 11. NMSE comparison of IPLUG, FRPS, and PLUG algorithms.

Figure 14 compares the uplink target cell achievable rate for FRPS and IPLUG with different
decision parameters when the number of antennas is 256. As can be seen from Figure 14, when the
decision parameter increases to λ ≥ 0.68, the achievable rate of the IPLUG algorithm gradually
increases. Therefore, the achievable rate of the proposed IPLUG algorithm have significantly exceeded
the FRPS algorithm, which is a clear advantage of the IPLUG algorithm, avoiding the waste of pilot
overhead caused by some users with good channel conditions being misclassified as edge users,
or some users with poor channel conditions are being misclassified as the central user which results in
communication interruption.

4.4. Analysis of Pseudo-Random Pilot Scheme

Figure 15 shows the probability density function (PDF) curves of the channel estimation MSE for
the proposed pseudo-random scheme and compares it with no pilot contamination and with pilot
contamination cases. The parameters set used for these results are: the number of users per cell is
K = 4, the number of BS antennas M = 100 and pilot length τ = 8. It can be seen from Figure 15
that the MSE of the channel estimation that can be obtained by all cells multiplexing the same set of
orthogonal pilots is around 1.63, and the MSE distribution of the channel estimation obtained by the
pilot after the proposed pseudo-random pilot design is around 0.82. It can be seen that the proposed
pseudo-random code scheme is opposite to the channel when all the cells are multiplexed with the same
set of orthogonal pilots. The estimated MSE performance has increased nearly two times. Although all
cells use orthogonal pilots, there is no pilot contamination, but there are certain uncorrelated noise and
fast fading effects when the number of antennas is limited. The MSE of channel estimation is not 0.
As the number of BS antennas increases, the uncorrelated noise and fast fading effects are gradually
averaged, and the MSE of channel estimation gradually approaches 0.

68
Electronics 2019, 8, 55

Figure 12. Achievable rate comparison of FRPS and PLUG algorithms for user fairness analysis.

Figure 13. Comparison of achievable sum rate against a different number of edge users for FRPS and
PLUG algorithms.

69
Electronics 2019, 8, 55

Figure 14. Comparison of achievable sum rate against a different number of user grouping factor λ for
IPLUG and FRPS algorithms.

Figure 15. PDF comparison of the proposed pseudo-random code scheme with other cases.

Figure 16 depicts the trend of the system’s downlink transmission BER as the number of antennas
increases. The number of users per cell is K = 8, the number of BS antennas M = 100 and pilot length
τ = 32. It can be seen from Figure 16, as the number of BS antennas increases, the performance of the
system is improved. In the case of full pilot contamination, the pilot performance is too serious, and the
improvement of system performance is not obvious with the increase of the number of antennas.
Therefore, it can be foreseen that when the number of antennas reaches a certain value, the system
performance will not be improved.

70
Electronics 2019, 8, 55

Figure 16. Comparison of the BER of the proposed pseudo-random pilot design scheme with other
schemes against the number of BS antennas.

When the proposed pilot design scheme or no contamination is used, the performance of the
system is significantly improved with the increase of the number of antennas. Figure 17 depicts
the trend of the downlink BER of the system as the average transmit power ρ f of the BS increases.
The results are analyzed for different pilot lengths. The number of users per cell K = 8, the number
of antennas configured by the BS M = 100. As can be seen from Figure 17 that when the number
of BS antennas is large, the increase of the BS transmit power is effective for improving the system
performance, but when it exceeds a certain value, the system performance will not be improved,
and this value will be affected by the severity of pilot contamination. It can be seen from Figure 17 that
in the case of complete pilot contamination, the system performance will not be improved after the BS
transmission power reaches 25 dB. In the case of the proposed pilot design scheme (pseudo-random
code) or no contamination, this value is greater than 25 dB. At the same time, it can be found that in the
case of complete pilot contamination, the system performance will not be improved with the increase
of the pilot length τ; in the absence of pilot contamination, the increase of the pilot length can improve
the system performance to a small extent, but there is no need to sacrifice pilot overhead to boost such
tiny performance. After using the proposed pseudo-random pilot design scheme, the performance
of the system will increase significantly with the increase of the pilot length. Furthermore, when the
pilot length τ = 64, the performance achieved by the proposed pilot design scheme is very close
to the case of no pilot contamination. Figure 18 compares the BER of the proposed pseudo-random
pilot design scheme with other cases versus the number of cell users K. The number of antennas
configured by the BS is M = 100, and pilot length τ = 128. It can be seen from Figure 18 that in the
case where the number of BS antennas is large, the increase in the number of cell users may deteriorate
the performance of the system, but the performance of the system is still far better when the proposed
pseudo-random pilot design or no pilot contamination scheme is used. At the same time, when the
number of users is small and the pilot length τ = 128, the same performance of the system can be
almost achieved in the case of no contamination when the proposed pilot design scheme is adopted.
This is because the increase in the length of the pseudo-random code can only make them infinitely
close to the orthogonal but cannot completely remove the pilot contamination. In the case of a small
number of users, the pilot contamination is small enough to affect the performance of the entire system.

71
Electronics 2019, 8, 55

In the case of a large number of users, the impact on the overall system performance is considerable
due to the superposition of pilot contamination. Overall, the proposed pilot design scheme is very
obvious for the improvement of system performance.

Figure 17. Comparison of the BER of the proposed pseudo-random code and another scheme versus
the average BS power ρ f .

Figure 18. BER comparison against the number of cell users for the proposed pseudo-random pilot
design and other cases.

Figure 19 compares the NMSE of the proposed IPLUG algorithm with the conventional
state-of-the-art algorithms [29,30,32] wherein the IPLUG algorithm corresponds to a decision parameter

72
Electronics 2019, 8, 55

0.9. As can be seen from the ﬁgure that the proposed IPLUG algorithm shows better NMSE performance
than the conventional algorithms for increasing number of base station antennas. Figure 20 compares.

Figure 19. NMSE comparison of IPLUG and conventional algorithms.

Figure 20. Comparison of the BER of the proposed pseudo-random pilot design scheme with other
schemes against the number of BS antennas.

73
Electronics 2019, 8, 55

Figure 20 illustrates the BER comparison of the proposed pseudo-random code scheme and
the conventional schemes as the number of antennas increases. The number of users per cell is
K = 8, the number of BS antennas M = 160 and pilot length τ = 32. It can be seen from Figure 20,
as the number of BS antennas increases, the performance of the system is improved. The proposed
pseudo-random code scheme shows better BER performance as compared with the conventional
algorithms with increasing number of antennas.

5. Conclusions
This paper proposes a robust approach for effective pilot decontamination in massive MIMO
systems. Two efficient pilot decontamination schemes are proposed. The first scheme is depending on
Path Loss to perform User Grouping (PLUG) method while the second scheme is depending on the
pseudo-random code. The PLUG scheme divides users into central and edge users. Edger users are
allocated orthogonal pilots, and central users are assigned multiplex pilots, which improves edge user
performance. The Improved PLUG scheme (IPLUG) is further proposed to overcome the deficiency of
the PLUG scheme as it dynamically selects and correctly classifies the edge users and central users so
that there is no wrong misclassification and therefore, the communication quality of service is improved.
The analytical and simulation results show that the proposed IPLUG scheme can avoid the waste of
pilot overhead caused by users with good channel conditions being misclassified as edge users, or the
users with poor channel conditions being misclassified as central users, resulting in communication
interruption, therefore, the proposed IPLUG scheme increases the fairness of communication for each
user. The MSE of the expected channel estimation after the proposed pseudo-random code pilot design
scheme is deduced and analyzed. It is found that this scheme can not only effectively improve the
performance of channel estimation, but a more accurate channel estimation can be obtained by selecting
an appropriate pseudo-random code and pilot length. Thereby improving the performance of the
entire downlink system. The above conclusions are verified by numerical simulation. The numerical
results also show that the proposed pilot design scheme depending on pseudo-random code can greatly
improve the performance of the entire system limited by pilot contamination. The proposed PLUG
and IPLUG schemes are focused only on the cells of the omnidirectional antenna, and other sector
cells are not considered. At the same time, the pilot allocation is performed in the target cell, and the
pilot allocation between the cells is independent of each other and has certain limitations. Therefore,
the future research direction will be PLUG and IPLUG schemes depending on pilot decontamination
in sectoral cells. The problem is to consider the mutual connection of each cell to reasonably allocate
pilots throughout the system.

Author Contributions: Conceptualization, O.A.S., I.K. and B.M.L.; Data curation, O.A.S. and I.K.; Formal analysis,
O.A.S., I.K. and A.T.; Funding acquisition, I.K. and B.M.L.; Investigation, O.A.S., I.K. and A.T.; Methodology, O.A.S.,
I.K., B.M.L. and A.T.; Project administration, O.A.S. and I.K.; Software, O.A.S., I.K. and A.T.; Writing—original
draft, O.A.S., I.K. and B.M.L.; Writing—review & editing, O.A.S., I.K., B.M.L. and A.T.
Funding: This work was supported by the Basic Science Research Program through the National Research
Foundation of Korea (NRF) funded by the Ministry of Education (grant number: NRF-2017R1D1A1B03028350).
Conﬂicts of Interest: The authors declare no conﬂict of interest.

References
1. Khan, I. A Robust Signal Detection Scheme for 5G Massive Multiuser MIMO Systems. IEEE Trans.
Veh. Technol. 2018, 67, 9597–9604. [CrossRef]
2. Khan, I.; Singh, D. Efﬁcient Compressive Sensing Based Sparse Channel Estimation for 5G Massive MIMO
Systems. AEU Int. J. Electron. Commun. 2018, 89, 181–190. [CrossRef]
3. Khan, I.; Zafar, M.H.; Jan, M.T.; Lloret, J.; Basheri, M.; Singh, D. Spectral and Energy Efﬁcient Low-Overhead
Uplink and Downlink Channel Estimations for 5G Massive MIMO Systems. Entropy 2018, 20, 92. [CrossRef]
4. Khan, I.; Singh, M.; Singh, D. Compressive Sensing-Based Sparsity Adaptive Channel Estimation for 5G
Massive MIMO Systems. Appl. Sci. 2018, 8, 754. [CrossRef]

74
Electronics 2019, 8, 55

5. Molisch, A.F.; Win, M.Z.; Winters, J.H. Space-time-frequency (STF) coding for MIMO-OFDM systems.
IEEE Commun. Lett. 2002, 6, 370–372. [CrossRef]
6. Bolcskei, H.; Paulraj, A.J. Space-frequency coded broadband OFDM systems. In Proceedings of the IEEE
Wireless Communications and Networking Conference (WCNC), Chicago, IL, USA, 23–28 September 2000;
pp. 1–6.
7. Darsena, D.; Gelli, G.; Paura, L.; Verde, F. Blind Channel Shortening for Space-Time-Frequency Block Coded
MIMO-OFDM Systems. IEEE Trans. Wirel. Commun. 2012, 11, 1022–1033. [CrossRef]
8. Redferm, A.J. Receiver window design for multicarrier communication systems. IEEE J. Sel. Areas Commun.
2002, 20, 1029–1036. [CrossRef]
9. Darsena, D.; Gelli, G.; Paura, L.; Verde, F. NBI-resistant zero-forcing equalizers for OFDM systems.
IEEE Commun. Lett. 2005, 9, 744–746. [CrossRef]
10. Mergen, B.S.; Scaglione, A. Randomized Space-Time Coding for Distributed Cooperative Communication.
IEEE Trans. Signal Process. 2007, 55, 5003–5017. [CrossRef]
11. Knopp, R.; Humblet, P.A. Information capacity and power control in single-cell multiuser communications.
In Proceedings of the IEEE International Conference on Communications (ICC), Seattle, WA, USA, 18–22 June
1995; pp. 331–335.
12. Mastronarde, N.; Verder, F.; Darsena, D.; Scaglione, A.; van der Schaar, M. Transmitting Important Bits and
Sailing High Radio Waves: A Decentralized Cross-Layer Approach to Cooperative Video Transmission.
IEEE J. Sel. Areas Commun. 2012, 30, 1597–1604. [CrossRef]
13. Rusek, F.; Persson, D.; Buon, K.L.; Larsson, E.G.; Marzetta, T.L.; Edfors, O.; Tufvesso, F. Scaling up MIMO:
Opportunities and challenges with very large arrays. IEEE Signal Process. Mag. 2013, 30, 40–60. [CrossRef]
14. Biswas, S.; Xue, J.; Khan, F.A.; Ratnarajah, T. Performance Analysis of Correlated Massive MIMO Systems
with Spatially Distributed Users. IEEE Syst. J. 2016, 12, 1850–1861. [CrossRef]
15. Khan, I. Channel modeling and analysis of OWC-massive MIMO Systems. Opt. Commun. 2018, 434, 209–217.
[CrossRef]
16. Marzetta, T.L. Noncooperative cellular wireless with unlimited numbers of base station antennas. IEEE Trans.
Wirel. Commun. 2010, 9, 3590–3600. [CrossRef]
17. Jose, J.; Ashikhmin, A.; Marzetta, T.L.; Vishwanath, S. Pilot contamination and precoding in multi-cell TDD
systems. IEEE Trans. Wirel. Commun. 2011, 10, 2640–2651. [CrossRef]
18. Ashikhmin, A.; Marzetta, T.L. Pilot contamination precoding in multi-cell large-scale antenna systems.
In Proceedings of the IEEE International Symposium on Information Theory Proceedings (ISIT), Cambridge,
MA, USA, 1–6 July 2012; Volume 1, pp. 1137–1141.
19. Liangbin, L.; Ashikhmin, A.; Marzetta, T.L. Pilot contamination precoding for interference reduction in
large-scale antenna systems. In Proceedings of the 51th Annual Allerton Conference on Communication,
Control, and Computing, Monticello, IL, USA, 2–4 October 2013; Volume 2, pp. 226–232.
20. Nguyen, T.M.; Le, L.B. Joint pilot assignment and resource allocation in multicell massive MIMO network:
Throughput and energy efﬁciency maximization. In Proceedings of the IEEE Wireless Communications and
Networking Conference (WCNC), New Orleans, LA, USA, 9–12 March 2015; pp. 393–398.
21. Cheng, H.V.; Björnson, E.; Larsson, E.G. Optimal Pilot and Payload Power Control in Single-Cell Massive
MIMO Systems. IEEE Trans. Signal Process. 2017, 65, 2363–2378. [CrossRef]
22. Dao, H.T.; Kim, S. Pilot power allocation for maximizing the sum rate in massive MIMO systems.
IET Commun. 2018, 12, 1367–1372. [CrossRef]
23. Elijah, O.; Leow, C.Y.; Rahman, T.A.; Nunoo, S.; Iliya, S.Z. A Comprehensive Survey of Pilot Contamination
in Massive MIMO—5G System. IEEE Commun. Surv. Tutor. 2015, 18, 905–923. [CrossRef]
24. Hoydis, J.; Brink, S.T.; Debbah, M. Massive MIMO in the UL/DL of cellular networks: How many antennas
do we need? IEEE J. Sel. Areas Commun. 2013, 31, 160–171. [CrossRef]
25. Khan, I.; Zafar, M.H.; Ashraf, M.; Bayati, A.K.S. Computationally Efﬁcient Channel Estimation for 5G
Massive Multiple-Input Multiple-Output Systems. Electronics 2018, 7, 382. [CrossRef]
26. Yin, H.; Gesbert, D.; Filippou, M.; Liu, Y. A coordinated approach to channel estimation in large-scale
multiple-antenna systems. IEEE J. Sel. Areas Commun. 2012, 31, 264–273. [CrossRef]
27. Neumann, D.; Joham, M.; Weiland, L.; Utschick, W. Low-complexity computation of LMMSE channel
estimates in massive MIMO. In Proceedings of the 19th International ITG Workshop on Smart Antennas,
Ilmenau, Germany, 3–5 March 2015; pp. 1–6.

75
Electronics 2019, 8, 55

28. Ngo, H.; Larsson, E.G. EVD-based channel estimation in multicell multiuser MIMO systems with very
large antenna arrays. In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal
Processing, Kyoto, Japan, 25–30 March 2012; pp. 3249–3252.
29. Jin, S.; Wang, Z.; Zheng, L.; Wong, K.; Huang, Y.; Tang, X. On massive MIMO zero-forcing transceiver using
time-shifted pilots. IEEE Trans. Veh. Technol. 2016, 65, 59–74. [CrossRef]
30. Xiong, X.; Jiang, B.; Gao, X.; You, X. QoS-Guaranteed User Scheduling and Pilot Assignment for Large-Scale
MIMO-OFDM Systems. IEEE Trans. Veh. Technol. 2015, 65, 6275–6289. [CrossRef]
31. Dai, X. Optimal training design for linearly time-varying MIMO/OFDM channels modeled by a complex
exponential basis expansion. IET Commun. 2007, 1, 945–953. [CrossRef]
32. Saxena, V.; Fodor, G.; Karipidis, E. Mitigating pilot contamination by pilot reuse and power control schemes
for massive MIMO systems. In Proceedings of the IEEE 81st Vehicular Technology Conference, Glasgow, UK,
11–14 May 2015; pp. 1–6.
33. Zhu, X.; Wang, Z.C.; Dai, L.L.; Qian, C. Smart pilot assignment for massive MIMO. IEEE Commun. Lett. 2015,
19, 1–4. [CrossRef]
34. Jin, S.; Li, M.; Huang, Y.; Du, Y.; Gao, X. Pilot scheduling schemes for multi-cell massive
multiple-input-multiple-output transmission. IET Commun. 2015, 9, 689–700. [CrossRef]
35. Litvinenko, L. OFDM signal PAPR reduction by pre-scrambling and clipping for frequency domain
comb-type channel estimation case. In Proceedings of the 23rd International Conference Radioelectronika,
Pardubice, Czech Republic, 16–17 April 2013; pp. 273–277.
36. Yang, B.; Deng, C.J.; Wu, P.H.; Xi, J.; Shi, L. Image encryption algorithm depending on two-one-dimensional
logistic chaotic inter-scrambling systems and m-sequence. In Proceedings of the IEEE International
Conference on Software Engineering and Service Science (ICSESS), Beijing, China, 27–29 June 2014;
pp. 521–524.

© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access
article distributed under the terms and conditions of the Creative Commons Attribution
(CC BY) license (http://creativecommons.org/licenses/by/4.0/).

76
electronics
Article
An Efﬁcient Pilot Assignment Scheme for Addressing
Pilot Contamination in Multicell Massive
MIMO Systems, Shamala Subramaniam and Ali
Mohammed Mansoor
Ahmed S. Al-hubaishi 1, *, Nor Kamariah Noordin 1,2 , Aduwati Sali 1,2 , Shamala Subramaniam 3,4
and Ali Mohammed Mansoor 5
1 Department of Computer and Communication Engineering, Universiti Putra Malaysia, Serdang 43400, Malaysia;
[email protected] (N.K.N.); [email protected] (A.S.)
2 Wireless and Photonics Networks Laboratory, Universiti Putra Malaysia, Serdang 43400, Malaysia
3 Department of Communication Technology and Network, Universiti Putra Malaysia, Serdang 43400, Malaysia;
[email protected]
4 Sports Academy, Universiti Putra Malaysia, Serdang 43400, Malaysia
5 Faculty of Computer Science & Information Technology, University of Malaya, Kuala Lumpur 50603, Malaysia;
[email protected]
* Correspondence: [email protected]; Tel.: +60-967-777-888-129

Received: 25 February 2019; Accepted: 21 March 2019; Published: 27 March 2019

Abstract: The reuse of the same pilot group across cells to address bandwidth limitations in a network
has resulted in pilot contamination. This causes severe inter-cell interference at the targeted cell. Pilot
contamination is associated with multicell massive multiple-input multiple-output (MIMO) systems
which degrades the system performance even when extra arrays of antennas are added to the network.
In this paper, we propose an efﬁcient pilot assignment (EPA) scheme to address this issue by maximizing
the minimum uplink rate of the target cell’s users. To achieve this, we exploit the large-scale characteristics
of the fading channel to minimize the amount of outgoing inter-cell interference at the target cell. Results
from the simulation show that the EPA scheme outperforms both the conventional and the smart pilot
assignment (SPA) schemes by reducing the effect of inter-cell interference. These results, show that
the EPA scheme has signiﬁcantly improved the system performance in terms of achievable uplink rate
and cumulative distribution function (CDF) for both signal-to-interference-plus-noise ratio (SINR), and
uplink rate.

Keywords: pilot contamination; massive MIMO; pilot assignment; large-scale fading coefﬁcients

1. Introduction
Equipping the base station (BS) with a large number of antennas (also known as massive
multiple-input multiple-output (MIMO)) has been considered one of the fundamental technologies that
leads to 5G [1]. The introduction of this technology is to meet the increasing demand for mobile data
in 5G [2]. Although the use of massive MIMO systems increases spectral efﬁciency, enhances energy
efﬁciency, and reduces the effect of small scale fading [3–7], but invariably promotes pilot contamination.
In massive MIMO, time-division duplex (TDD) protocol is preferred over the frequency-division duplex
(FDD) [8,9], as the former allows channel estimation in one direction (i.e., uplink) and avoids the estimation

Electronics 2019, 8, 372; doi:10.3390/electronics8040372 www.mdpi.com/journal/electronics

77
Electronics 2019, 8, 372

of the other side (i.e., downlink) due to channel reciprocity property. In other words, the use of TDD
based channel reciprocal minimizes the overhead signals used for channel estimation, which largely saves
network bandwidth. Although, the channel estimation ensures high utilization of TDD massive MIMO
via uplink transmission, but its channel coherence blocks are restricted in size (limited size). Therefore, the
orthogonal pilot sequences cannot be allocated for all users among the cells. To overcome this problem,
the orthogonal pilot sequences have to be reused across the cells. Although, pilot reuse approach is a
remarkable way forward in addressing the associated problem, however, the channel estimate obtained in
a given cell will be contaminated by pilots transmitted by users in other cells. Speciﬁcally, the inter-cell
interference exacerbates the estimation error and also makes sure the channel estimation of two or more
users sharing the same pilot sequence is correlated at a given cell [10]. Thus, with multicell massive MIMO
systems, its performance deteriorates during uplink and downlink transmission. This issue is referred to
as pilot contamination, and depicted in Figure 1.

Figure 1. The effect of pilot contamination in multicell massive MIMO systems at a cell a, where the solid
line represents the direct gain and the dotted line represents the inter-cell interference.

To address the issue associated with pilot contamination, several research methods have been
proposed to eliminate/relieve pilot contamination. Among these methods, the pilot assignment technique
is identiﬁed to be a potential technique for solving this problem. Smart pilot assignment (SPA) method
proposed by [11], focused on adjusting the combination between the users and pilot sequences, but did
not consider inter-cell interference which causes the pilot contamination. In this paper, we propose an
efﬁcient pilot assignment mechanism to improve the performance of users with respect to intense pilot
contamination in multicell massive MIMO systems. We summarize our contributions below:

• We formulate the pilot assignment as an optimization problem and develop a heuristic algorithm, in
order to maximize the minimum throughput considering the reduction in the inter-cell interference
pilot contamination.
• We evaluate the performance of the proposed mechanism in terms of signal-to-interference-plus-noise
ratio (SINR), and uplink rate with an extensive MATLAB simulation.
• We compare our work with SPA and other conventional schemes.

The rest of this paper is organized as follows. The related work is summarized in Section 2, the system
model is described in Section 3, the pilot contamination phenomenon and the achievable uplink rate are
illustrated in Section 4, the EPA scheme is explained in Section 5, the simulation results are depicted in
Section 6, and ﬁnally, this paper is concluded in Section 7.
Notation: Throughout this paper, the bold lower case letters represent vectors and matrices are
represented by bold upper case letters. I M denotes the identity matrix of dimensions M × M. The operators

78
Electronics 2019, 8, 372

(.)−1 , (.) T , and (.) H are deﬁned for inverse, transpose and conjugate transpose operations, respectively.

The expectation operator is represented by E . .

2. Related Work
Different traditional algorithms based on pilot assignment have been proposed for pilot
decontamination [12,13]. A vertex graph-coloring-based pilot assignment has been proposed in [12], the
pilot sequences are allocated to the users according to the inter-cell interference (ICI) graph. The evaluation
of ICI graph depends on both angle of arrival (AoA) correlation and distances between users. However,
this scheme requires a second order channel information to construct ICI graph. A deep learning-based
pilot allocation scheme (DL-PAS) is proposed in [13] to address the pilot contamination problem in massive
MIMO systems. This algorithm aims at learning from the relationship between pilot assignment and users’
location. However, the DL algorithm requires high data and subsequently takes a longer time to process
the data.
The authors in [14,15] developed the location-based pilot assignment approaches for pilot
decontamination. A new expression for line of sight (LOS) interference is derived in [14] which is
considered as the criteria for pilot allocation. Although, there was an improvement in the sum spectral
efficiency (SE), but the pilot assignment process takes a longer time to be implemented, especially in large
networks. The work in [15] characterizes the angular region of the targeted user, and the pilot assignment
process was implemented with the aim of making this region interference-free. This angular region is
characterized by both the number of BS antennas and the location of the targeted user. However, the pilot
assignment problem is formulated by the joint optimization problems which subsequently introduce high
computational complexity.
In [16,17], the pilot allocation based pilot reuse (reuse factor more than 1) is also considered for pilot
contamination’s elimination technique. A systematically-constructed pilot reuse method is proposed
in [16]. In this approach, the neighbor cells are allowed to use different sets of pilot sequences according to
the tree division. To improve performance, it ensures larger distance between cells that share similar pilot
sets, the depth of the tree is increased as the pilot contamination severity increases. This approach offers
an effective performance when the ratio of the channel coherence time to the number of users in each cell
is relatively large. For the purpose of improving the quality of service (QoS) of the edge users, a soft pilot
reuse (SPR) scheme was proposed by [17]. The channel quality for each user is initially compared with a
determined threshold before the pilot allocation procedure, but an increase in complexity was recorded
due to additional computational cost incurred by finding the optimal threshold value.
By considering a fairness among users in order to mitigate the pilot contamination, pilot allocation
schemes were proposed in [18,19]. Specifically, to maximize the sum rate of the system and guarantee
fairness among users, a pilot allocation scheme was proposed by [18]. An optimization problem is
formulated based on a max-product criterion, then both min-leakage algorithm and user- exchange
algorithm based on greedy (UEBG) pilot allocation were suggested to solve the optimization problem.
Although this scheme almost achieves the same performance as the optimal exhaustive search algorithm
(ESA), it still suffers a setback due to high complexity. For the purpose of pilot contamination mitigation
in [19], the pilot assignment scheme based on the harmonic SINR utility function was introduced to
regulate the fairness among users. However, the system complexity increases as the number of users and
network size grows (more than two cells).
Based on performance degradation of users, a pilot assignment scheme has been proposed in [20],
the degradation performance is initially evaluated for all users according to the value of the uplink
achievable rate. Therefore, the optimal pilot sequences were assigned to users who suffered from the
highest degradation in a greedy way. Obviously, this scheme is not effective in bad channel conditions.

79
Electronics 2019, 8, 372

In [11,21], the pilot allocation approaches aim at enhancing the performance of users who suffer from
bad SINR. The pilot allocation in [21] focused on maximizing the sum capacity of the whole system for
pilot decontamination. In this work, the pilot sequences were assigned initially to the users who have bad
channel condition. However, the complexity of the pilot assignment procedure increases as the network
size is increased. A SPA scheme is proposed in [11] to improve the performance of users with poor SINR.
Users with low channel quality were assigned to pilot sequences which resulted in a low interference.
However, the achievement of this scheme is limited as it did not consider inter-cell interference which
causes the pilot contamination.
Some authors have tried to make a combination of two schemes to get an improved performance
as shown in [22,23]. As such, a joint pilot assignment scheme has been proposed by [22], in which
time-shifted [24] and the SPA [11] schemes were combined in order to mitigate the effect of pilot
contamination. Inter-group interference is suppressed according to [24] strategy, whereas SPA is used
to reduce intra-group interference. Although an improved overall performance was recorded, the
mutual interference between downlink data and uplink pilot signals cannot be eliminated despite the
use of SPA scheme. New pilot assignment schemes such as greedy-based and swapping-based were
implemented together with pilot contamination precoding design (PCP) for massive MIMO downlinks [23].
This combination offers a considerable improvement over the random pilot assignment, but the PCP
matrix is changed according to the update in pilot assignment information.
By exploiting the channel sparsity for wideband massive MIMO system, the pilot contamination
can be removed with the help of pilot assignment policy in [25]. The pilot assignment policy is designed
to help identify the subspace of the desired channel. The difﬁculty in this approach, lies on how to
deal with the subspace estimation, which can be realized through multiple frames after randomizing the
pilot contamination.
Differing from the aforementioned works [11,20], we consider the source of inter-cell interference
throughout pilot assignment, which is essentially the cause of the pilot contamination. In some other
works [12–15], the availability of some factors (e.g., user location, AoA, or LOS interference) are needed
for pilot assignment which are not always easy to estimate, while our approach requires only large-scale
fading coefﬁcients, which can be tracked easily as they do not frequently change during coherence interval.
Besides, comparing to previous works [17–19,21], our algorithm is not computationally intensive, and
therefore it can be applied for large-scale networks.

3. System Model
In this section, we describe the system model under which the TDD-massive MIMO systems are
implemented. In this model, the uplink comprises L cells, in which each cell contains a BS equipped with
M antennas. Furthermore, in each cell coverage area K single-antenna users communicate simultaneously
to their designated BS, assuming that M K [2,5]. The propagation channels connecting the k-th user
located in the j-th cell to the BS in i-th cell is modeled as Rayleigh block fading [26] and the channel vector
hijk ∈ C M×1 is denoted as:
hijk = gijk βij . (1)
k

where gij and βij denote the small scale-fading vector and large-scale fading coefficient, respectively.
k k
The small scale-fading vector has a complex Gaussian distribution with zero mean and unity variance, CN
(0, I M ) , while the large-scale fading coefficient is referred to the effect of both path-loss and shadowing
and it can be tracked easily as it changes slowly during coherence interval τc = Bc Tc [27–29]. We use
Bc and Tc to denote the coherence bandwidth and the coherence time, respectively. Figure 2 illustrates
the coherence block for TDD protocol. We also consider that large-scale fading coefficient is equal for

80
Electronics 2019, 8, 372

all antenna elements, assuming that the distance between user k and BS is signiﬁcantly larger than the
distances between antenna elements.

Figure 2. Time-division duplex (TDD) Protocol.

4. Pilot Contamination and Achievable Uplink Rate

Since the size of channel coherence blocks is limited, it is difﬁcult to assign orthogonal pilot sequences
to all users in order to prevent pilot contamination. Thus, it is necessary to reuse the pilot sequences in
all cells to overcome this limitation [2]. The pilot sequences Φ = [φ1 , φ2 , ..., φK ] T ∈ C M×τp are assumed
mutually orthogonal Φ T Φ = τp IK with length of τp . During the pilot phase, the pilot sequences are
φ
distributed randomly to all users. Thus, the received signal Ui ∈ C M×τp at the BS in the i-th cell can be
written as:
L K
φ √ φ
Ui = ρφ ∑ ∑ hijk φkT + Ni (2)
j =1 k =1

Ki L Kj
φ √ √ φ
Ui = ρφ ∑ hiik φkT + ρφ ∑ ∑ hijk φkT + Ni (3)
k =1 j =1 k =1
j =i

φ
where ρφ denotes the pilot transmission power, and Ni ∈ C M×τp denotes the additive white Gaussian
noise (AWGN) matrix which is assumed independent and identically distributed (i.i.d) random variables
whose elements have zero mean and variance σN 2 . The received signal Uφ is called the observation, in which
i
the BS in the cell i can use it to estimate the channel responses. The ﬁrst term in (3) represents the received
pilot signals from users in the serving cell, whereas, the middle term represents the inter-cell interference
signal from the neighbor cells, which causes the pilot contamination. Correspondingly, the received uplink
data uid ∈ C M at the BS in the i-th cell can be represented by:

√ L K
uid = ρu ∑ ∑ hijk xujk + niu (4)
j =1 k =1

where x ujk denotes the uplink transmitted symbol from user k located in the j-th cell, ρu denotes the power

of the uplink transmitted symbol with E | x ujk |2 = 1, and niu ∈ C M×τu denotes the AWGN vector with
variance σn2 and zero mean value. The minimum mean square error (MMSE) is exploited for the purpose
of the channel estimation ĥijk ∈ C M×1 [9]. Therefore, the MMSE estimated channel vector ĥijk based on the
φ
observation Ui in (2) can be given as [10]:
√ p
ĥijk = ρu Rijk Ψijk uijk (5)

and
−1
Ψijk = ∑ ρu τp Rijk + σu2 IK (6)
j,k

81
Electronics 2019, 8, 372

φ
where uijk = Ui φk∗ which is called the received proceed signal, Ψijk denotes the inverse of the
p

normalized correlation matrix, and Rijk denotes the spatial correlation matrix of the channel to be estimated,
Rijk = E[hijk hiH
jk ].
The estimated channel is then used to detect the uplink data symbol and precode the downlink data.
Herein, we consider both maximum ratio combining (MRC) and zero forcing (ZF) as a linear detectors at
the BS which are given by [30]:

Ĥii MRC
Ai = −1 (7)
Ĥii Ĥii H Ĥii ZF

The received detected signal is evaluated by multiplying the received uplink data signal uid by the
decoding vector aiH i
ik , which represents the k-th column of the matrix Ai and hik is the k-th column of the
matrix Hii . Therefore, the detected symbol of user k at a given BS located in a cell i can be expressed as:

√ L K
u
ziik = aiH
ik ui = aik
d iH
ρu ∑ ∑ hijk xujk + niu (8)
j =1 k =1

Ki Kj
√ √ √ L
u
ziik = ik hik xik +
ρu aiH i u
ρu ∑ aiH
ik hin xin +
i u
ρu ∑ ∑ aiH
ik h jk x jk + aik ni
i u iH u
(9)
n =1 j =1 k =1
n =k j =i

The ﬁrst term in (9) represents the desired signal, the second one represents the intra-cell interference,
the third term is the effect of pilot contamination (inter-cell interference), and the last one represents the
uncorrelated noise.
Consequently, the average SINR of the k-th user in the target cell i can be evaluated as:

ρu |aiH
ik hik |
i 2
u
SI NRik = (10)
Kj υik
i
ρu ∑ Lj=1 ∑k=1 |aiH
ik h jk | +
i 2
ρu
j =i

and
Ki
i
υik = ρu 2 ∑ |aiH
ik hin | + ρu || aik ||
i 2 i 2
n =1
n =k

where υik
i denotes the intra-cell interference and uncorrelated noise, in which their effect is almost neglected

as the number of antennas increases (M → ∞) [5]. Then, the uplink SINR can be described by large-scale
fading coefﬁcients βijk as follows:

2
βiik
u
SI NRik = 2
, when (M → ∞) (11)
∑ j =i βijk

It is clear from the above expression that the effect of small-scale fading and thermal noise are
averaged out as the number of antennas is increased [5]. Therefore, the ergodic achievable uplink rate of
the user k according to [26] is:
1
Tc ∑
u
Rik = (1 + SI NRik
u
) (12)
τu

82
Electronics 2019, 8, 372

u is calculated in bit/channel use and τ refers to the uplink duration. From (12), it is obvious that
where Rik u
the average uplink rate of multicell massive MIMO systems is limited due to pilot contamination and it
cannot be boosted by increasing either the number of serving antennas or both ρu and ρ p .

5. Proposed Scheme
In this section, an efﬁcient heuristic algorithm is developed for addressing the multicell massive
MIMO associated problem. To do this, the assignment and the reuse of pilot group across cells in the
network is formulated as an optimization problem.

5.1. Problem Formulation

Formally, we formulate an optimization problem as depicted in:

2
M→∞ βiik
P −−−→ P̂ : max
∀ k ∈i min 2
(13)
min
∀φk ∑ j =i βijk

The above optimization problem is based on the method proposed by [11]. In this method, it is
assumed the number of antennas is very large and as such make use of the large-scale fading coefficients
βijk . To address problems related to pilot contamination, this study concentrates on assigning the pilot
sequences for a specific cell in multicell massive MIMO systems. In the target cell, the number of possible
iterations is defined by the number of K users which is usually very high. In contrast, the conventional
scheme assigns the pilot sequences Φ = [φ1 , φ2 , ..., φK ] T randomly to K users.
The performance of multicell massive MIMO systems is much degraded by the effect of the strong
inter-cell interference from the neighbor cells and is exacerbated when the channel quality of the users
in target cell is poor. Specifically, in the SPA scheme, the set of users with the worst channel quality are
assigned pilot sequences with the lowest inter-cell interference. Although these pilot sequences have the
lowest interference, they are still considered high interference pilot sequences when used by users which
have bad channel quality. Therefore, the interference that is associated with such pilot sequences must
be minimized.

5.2. Proposed Solution

To achieve minimal outgoing inter-cell interference among neighbor cells for the target cell, we ensure
a weak channel cross gain of the interfering users against desired users. The large-scale fading coefficients
are used to measure the effect of inter-cell interference at the target BS. Thus, the effect of inter-cell
interference can be measured by using these fading coefficients as it changes progressively during the
coherence interval τc , as every user measured result is sent to its corresponding BS. The required conditions
for finding the large-scale fading coefficients can be met in long term evolution-advanced (LTE-A) systems.
These corresponding BSs contain the channel’s information for the available BSs. The user keeps tracking
these BSs until a reliable BS is identified for suitable handover. To enhance the cooperation among the
BSs, we assume acquisition of the coordinated multi points (CoMP). Furthermore, a mobility management
entity (MME) is connected to BSs by S1 interface and has a huge ability for computing. As a result, this
unit can collect the large-scale fading coefficients from the connected BSs [31,32].
To abate the effect of setback suffered by users due to poor channel quality or high interference,
the SINR is optimized. This is done by assigning the pilot sequence, which is associated with low
interference, to users having poor channel quality.

83
Electronics 2019, 8, 372

In order to achieve this, we propose a heuristic algorithm based on SPA to solve the optimization
problem in (13). Before illustrating the algorithm, we need to deﬁne a set of parameters η jk which
characterizes the squared cross gain of the interfering users from neighboring cells:

2
η jk = βijk , k = 1, 2, ..., K, j = 1, 2, ..., L and j = i

The interference that is produced by users who shared the same pilot sequence φk can be evaluated
at the target cell as:
ξ k = ∑ η jk (14)
j =i

In addition, the set of parameters k is used to characterize the square channel quality of the target
cell’s users which can be expressed by:

2
k = βiik , k=1,2,...,K

So, the optimization problem in (13) can be re-written as the following:

M→∞ k
P −−−→ P̂ : max
∀ k ∈i min min (15)
∀φk ∑ j =i η jk

The proposed algorithm EPA is summarized in Algorithm 1 to solve the above optimization problem.

Algorithm 1 Efﬁcient Pilot Assignment (EPA).

1: Input:
2: βijk ∀i, j and k
3: Output:
4: Assigning pilot sequences Φ for all users in all j cells ∀ j = 1, 2, ..., L
5: Procedure:
for each neighbor cells j = i do
for all users K in cell j do
2
6: Evaluate: η jk = βijk , k = 1, 2, ..., K
end for
7: Classifying the users into different levels: V1 , V2 , ..., Vk , ..., VK .
Vk = [ η1k , η2k , ..., η jk , ..., η( L−1)k ]
8: Assign the pilot sequence φk to the users in Vk .
end for
9: Find the sum: ξ k = ∑ j =i η jk , ξ k ∈ [ξ 1 , ξ 2 , ..., ξ K ] ∀Φ
for each user in the target cell i do
2
10: Evaluate k = βiik , k = 1, 2, ..., K
end for
11: Sort k in descending order: 1 ≥ 2 , ..., ≥ k , ..., ≥ K
12: Sort ξ k in descending order: ξ 1 ≥ ξ 2 ≥, ..., ≥ ξ k , ..., ≥ ξ K
13: Assigning the pilot sequence φk ,which associates with ξ k , to the user who has k .

84
Electronics 2019, 8, 372

The available large scale fading coefficients are exploited to measure the interference from the neighbor
cells. From the above algorithm, the users in the neighbor cells are classified into different levels according
to the value of squared cross gain (η jk ), which gives an indication of the strength of the interference at
the target cell i. The users that cause the highest interference (which have the largest η jk ) are classified
as the level V1 users. This level involves the worst interfering users from each neighbor cell. The second
level V2 contains the users which cause less interference than that in V1 . This classification process will
continue until the last level VK , which contains the users that produce the smallest interference. The k-th
interference level can be represented by:

Vk = [ η1k , η2k , ..., η( L−1)k ] (16)

The amount of interference that is produced by the users in each level is described by (14). After that,
the interfering users in each level are assigned the same pilot sequence. For instance, the users in V1
and VK are assigned the pilot sequences φ1 and φK , respectively. As a result, the pilot sequence φ1 is
suffering from the highest interference, whereas φK is the one with the lowest interference. The remaining
pilot sequences have different levels of interference between φ1 and φK . After minimizing the inter-cell
interference at the serving BS, the second step is to assign pilot sequences to its users and this can be
achieved by solving the following formula:

M→∞ k
P −−−→ P̂ : max
∀ k ∈i min (17)
ξk

Obviously, from the EPA algorithm, the pilot assignment process for the users of the target cell
depends on, both the squared channel quality k and the minimized outgoing interference ξ k , which is
caused by users sharing the same pilot sequence in the level Vk . The optimization problem in (17) can be
solved with the help of the SPA algorithm. In this algorithm, users that suffer setbacks due to bad channel
quality are exempted from the pilot sequence as it will cause severe interference. Thus, the sets of users
with the worst channel quality are assigned pilot sequence with the lowest inter-cell interference. For the
remaining cells, the process will continue in a sequential way, excluding the cells that are already included
with the target cell.
Furthermore, our algorithm is not computationally intensive in the sense that it ultimately relies on
cell sorting, thus the time complexity it incurs is O (L K log K), and therefore it works faster if compared to
recent schemes. For example, EPA shows less computational complexity than the work in [19,21], which
incur O (L K3 ) and O (L2 K log K), respectively. In addition, the scheme in [17] incurs O (M(K2 e + K2 CS ),
where Ke denotes the number of edge users in the network, and KCS represents the number of users in the
largest cell. So apparently [17] is much more intense than EPA. The SPA scheme [11], as it is fundamentally
limited to only a target cell optimization, unsurprisingly it incurs only O ( K log K).

6. Simulation Results
The base code implemented is obtained from [26], while Monte Carlo simulation is used to evaluate
the performance of the EPA scheme. A typical hexagonal cellular network made up of L cells is considered
in the EPA scheme. Each of these cells comprises of a BS which is equipped with M number of antennas
and K users with single antennas under its coverage area [2,5]. A center cell surrounded by all other cells
is considered as a target cell. The system parameters are summarized in Table 1. The parameter βiik is
modeled in decibel as [10]:
dijk
βiik = Υ + 10 α log10 + Fjki (18)
1 km

85
Electronics 2019, 8, 372

where dijk (km) is the distance between the k-th user in the j-th cell and the BS in the i-th cell, α is the
path-loss exponent, Υ determines the median channel gain at 1 km as a reference distance which can be
calculated according to many propagation models [33], and Fijk N (0, σ2s f ) is the shadow fading which
creates log-normal random variations around the nominal value Υ + 10 α log10 (dijk /1 km).

Table 1. Simulation parameters.

Parameter Value (for Figures 3–12) Value (for Figure 13)

Number of Cells L 7 7
Number of BS Antennas M 8 ≤ M ≤ 512 8 ≤ M ≤ 512
Number of users in each cell K 8 20
Cell RadiusR 500 m 300 m
Cell edge SNR 15 dB 20 dB
Path-loss Exponentα 3 3
Shadow Fading Standard Deviation σs2f 8 8
Thermal Noise Variance −174 dBm/H −174 dBm/H

We evaluate the SPA [11] and the conventional schemes [2,5] against the EPA scheme. Figure 3
depicts the average uplink rate per user of the EPA, SPA and conventional schemes against the number
of BS’s antennas using the ZF as a linear detector. Obviously, the average uplink rate of the EPA scheme
outperforms the other schemes. This improvement can be attributed to the policy implemented for pilot
assignment in the neighbor cells. This implemented policy ensures a signiﬁcant reduction of the inter-cell
interference at the serving BS, which invariably leads to a better throughput. Due to the pilot assignment
in the target cell which was executed according to the users’ channel quality, the SPA scheme achieves
better performance than the other conventional scheme. However, the performance of both SPA and
conventional schemes changes slightly when the number of antennas exceeds certain points (e.g., greater
than 150).

3.5

Conventional Scheme
SPA
3
EPA
Average Rate per User [bit/channel use]

2.5

1.5

0.5

0
50 100 150 200 250 300 350 400 450 500
Number of Receive Antennas per Cell

Figure 3. The average uplink rate per user with zero forcing (ZF) for different numbers of antennas.

86
Electronics 2019, 8, 372

Figure 4 shows the impact of the EPA scheme when using the MRC as a linear detector. It can be
clearly observed that the average uplink rate per user (bits/channel use) is substantially enhanced by
the EPA scheme when the number of antennas is increased. The superiority of the EPA scheme over
other schemes, arose as a result of the minimization of the inter-cell interference that comes from the
neighbor cells. This is achieved by allowing the users in each interference level Vk to share the same pilot
sequence. Consequently, EPA scheme has shown a low interference from the neighbor cells compared to
the other schemes.

3
Conventional Scheme
SPA
EPA
2.5
Average Rate per User [bit/channel use]

1.5

0.5

0
50 100 150 200 250 300 350 400 450 500
Number of Receive Antennas per Cell

Figure 4. The average uplink rate per user with maximum ratio combining (MRC) for different numbers
of antennas.

Figure 5 depicts the performance of the EPA scheme when compared with both conventional and SPA
schemes in terms of cumulative distribution function (CDF) of the average SINR. When the number of BS’s
antennas is 64 with ZF detector, the probabilities of the average uplink SINR being less than −10 dB for
the conventional, the SPA, and the proposed EPA schemes are almost 80%, 26.25%, and 10%, respectively.
The improvement is achieved because the effect of the interference, which is associated with the pilot
sequences, on channel quality of the users in the target cell became slight, which effectively increased the
SINR of the system.
Figure 6 depicts the CDF of the minimum SINR when M is 64. It is evident that the minimum SINR
of the EPA scheme is signiﬁcantly improved when compared with SPA and conventional schemes. For
example, the probability of the minimum SINR to be less than −20 dB for the EPA scheme is approximately
16.25%, while this probability is about 34.6% and 79.6% for the SPA and the conventional schemes,
respectively. The reason behind this improvement is due to assigning the pilots of the users with the
lowest interference, in the neighbor cells, to the users who have bad channel quality in the target cell.
In consequence, the performance of these users was improved due to the reduction of their interference.

87
Electronics 2019, 8, 372

1
Conventional Scheme
0.9 SPA
EPA

0.8

0.7

0.6
CDF

0.5

0.4

0.3

0.2

0.1

0
-25 -20 -15 -10 -5 0 5 10 15
Average SINR

Figure 5. The cumulative distribution function (CDF) of the average signal-to-interference-plus-noise ratio
(SINR) when M = 64 using ZF.

1
Conventional Scheme
0.9 SPA
EPA

0.8

0.7

0.6
CDF

0.5

0.4

0.3

0.2

0.1

0
-70 -60 -50 -40 -30 -20 -10 0 10
Min. SINR

Figure 6. The CDF of the min. SINR when M = 64 using ZF.

Figures 7 and 8 depict the CDF of average and minimum SINR, respectively, using MRC detector when
M is 64. As observed from Figures 7 and 8, the EPA scheme outperforms the SPA and the conventional

88
Electronics 2019, 8, 372

schemes. As shown in Figure 7, the EPA scheme increases the average SINR by 1.8 dB over the SPA scheme,
whereas it increases up to 4.69 dB for minimum SINR, as illustrated in Figure 8.

1
Conventional Scheme
0.9 SPA
EPA

0.8

0.7

0.6
CDF

0.5

0.4

0.3

0.2

0.1

0
-30 -25 -20 -15 -10 -5 0 5
Average SINR

Figure 7. The CDF of the average SINR when M = 64 using MRC.

1
Conventional Scheme
0.9 SPA
EPA
0.8

0.7

0.6
CDF

0.5

0.4

0.3

0.2

0.1

0
-70 -60 -50 -40 -30 -20 -10 0
Min. SINR

Figure 8. The CDF of the min. SINR when M = 64 using MRC.

From Figures 5–8, the minimum SINR always achieves a better performance. In other words,
the performance of edge users is signiﬁcantly enhanced. This is due to the fact that the inter-cell interference
has been greatly reduced at the target cell while the users with poor channel quality are assigned to the

89
Electronics 2019, 8, 372

suitable pilot sequences in order to maximize its SINR. Moreover, the results obtained as a result of using
ZF and MRC linear detections, are approximately comparable when run on the same parameters setting.
This is because the inter-cell interference is greatly reduced by the EPA scheme that runs before the process
of signal detection.
By using ZF detector, the performance of the EPA scheme has been examined in terms of the CDF of
the average uplink rate when M is 64, as shown in Figure 9. It can be seen that the performance of the
CDF in the conventional scheme is highly inﬂuenced by the pilot contamination. The assignment of the
pilot randomly, has led to the worst performance compared to the SPA and the EPA schemes. On the other
hand, the EPA scheme outperforms the SPA and the conventional schemes, since the effect of users who
cause the highest interference is considered weak compared to users having good channel quality when
they are assigned the same pilot sequence. As a result, these interfering users are excluded from sharing
the same pilots of users with bad channel quality.

0.9
Conventional Scheme
0.8 SPA
EPA

0.7

0.6
CDF

0.5

0.4

0.3

0.2

0.1

0
0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5
Average Throughput

Figure 9. The CDF of the average uplink rate when M = 64 using ZF.

Result of evaluation for the CDF of the minimum uplink rate is depicted in Figure 10. It is clear that
the EPA scheme performs better than the other schemes. For example, the minimum uplink rate of the
EPA scheme is doubled when compared to the SPA scheme. This improvement has been achieved because
the interference associated with pilot sequences, which is allocated to users with bad channel quality, was
reduced effectively by the EPA scheme.
Figures 11 and 12 represent the CDF of the average and the minimum uplink rate, respectively, when
the MRC is utilized and M is 64. The EPA scheme achieves the highest performance when compared
with other schemes, especially in the minimum uplink rate. Speciﬁcally, the achieved gain in minimum
uplink rate is doubled while it is 1.2 times in average uplink rate in comparison with SPA. The reason for
this improvement in the minimum uplink rate is due to the priority given to the users having the worst
channel quality during the pilot assignment process.

90
Electronics 2019, 8, 372

0.9
Conventional Scheme
SPA
0.8 EPA

0.7

0.6
CDF

0.5

0.4

0.3

0.2

0.1

0
0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6 1.8 2
Min. Throughput

Figure 10. The CDF of min. uplink rate when M = 64 using ZF.

0.9

0.8
Conventional Scheme
SPA
0.7
EPA

0.6
CDF

0.5

0.4

0.3

0.2

0.1

0
0 0.5 1 1.5 2 2.5
Average Throughput

Figure 11. The CDF of average uplink rate when M = 64 using MRC.

In order to verify the effectiveness of the EPA scheme, the average uplink rate against the
number of antennas has been evaluated in Figure 13 with different parameters, considering ZF detector.
These parameters, which are shown in Table 1, increase the interference severity at the target cell. Obviously,
the average uplink rate of EPA schema is higher than other schemes, despite the intensity of interference.

91
Electronics 2019, 8, 372

0.9 Conventional Scheme

SPA
0.8 EPA

0.7

0.6
CDF

0.5

0.4

0.3

0.2

0.1

0
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8
Min Throughput

Figure 12. The CDF of min. uplink rate when M = 64 using MRC.

Conventional Scheme
2 SPA
EPA
Average Rate per User [bit/channel use]

1.5

0.5

0
50 100 150 200 250 300 350 400 450 500
Number of Receive Antennas per Cell

Figure 13. The average uplink rate per user with ZF for different numbers of antennas, K = 20 , and
R = 300 m.

92
Electronics 2019, 8, 372

7. Conclusions
In this paper, we propose a new pilot assignment approach to address the pilot contamination
in multicell massive MIMO systems. An optimization problem is formulated in order to improve the
minimum uplink rate for users in the target cell. Our approach to solving this optimization problem
ensures an overall reduction of the outgoing inter-cell interference of neighbor cells. This reduction is
achieved by assigning the pilot sequences to the neighbor cell’s users and maximizing the minimum
uplink rate of the target cell’s users based on SPA algorithm. The numerical results have clearly shown
that the EPA scheme is more effective than the other schemes in both MRC and ZF linear detections.
Additionally, using such an efﬁcient assigning approach entitles the new EPA scheme to achieve signiﬁcant
performance when the typical parameter M is 64 compared to the SPA and the conventional schemes.
Likewise, the minimum uplink rate is greatly enhanced by the new EPA scheme than the SPA scheme.
Furthermore, the proposed scheme has also proved high effectiveness and performance even in severe
interference environments.

Author Contributions: Conceptualization, A.S.A.-h. and N.K.N.; Formal analysis, A.S.A.-h. and N.K.N.; Methodology,
A.S.A.-h., N.K.N., A.S., S.S. and A.M.M.; Writing—review & editing, A.S.A.-h. and N.K.N.
Funding: This work was funded by [Advancing the State of the Art of MIMO: The Key to Successful Evolution of
Wireless Networks (ATOM)] grant number [690750-ATOM-H2020-MSCA- RISE-2015, UPM: 6388800-10801] and The
APC was funded by [Research Management Centre (RMC)].
Conﬂicts of Interest: The authors declare no conﬂicts of interest.

References
1. Boccardi, F.; Heath, R.W.; Lozano, A.; Marzetta, T.L.; Popovski, P. Five disruptive technology directions for 5G.
IEEE Commun. Mag. 2014, 52, 74–80. [CrossRef]
2. Marzetta, T.L. Noncooperative cellular wireless with unlimited numbers of base station antennas. IEEE Trans.
Wirel. Commun. 2010, 9, 3590–3600. [CrossRef]
3. Lim, Y.G.; Chae, C.B.; Caire, G. Performance analysis of massive MIMO for cell-boundary users. IEEE Trans.
Wirel. Commun. 2015, 14, 6827–6842. [CrossRef]
4. Larsson, E.G.; Edfors, O.; Tufvesson, F.; Marzetta, T.L. Massive MIMO for next generation wireless systems.
IEEE Commun. Mag. 2014, 52, 186–195. [CrossRef]
5. Rusek, F.; Persson, D.; Lau, B.K.; Larsson, E.G.; Marzetta, T.L.; Edfors, O.; Tufvesson, F. Scaling up MIMO:
Opportunities and challenges with very large arrays. IIEEE Signal Process. Mag. 2013, 30, 40–60. [CrossRef]
6. Björnson, E.; Larsson, E.G.; Debbah, M. Optimizing multi-cell massive MIMO for spectral efﬁciency: How many
users should be scheduled? In Proceedings of the 2014 IEEE Global Conference on Signal and Information
Processing (GlobalSIP), Atlanta, GA, USA, 3–5 December 2014. [CrossRef]
7. Rajatheva, N.; Osseiran, A.; Monserrat, J.F.; Marsch, P. 5G Mobile and Wireless Communications Technology;
Osseiran, A., Monserrat, J.F., Marsch, P., Eds.; Cambridge University Press: Cambridge, UK, 2016;
ISBN 978-1-107-13009-8.
8. Björnson, E.; Larsson, E.G.; Marzetta, T.L. Massive MIMO: Ten myths and one critical question. IEEE Commun. Mag.
2016, 54, 114–123. [CrossRef]
9. Jose, J.; Ashikhmin, A.; Marzetta, T.; Vishwanath, S. Pilot Contamination and Precoding in Multi-Cell TDD
Systems. IEEE Trans. Wirel. Commun. 2011, 10, 2640–2651. [CrossRef]
10. Björnson, E.; Hoydis, J.; Sanguinetti, L. Massive MIMO Networks: Spectral, Energy, and Hardware Efﬁciency.
IFound. Trends Signal Process. 2017, 11, 154–655. [CrossRef]
11. Zhu, X.; Wang, Z.; Dai, L.; Qian, C. Smart Pilot Assignment for Massive MIMO. IIEEE Commun. Lett. 2015,
19, 1644–1647. [CrossRef]

93
Electronics 2019, 8, 372

12. Dao, H.T.; Kim, S. Vertex Graph-Coloring-Based Pilot Assignment With Location-Based Channel Estimation for
Massive MIMO Systems. IEEE Access 2018, 6, 4599–4607. [CrossRef]
13. Kim, K.; Lee, J.; Choi, J. Deep Learning Based Pilot Allocation Scheme (DL-PAS) for 5G Massive MIMO System.
IIEEE Commun. Lett. 2018, 22, 828–831. [CrossRef]
14. Akbar, N.; Yan, S.; Yang, N.; Yuan, J. Location-Aware Pilot Allocation in Multicell Multiuser Massive MIMO
Networks. IEEE Trans. Veh. Technol. 2018, 67, 7774–7778. [CrossRef]
15. Muppirisetty, L.S.; Charalambous, T.; Karout, J.; Fodor, G.; Wymeersch, H. Location-Aided Pilot Contamination
Avoidance for Massive MIMO Systems. IEEE Trans. Wirel. Commun. 2018, 17, 2662–2674. [CrossRef]
16. Sohn, J.; Yoon, S.W.; Moon, J. On Reusing Pilots Among Interfering Cells in Massive MIMO. IEEE Trans.
Wirel. Commun. 2017, 16, 8092–8104. [CrossRef]
17. Zhu, X.; Wang, Z.; Qian, C.; Dai, L.; Chen, J.; Chen, S.; Hanzo, L. Soft Pilot Reuse and Multi-Cell Block
Diagonalization Precoding for Massive MIMO Systems. IEEE Trans. Veh. Technol. 2016, 65, 3285–3298. [CrossRef]
18. ZLiu, J.; Li, Y.; Zhang, H.; Guo, S. Efficient and Fair Pilot Allocation for Multi-cell Massive MIMO Systems.
In Proceedings of the 2017 9th International Conference on Wireless Communications and Signal Processing
(WCSP), Nanjing, China, 11–13 October 2017. [CrossRef]
19. Ma, S.; Xu, E.L.; Salimi, A.; Cui, S. A Novel Pilot Assignment Scheme in Massive MIMO Networks.
IEEE Commun. Lett. 2018, 7, 262–265. [CrossRef]
20. Wang, H.; Huang, Y.; Jin, S.; Du, Y.; Yang, L. Pilot Contamination Reduction in Multi-cell TDD Systems with Very
Large MIMO Arrays. Wirel. Pers. Commun. 2017, 96, 5785–5808. [CrossRef]
21. Du, T.; Wang, Y.; Wang, J.; Du, Y.; Yang, L. Multi-cell Joint Optimization to Mitigate Pilot Contamination for
Multi-Cell Massive MIMO Systems. In Proceedings of the 2017 IEEE 85th Vehicular Technology Conference
(VTC Spring), Sydney, NSW, Australia, 4–7 June 2017. [CrossRef]
22. Wang, P.; Zhao, C.; Zhang, Y.; Stuber, G.L. A Novel Pilot Assignment Approach for Pilot Decontaminating in
Massive MIMO Systems. In Proceedings of the 2017 IEEE Wireless Communications and Networking Conference
(WCNC), San Francisco, CA, USA, 19–22 March 2017. [CrossRef]
23. Zhao, M.; Zhang, H.; Guo, S.; Yuan, D. Joint Pilot Assignment and Pilot Contamination Precoding Design for
Massive MIMO Systems. In Proceedings of the 2017 26th Wireless and Optical Communication Conference
(WOCC), Newark, NJ, USA, 7–8 April 2017. [CrossRef]
24. Fernandes, F.; Ashikhmin, A.; Marzetta, T.L. Inter-Cell Interference in Noncooperative TDD Large Scale Antenna
Systems. IEEE J. Sel. Areas Commun. 2013, 31, 192–201. [CrossRef]
25. Chen Z.; Yang, C. Pilot decontamination in wideband massive MIMO systems by exploiting channel sparsity.
IEEE Trans. Wirel. Commun. 2016, 15, 5087–5100. [CrossRef]
26. Björnson, E.; Matthaiou, M.; Debbah, M. Massive MIMO with Non-Ideal Arbitrary Arrays: Hardware Scaling Laws
and Circuit-Aware Design. IEEE Trans. Wirel. Commun. 2015, 14, 4353–4368.. [CrossRef]
27. Li, M.; Jin, S.; Gao, X. Spatial Orthogonality-Based Pilot Reuse for Multi-cell Massive MIMO Transmission.
In Proceedings of the 2013 International Conference on Wireless Communications and Signal Processing,
Hangzhou, China, 24–26 October 2013. [CrossRef]
28. Ashikhmin, A.; Marzetta, T. Pilot Contamination Precoding in Multi-Cell Large Scale Antenna
Systems. In Proceedings of the 2012 IEEE International Symposium on Information Theory Proceedings,
Cambridge, MA, USA, 1–6 July 2012. [CrossRef]
29. Muller, R.; Cottatellucci, L.; Vehkapera, M. Blind pilot decontamination. IEEE J. Sel. Top. Signal Process. 2014,
8, 773–786. [CrossRef]
30. NGO, H.Q.; Larsson, E.G.; Marzetta, T.L. Energy and Spectral Efficiency of Very Large Multiuser. IEEE Trans.
Wirel. Commun. 2013, 61, 1436–1449. [CrossRef]
31. Sesia, S.; Toufik, I.; Baker, M. Network Architecture and Protocols. In LTE-The UMTS Long Term Evolution: From
Theory to Practice; John Wiley & Sons Ltd.: Chichester, UK, 2011; ISBN 978-0-470-66025-6.

94
Electronics 2019, 8, 372

32. Zhu, X.; Dai, L.; Wang, Z. Graph coloring based pilot allocation to mitigate pilot contamination for multi-cell
massive mimo systems. IEEE Commun. Lett. 2015, 19, 1842–1845. [CrossRef]
33. Sarkar, T.K.; Ji, Z.; Kim, K.; Medouri, K.; Salazar-Palma, M. A survey of various propagation models for mobile
communication. IEEE Antennas Propag. Mag. 2003, 45, 51–82. [CrossRef]

© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article
distributed under the terms and conditions of the Creative Commons Attribution (CC BY)
license (http://creativecommons.org/licenses/by/4.0/).

95
electronics
Article
Downlink Channel Estimation in Massive
Multiple-Input Multiple-Output with Correlated
Sparsity by Overcomplete Dictionary and
Bayesian Inference
Wei Lu 1, *, Yongliang Wang 1 , Xiaoqiao Wen 1 , Shixin Peng 2 and Liang Zhong 3
1 Air Force Early Warning Academy, Wuhan 430019, China; [email protected] (Y.W.);
[email protected] (X.W.)
2 National Engineering Research Center for E-Learning, Central China Normal University, Wuhan 430079,
China; [email protected]
3 Department of communication system, China University of Geoscience, Wuhan 430074, China;
[email protected]
* Correspondence: [email protected]; Tel.: +86-027-85695800

Received: 24 March 2019; Accepted: 24 April 2019; Published: 28 April 2019

Abstract: We exploited the temporal correlation of channels in the angular domain for the downlink
channel estimation in a massive multiple-input multiple-output (MIMO) system. Based on the slow
time-varying channel supports in the angular domain, we combined the channel support information
of the downlink angular channel in the previous timeslot into the channel estimation in the current
timeslot. A downlink channel estimation method based on variational Bayesian inference (VBI)
and overcomplete dictionary was proposed, in which the support prior information of the previous
timeslot was merged into the VBI for the channel estimation in the current timeslot. Meanwhile
the VBI was discussed for a complex value in our system model, and the structural sparsity was
utilized in the Bayesian inference. The Bayesian Cramér–Rao bound for the channel estimation mean
square error (MSE) was also given out. Compared with other algorithms, the proposed algorithm
with overcomplete dictionary achieved a better performance in terms of channel estimation MSE
in simulations.

Keywords: massive MIMO; channel estimation; Bayesian inference; overcomplete dictionary

1. Introduction
Massive multiple-input multiple-output (MIMO) is the key technology for next generation wireless
communication. The large number of antennas enable high spectrum efficiency and lower power
consumption [1]. To get these benefits, the base station (BS) needs to acquire the channel stated
information (CSI) for uplink and downlink. Pilot-based channel estimation is widely used in wireless
communication systems. In the time division duplex (TDD) system, the channel reciprocity is used to
get the CSI by only estimating the uplink channel at BS. In the frequency division duplex (FDD) system,
the channel reciprocity cannot be used directly. In FDD massive MIMO system it is challenging to get
the downlink CSI with the conventional feedback scheme. In the conventional feedback scheme each
user estimates its channel and then feeds back the estimated CSI to the BS. The pilot and feedback
overheads are high for massive MIMO, since they are scaling linearly with the number of antennas.
Hence, it is important to design an efficient downlink channel estimation and feedback scheme for a
FDD massive MIMO system.
By exploiting the sparsity in massive MIMO channel, compressed sensing (CS) was applied in
the channel estimation and feedback. The users could feed the compressed training measurements

Electronics 2019, 8, 473; doi:10.3390/electronics8050473 97 www.mdpi.com/journal/electronics

Electronics 2019, 8, 473

back to the BS, and an orthogonal matching pursuit (OMP) was used for downlink CSI recovery in [2].
In [3] the modified basis pursuit (MBP) was proposed by utilizing the partial priori signal support
information to improve the recovery performance. In [4] the support information of a signal in the
discrete fourier transform domain was incorporated into the weighted l1 minimization approach for CS
recovery, which could reduce the number of measurements by the size of the known part of support.
In [5] a three-level weighting scheme based on the support information was used for the weighted
l1 minimization and the simulation results showed superiority. In [6] we exploited the reciprocity
between uplink and downlink channels in the angular domain, and diagnosed the supports of the
downlink channel from the estimated uplink channel, and proposed a weighted subspace pursuit
(SP) channel estimation algorithm for FDD massive MIMO. It can be seen that CS was effective in the
channel estimation for massive MIMO.
However, most of these algorithms need the sparsity level in the estimation algorithm, which is
not practical in engineering scenarios. The Bayesian framework can be applied to the compressive
channel estimation. In [7], Bayesian estimation of sparse massive MIMO channel was developed in
which neighboring antennas shared among each other their information about the channel support.
In [8] a variational expectation maximization strategy was used for massive MIMO channel estimation,
and a Gaussian mixture prior model was designed to capture the individual sparsity for each channel
and the joint sparsity among users. In [9] a sparse Bayesian learning algorithm was proposed for FDD
massive MIMO channel estimation with arbitrary 2D-array. By the Bayesian framework in compressive
channel estimation the sparsity level is unnecessary, and it has relatively better recovery performance.
Additionally, there exists angular reciprocity in massive MIMO. For example, the channel covariance
matrices for uplink and downlink are reconstructed by making use of the angle reciprocity between
uplink and downlink channels in [10]. Hence it is promising to apply the angular reciprocity and
Bayesian framework in the compressive massive MIMO channel estimation.
Additionally, there exists angular reciprocity in the FDD massive MIMO. There is also time
correlation of channels. In [11] a differential compressive feedback in FDD massive MIMO was proposed
based on the channel impulses response (CIR) between timeslots, which were slow time-varying and
sparse, and the differential CIR between two CIRs in adjacent timeslots was sparse. Inspired by the
sparsity in the angular domain and time correlation of channels, the correlated angular sparsity can
also be exploited for massive MIMO channel estimation.
In this paper we proposed a downlink channel estimation in a TDD/FDD massive MIMO system.
The timeslots were divided into groups. In each group the estimated channel support information
of the previous timeslot was utilized by the following timeslot. The correlated angular sparsity
between timeslots in the downlink channel was utilized in the Bayesian inference for channel recovery.
We transformed the complex sparse vector to the real sparse vector recovery by Bayesian inference,
and the structural sparsity of the transformed real sparse vector was utilized. Meanwhile, the prior
support information from the estimated channel in the previous timeslot was made use of in modeling
the hidden hyperparameters in the Bayesian model. A Bayesian Cramér–Rao bound analysis is
presented, and simulations are given out to verify the performance of the proposed algorithm.
The main contributions were as follows: (1) a group-based channel estimation scheme was proposed,
in which previous estimated channel support information was used as the priori information in
the following timeslot due to the sparsity correlation; (2) priori information was merged into the
Bayesian inference algorithm for channel recovery; (3) the Bayesian Cramér–Rao bound for the channel
estimation mean square error (MSE) was analyzed.
The system model is illustrated in Section 2, while the proposed channel estimation algorithm
based on Bayesian inference is presented in Section 3. The Bayesian Cramér–Rao bound (BCRB) for the
channel estimation of mean square error (MSE) is given out in Section 4. Simulations and conclusions
are presented in Sections 5 and 6.
In the paper, we used the following notations. Scalars, vectors and matrices were denoted by
lower-case, boldface lower-case and boldface upper-case symbols. The probability density function

98
Electronics 2019, 8, 473

of a given random variable was denoted by p(·). Gamma(x|a, b) was the Gamma probability density
function (PDF) with shape parameters a and b for x, while Normal(x|c, d) was the Gaussian PDF with
parameters mean c and variance d for x. Γ(·) was the Gamma function, and ln(·) was the logarithm
function. Tr(·) stood for the trace operator. a (·) denoted the expectation operation with the PDF of
variable a.

2. System Model
We considered a massive MIMO TDD/FDD system with a single user, and assumed that the BS
was equipped with N antennas and the user terminal (UT) had a single antenna. For the downlink
channel estimation in the massive MIMO system, the BS transmitted the pilots to UT. The UT received
the pilots and fed back the received signal to the BS directly. The received signal yd (t) at the UT in the
t-th timeslot was written as
yd (t) = ρd Ahd (t) + nd (t) (1)

where hd (t) ∈ CN×1 is the downlink channel, A ∈ CTd ×N is the downlink pilots, Td is the pilot length,
ρd is the downlink received power, nd ∈ CTd ×1 is the received noise with each element to be i.i.d
Gaussian with mean 0 and variance σ2 , yd (t) ∈ CTd ×1 is the received signal at UT.
Since in the massive MIMO there existed sparsity, when Dd ∈ CN×M was the channel dictionary
for downlink channel which could be unitary dictionary or overcomplete dictionary (M > N,
their column vector had the form of steering vector with a diﬀerent sampling angle), hda (t) was
the sparse representation with hd (t) = Dd hda (t). In this paper we applied the overcomplete dictionary
to present the sparse angular channel to get a better recovery performance. In the downlink channel
d
estimation, we needed to obtain ĥa (t) the estimated downlink channel in the angular domain in the
t-th timeslot.
By utilizing the sparse channel representation we then had

yd ( t ) = ρd ADd hda (t) + nd (t) (2)

For simplicity, the timeslot mark is omitted in the following equations. Since yd (t), hda (t), and nd (t)
are complex number vectors, we could rewrite Equation (2) into real number vectors as
⎡ ⎤
⎢⎢ Re( ρd ADd ) ⎥⎥
Re(yd ) ⎢⎢ −Im( ρd ADd ) ⎥⎥ Re(hda (t)) Re(nd (t))
= ⎢⎢⎢ ⎥⎥
Im(yd ) ⎣ Im( ρd ADd ) ⎥⎦ Im(hd (t)) + Im(nd (t)) (3)
Re( ρd ADd ) a

where Re(·) and Im(·) denote the real and imaginary parts respectively. For simplicity, we rewrote
Equation (3) as
y = Ah + n (4)
⎡ ⎤
⎢⎢ Re( ρd ADd ) −Im( ρd ADd ) ⎥⎥
Re(yd ) ⎢⎢⎢ ⎥⎥⎥, h = Re(hda (t))
where y = , A = ⎢ ⎥ and n =
Im(yd ) ⎢⎣ ⎥ Im(ha (t))
d
Im( ρd ADd ) Re( ρd ADd ) ⎦

Re(nd (t))
.
Im(n (t))
d

On the other hand, we considered the meaning of sparse angular channel representation hda (t).
If the transmission angles were allocated exactly at the sampling points in the channel dictionary Dd ,
then the corresponding coeﬃcient in the hda (t) was nonzero. If the path number was smaller than
the antenna number, then hda (t) was sparse. However, there was leakage eﬀect induced by dictionary
mismatch which will have deteriorated the sparsity of the angular channel representation [12]. When the
movement velocity of UT was not very high, e.g., v = 12 km/h, and the typical timeslot duration
τ = 0.5 ms, the movement distance of UT in one timeslot was 0.017 m. When the distance of UT and BS

99
Electronics 2019, 8, 473

was 200 m, the angle change for the line of sight (LoS) transmission in one timeslot was 0.0049◦ which
was much smaller than the sampling interval in the dictionary. For the non-LoS (NLoS) transmission,
the angle change was also small which is discussed in Section 4.1. Hence the transmission angle change
between two timeslots is very small if the transmission environment doesn’t change dramatically,
and there is correlation in the angular channel sparsity between adjacent timeslots. In other words,
the information regarding the estimated angular channel in the previous timeslot could be utilized in
the current channel estimation.
It was proven that the prior support information could improve the channel recovery
performance [3–6]. Hence in this paper we made use of the prior support information from the previous
timeslot to improve the Bayesian channel estimation. In the following section we have discussed how
to merge the prior information into the Bayesian inference algorithm for channel estimation.

3. Proposed Algorithm
We designed a three-layer hierarchical graphical model as shown in Figure 1. In the ﬁrst layer,
h was assigned a Gaussian prior distribution

2N
p(h|α) = p(hi αi ) (5)
i=1

where hi and αi are the i-th entry in h and α respectively, p(hi αi ) = Normal(hi 0, αi ) and αi is the
inverse variance of the Gaussian distribution. When hi is close to 0, then αi is very large, and vice versa.
In the second layer, we assumed a Gamma distribution as hyperpriors over the hyperparameters
αi , and it can be presented as
2N
p(α) = Gamma(αi ai , bi ) (6)
i=1

where Gamma(·) is the Gamma PDF, and the parameters ai and bi characterize the shape of Gamma
PDF. For ﬁxed ai , the larger bi is, the smaller αi is; then hi tends to be nonzero. In the sparse Bayesian
learning ai and bi were set to be very small for non-informative hyperprior over αi [13].
In our model, we set ai to be constant with a predeﬁned value, and we modeled bi as random
parameters. In Figure 1 it could be found that the entries of y were divided into two sets by their

indices, i.e., S, S+N and S, S+N c , where S was the set with channel support indices from the previous
timeslot, and S+N was the set with each index in S added by N, since we converted the complex

system model to the real system model as Equation (3). S, S+N c was the complementary set of S, S+N .
For example, in the (t − 1)-th timeslot, the positions of nonzero entries or called supports in ha (t − 1)
d

were S = {4, 5, 6}, then S+N = {4 + N, 5 + N, 6 + N}. The probable supports for hda (t) in the current
t-th timeslot can be assumed to be the same as those for previous (t − 1)-th timeslot for simplicity.
On the other hand, we could have also diagnosed the probable channel supports further by taking the
angle deviation and leakage eﬀects into consideration. In this paper we adopted the support diagnosis
algorithm, and the details can be found in [6].

For y j , j ∈ S, S+N , we employed a Gamma distribution over the hyperparameters bi in the third
layer as
Gamma(bi |c, d) = Γ(c)−1 dc bc−1
i e
−dbi
(7)

where c and d characterize the shape of Gamma PDF. By the system model and assumptions for massive
MIMO, we could use a Bayesian inference to perform the sparse channel recovery.

100
Electronics 2019, 8, 473

Figure 1. Graphical model for the channel estimation with Bayesian inference. The nodes with
double circle, single circle and square correspond to the observed data, hidden variables and
parameters, respectively.

According to the standard Bayesian inference [14], let z h, α, b , we have

ln p(zi ) = Ezi ,i j [ln p(y, z)] + constant

(8)
∝ Ezi ,ij [ln p(y, z)]

where constant is a constant used for p(zi ) normalization, p( y, z) is the joint pdf for h and z, and zi can
be h, α, andb. We have p(y, z) = p(z y)p( y) . We assume p(z y) posterior independence among the
hidden variables z, then p(z y) ≈ p(z) , and p(z) is the product of PDF of h, α, and b.
In order to make use of the prior support information from the previous timeslot and the structure
sparsity in Equation (4), we needed to make some modifications to the standard Bayesian inference.
The main considerations for the modifications were as follows:
(I) Since we rewrote Equation (2) as Equation (4), if hda,i was nonzero, then hi and hi+N were
nonzero simultaneously. Hence it was wise to assume that bi and bi+N were the same;
(II) In the standard Bayesian learning ai and bi were set to be very small for non-informative
hyperprior over αi . This assumption was valid if no prior information was provided. If the prior
support information was available, such as that the support information of the previous timeslot could
be used for channel estimation in the coming timeslot by sparsity correlation, it was wise to assume
that the supports between adjacent timeslots were partially common. If the i-th element in the angular
channel vector was nonzero, then the hyperparameter bi and bi+N tended to be variables rather than to
be fixed small numbers, which meant only for the indices from the prior support set S the third layer
prior model was adopted.
It can be seen that the consideration (II) was similar to [15]. However, our proposed algorithm
was extended for a complex number system and the structure sparsity was considered. However,
on the other hand, the overcomplete dictionary was adopted in our algorithm.
The proposed uplink-aided downlink channel estimation based on Bayesian inference was
as follows:
(i) Update of p(h)
According to Equation (8), by ignoring the terms which are independent of h, we have

101
Electronics 2019, 8, 473

ln p(h) ∝ Eα,b ln p(y h) + ln p(h α) + ln p(b)

∝ Eα,b ln p(y h) + ln p(h α) (9)
−1 T T
= 2σ2
(y − Ah) (y − Ah) − 12 h Λh

where Λ = diag Eα [αi ] , σ2 is the noise variance in the
system model, the vectors b and α are
comprised
by bi and αi respectively. Since p(y h) and p h α are a Gaussian distribution, then p h follows a
Gaussian distribution with the mean μ and covariance φ given by

1 T
μ= ΦA y (10)
σ2

1 T
Φ= A A+Λ (11)
σ2

(ii) Update of p(α)

According to Equation (8), by ignoring the terms which are independent of α, we have

ln p(α) ∝ Eh,b ln p(y h) + ln p(h α) + ln p(α|a, b ) + ln p(b)

∝ Eh,b ln p(h α) + ln p(α|a, b )

2N 2

= Eh,b (ai − 0.5) ln αi − 0.5hi + bi αi
i=1
2

= Eh,b (ai + 0.5 − 1) ln αi − 0.5hi + bi αi +
i∈{S,S+N }
2
(12)
Eh,b (a − 0.5) ln αi − 0.5hi + bi αi
c
i∈{S,S+N }
⎧ ⎛ ⎞ ⎫
⎪
⎪ ⎜⎜ Eh,b (bi +bi+N ) Eh,b (hi +hi+N ) ⎟⎟ ⎪
2 2
⎪
⎨ ⎜ ⎟⎟αi ⎬
= ⎪(ai + 0.5 − 1) ln αi − ⎜⎝
⎪ + ⎠ ⎪ ⎪ +
i∈{S,S+N } ⎩ ⎭
2 4

⎧ ⎛ ⎞ ⎫
⎪ ⎪ )⎟ ⎪
⎪
2 2
⎨ ⎜⎜ E (h +h
⎪ ( + − ) α − ⎜bi + h,b i i+N ⎟⎟⎟αi ⎬
⎜
⎪
c⎩
a i 0.5 1 ln i ⎝ 4 ⎠ ⎪
⎪
⎭
i∈{S,S+N }

where S is the estimated support set from the previous timeslot. Since the complex system model was
converted in Equation (4). By (II), S+N {si + N} was also the support set in the converted system

model in Equation (4). For i ∈ S, S+N , bi is variable number, bi and bi+N were assumed to be the same,
we used 0.5Eh,b (bi + bi+N ) to present Eh,b (bi ). The same assumption was applied to hi and hi+N with
2 2 2
Eh,b (hi ) = 0.5Eh,b (hi + hi+N ). In this way the structural sparsity was utilized.

Since p(α|a, b ) is the Gamma distribution and p h α is the Gaussian distribution, p(α) is the
ai and '
Gamma distribution. Then p(αi ) is also the Gamma distribution with the updated parameters ' bi
given by
'ai = ai + 0.5 (13)
⎧
⎪
⎪ Eh,b (bi +bi+N )
2 2
E ( hi + hi + N )
⎪
⎪ + h,b 4 , i ∈ S, S+N
⎪
⎨
'
bi = ⎪
2
(14)
⎪
⎪
⎪ 2 2
⎩ b + Eh,b (hi +hi+N ) , i ∈ S, S c
⎪
i 4 +N

(iii) Update of p(b{S,S+N } )

102
Electronics 2019, 8, 473

According to Equation (8), by ignoring the terms which are independent of b, we have

ln p(b{S,S+N } ) ∝ Eα,h ln p(y h) + ln p(h α) + ln p(α a, b) + ln p(b c, d)

∝ Eα,h [ln p(α a, b) + ln p(b c, d)] (15)

= −bi Eα (αi ) + (ci − 1) ln bi − di bi
i∈{S,S+N }

where b{S,S+N } is comprised by the entries indicated by S, S+N in b. In (15) the α, a, b, c, d are also

comprised by their indicated S, S+N , the subscript S, S+N is omitted for simplicity. As shown
in Figure 1, b{S,S+N } was modelled as a Gamma distribution. Since p(αi |ai , bi ) and p(bi |ci , di ) were a
~
Gamma distribution, p(b{S,S } ) was Gamma('
+N
bi∈{S,S } ' +M
ci and d'i were given by
ci , d'i ), and the updated '

'
ci = ai + ci (16)

d'i = di + Eα (αi ) (17)

Then the Bayesian inference for the channel estimation was executed iteratively among (i), (ii),
and (iii). The details of the algorithm are summarized in step 3 of Algorithm 1. When the estimated
channel vector h was recovered, we needed to convert it to the complex vector hda according to
Equation (3).

Algorithm 1 Downlink channel estimation with variational inference algorithm and overcomplete dictionary.
Input: A, y, σ2
Output: h
1. Divide the timeslots into groups, and with each group comprised by tg timeslots.
2. For the ﬁrst timeslot in the group, use variational Bayesian inference (VBI) for channel estimation, and
obtain the angular channel supports.
3. For the rest of the timeslots in the group, utilize the support information from the previous timeslot for
channel estimation one by one. The recovery algorithm in each timeslot is as follows:

3.1. Initialize α, a, b, c, d.
T T 2
3.2. μ = σ12 φA y, φ = ( σ12 A A + Λ), Eh,b (hi ) = μ2i + φi,i , where Λ = diag Eα [αi ] , μi is the i-th
entry in μ, and φi,i is the i-th diagonal entry in φ.
3.3. Update ' ai and ' ai and '
bi according to Equations (13) and (14) in (ii) (' bi are the updated ai and bi ,
and ai and bi are the results from last iteration); then according to the property of the Gamma
ai /'
distribution variable, Eα (αi ) = ' bi .
3.4. Update 'c and d'according to Equations (16) and (17) in (iii) ('
c and d'are the updated c and d, and c
and d are the results from last iteration); then according to the property of the Gamma
distribution variable, Eα ('
bi ) = '
c/d.'
3.5. Go to step 3.2 until stop criteria meets.
3.6. Then h = μ.
4. Go back to step 2 for a new group of timeslots.

In a practical massive MIMO system, the transmission environment may change suddenly, in this
way the correlation of sparsity between adjacent timeslots will deteriorate, and the previous channel
support information cannot be utilized. On the other hand, the error will accumulate if the previous
channel support information is utilized timeslot by timeslot. Hence, the initialization is important for
the robustness and eﬃciency of the algorithm. As shown in Figure 2 divided the timeslots into groups,
and each group was comprised of several timeslots. During the channel estimation for each group,

103
Electronics 2019, 8, 473

the VBI was used for the channel estimation in the ﬁrst timeslot, and then the proposed algorithm was
executed for the remaining timeslots in which the channel support information of the previous timeslot
was made use of by the current timeslot. This procedure is detailed in steps 1, 2 and 4 in Algorithm 1.

Figure 2. Channel estimations by group. Each block represents one timeslot, and the block ﬁlled with
grey is the timeslot with variational Bayesian inference (VBI) for the channel estimation, while the
blank blocks are the timeslots with the proposed algorithm for channel estimation.

4. Discussion

4.1. Sparsity Correlation Analysis

The UT movement distance was very small when the velocity of UT was small and the timeslot
was 0.5 ms. The reflector for the transmission was static during the UT moving between timeslots.
The ellipse geometry channel model is shown in Figure 3. The line of sight (LoS) distance between BS
and UT was dLos , the non-LoS (NLoS) distance by reflector between BS and UT was dNLoS , and the UT
movement distance in one timeslot was dΔ . If the transmission path was still reflected by the same
reflector as shown in Figure 3, the maximum and minimum NLoS distances from BS to UT between
timeslots were dNLoS + dΔ and dNLoS − dΔ . The transmission angle change was Δθ . The distance
between the reflector and BS was d1 . By some mathematical manipulations shown in Appendix A,
we got
2dΔ (dNLoS − d1 ) 1
Δθ ≈ √ (18)
2d1 dLoS 1 − cos2 θ

Figure 3. Ellipse geometry channel model for line of sight (LoS) and non-LoS (NLoS) transmission.

In order to illustrate the angle change Δθ during one timeslot, we assumed that dNLoS was 800 m,
the velocity of UT was 14.4 km/h, and the typical timeslot duration τ = 0.5 ms, then the movement
distance of UT in one timeslot was 0.02 m. By changing the distance between BS and reﬂector, as shown
in Figure 4, the angle change was not more than 0.025◦ . It should be noted that when the LoS distance

104
Electronics 2019, 8, 473

and the dNLoS were ﬁxed, BS and reﬂector distance could not be arbitrary vales due to triangle inequality.
Hence, the angle of arrival or departure changed slowly and then there was sparsity correlation among
the angular channels for adjacent timeslots.

/R6GLVWDQFH
/R6GLVWDQFH

/R6GLVWDQFH
DQJHOFKDQJH

%6DQGUHIOHFWRUGLVWDQFH P

Figure 4. Transmission angle change during one timeslot with a different LoS distance and different
distances between the base station (BS) and reflector.

4.2. Bayesian Cramér-Rao Bound Analysis

In this section we have discussed the Bayesian
Cramér–Rao bound (BCRB) for the channel
estimation with the proposed algorithm. Let z h, σ , then the BCRB for the channel vector h is given
by the inverse of the Fisher information matrix J as:
( 2 )
∂ log p(y, z)
J = Ez − (19)
∂zi ∂z j

According to the system model in Section 2, h, σ are independent, the Fisher information matrix J
is block diagonal. We can rewrite p(y, z) as

p(y, z) = p(y z)p(h α)p(α b)p(b) (20)

Then the BCRB on the MSE of the estimated channel vector h is given by
( 2
) * +

E h − h ≥ tr J−1 (21)
hi h j

( )
∂2 log p(y,z)
where Jh h = Ez − is the ﬁsher information sub-matrix. Thus, we can obtain the Bayesian
i j ∂hi ∂h j

Cramér–Rao bound of the minimum mean square error for the estimated channel h as shown in
Proposition 1.
Proposition 1. The BCRB of MSE for the channel estimation h is represented as

105
Electronics 2019, 8, 473

( ) ⎛* * + +−1 ⎞ ,
2 ⎜⎜ 1 1 T ⎟⎟ 1 , 1
E h − h ≥ tr⎜⎜⎝ diag(E ) + 2 A A ⎟⎟⎠ = + (22)
αi σ 1+c
+ λi bi
+ λi
i∈S adi σ iS a σ
T T
where S is the diagnosed support set, λi is the eigenvalues of A A, and A A ∈ R2N×2N , and a, bi , c, and di are
T
the parameters in the Bayesian model in Figure 1. When Td , M → ∞ and Md = β, according to the random
matrix theory, we have
( )
ˆ H ˆ 1
E h−h h−h ≥ |S| · |S| 1+c
1
λ + (N − |S|) · 1
(N−|S|)
1
max(b) λi
i∈S a min(d)
++ σi iS a +σ
(23)
amin(d) F(snr1 ,β) F(snr2 ,β)
→ |S| 1+c 1− 4βsnr1 + (N − |S|) maxa(b) 1 − 4βsnr2

* +2
amin(d)
√ 2 √ 2
where snr1 = (1+c)σ
, snr2 = a
σmax(b)
, F(x, z) = x 1 + z + 1 − x 1 − z + 1 , min(d) and
max(b) are the minimum and maximum entries in d and b.

The proof of proposition 1 is presented in Appendix B. From proposition 1, we can see that the
MSE lower bound is related to the priori support size |S|, (1 + c)/min(d) and max(b) for the massive
MIMO channel estimation.

5. Simulations
In the simulation, the support diagnosis algorithm in [6] was adopted, and we assumed that
the transmission angle change between timeslots was within 1 degree. The pilot length was 50,
and antenna number at the BS was 100. The channel was generated according to the spatial model as
deﬁned in 3GPP TR25.996. We compared our proposed algorithm with a unitary dictionary with a
size of 100 and the overcomplete dictionary with a size of 150, 200, and 250, and compared this with a
Bayesian sparse learning (SL) [16], weighted subspace pursuit (WSP) [6], weighted l1 minimization
(W-l1 min) [5], weighted iteratively reweighted least square(W-IRLS), IRLS [17], compressive sampling
matched pursuit (COSAMP) in [11], and l1 minimization (l1 min) [18].
In order to evaluate the channel estimation performance, we used a normalized mean-square
error (MSE) between true and estimated channel vectors as follows:
-- -
d -2
1 , -ĥ − h -
d
MSE = -- --2 (24)
T -hd -
T

d
where T is the number of trials, ĥ and hd are the estimated and original channel vector, respectively
for each trial. In the simulations the trial number T was 250.
In Figure 5 the overcomplete dictionary size was 150 in the proposed algorithm. It could be seen
that when the unitary dictionary was used, our proposed algorithm outperformed WSP, COSAMP
and IRLS, but was a little worse than W-l1 with a small gap. However, when the overcomplete
dictionary was used, our proposed algorithm outperformed other algorithms, but almost had the
same performance as SL with a little performance improvement which could be seen in the zoomed-in
subﬁgure. The overcomplete dictionary in the proposed algorithm can dramatically improve the MSE
performance due to the fact that there are more atoms in the overcomplete dictionary than in the
unitary dictionary which can improve the sparsity in the angular channel; however, it doesn’t mean
that the larger the overcomplete dictionary size is, the better performance it has, which is shown in
Figure 6.

106
Electronics 2019, 8, 473

Figure 5. Comparisons of channel estimation mean square error (MSE) for diﬀerent algorithms.

Figure 6. Comparisons of channel estimation of MSE for the proposed algorithm with diﬀerent
dictionary sizes.

We compared the performance of the proposed algorithm with diﬀerent dictionary sizes in
Figure 6. It can be seen that in the high SNR region the performance improved when an overcomplete
dictionary was used, but the MSE performance gain did not improve when increasing the dictionary

107
Electronics 2019, 8, 473

size. For example, the algorithm with a dictionary size of 150 had a relatively better performance than
with a dictionary size of 100. However, the performances with a dictionary size of 200 and 250 almost
gave the same trends as that with a dictionary size of 150. This was because the larger dictionary would
induce angel ambiguity because the correlation of atoms increased. Hence, in the practical engineering,
the dictionary size is not recommended to be very large. A large dictionary size is computationally
expensive and the benefit is limited. It also should be noted that in the low SNR region the MSE
performance with a larger dictionary size did not always do better than those with a smaller dictionary
size. For example, when the SNR was 0 dB, they hadsimilar performance. The reason was that in
the low SNR region the estimated channel support of the previous timeslot was not accurate enough,
and on the other hand larger dictionary size would have deteriorated the dictionary incoherence.
We compared the runtime and convergence performance of the proposed algorithm with a
different dictionary size in Figure 7. The relative error was defined as the ratio of the difference of
adjacent iteration results to the previous iteration result. It can be seen that the proposed algorithm with
dictionary size 150 converged fast than with a dictionary size of 100. However, the improvement had
its price, and the runtime for the proposed algorithm with dictionary size 150 was longer which meant
that the computational complexity was higher with a larger dictionary size. Based on the simulation
results shown in Figures 6 and 7, when the antenna at BS is 100, the dictionary size is recommended to
be set at 150 or so to balance the performance improvement and computation complexity.

Figure 7. Comparisons of runtime and convergence performances of the proposed algorithm with
orthogonal dictionary (size is 100) and overcomplete dictionary (size is 150).

6. Conclusions
In this paper we proposed a downlink channel estimation algorithm based on overcomplete
dictionary and variational Bayesian inference. We converted the complex system model to a real model
and exploited the correlation of angular channel sparsity in adjacent timeslots. In the algorithm we
divided the timeslots into groups and made use of the channel support information of the previous
timeslot to the channel estimation in the current timeslot within each group. The sparsity correlation
and Bayesian Cramér–Rao bound for the MSE of channel estimation was analyzed. Compared with
other recovery algorithms, such as WSP, IRLS, WIRLS, l1 min, W-l1 min and COSAMP, our proposed
algorithm with overcomplete dictionary had a relatively better performance. Moderate overcomplete
dictionary can improve the MSE performance of channel estimation to balance the computational
complexity and performance gain.

Author Contributions: Conceptualization and methodology, W.L.; validation, X.W., S.P. and L.Z.; formal analysis,
W.L.; writing—original draft preparation, W.L.; supervision, Y.W.

108
Electronics 2019, 8, 473

Funding: This work was supported in part by the National Science Foundation of China (No.61601509 and
61601334), the China Postdoctoral Science Foundation Grant (No.2016M603045 and 2018M632889) and the
self-determined research funds of CCNU(CCNU18QN007) from the colleges basic research and operation of MOE.
Acknowledgments: Thank you to the reviewers for their valuable comments.
Conﬂicts of Interest: The authors declare no conﬂicts of interest.

Appendix A

Proof of angle change with UT movement.

According to the cosine law, we have

d21 + d2LoS − (dNLoS − d1 )2

cos θ = , (A1)
2d1 dLoS

d21 + d2LoS − (dNLoS ± dΔ − d1 )2

cos(θ ± Δθ ) = . (A2)
2d1 dLoS
Then we can get
⎛ 2 ⎞
⎜⎜ d1 + d2LoS − (dNLoS − d1 )2 − d2Δ ∓ 2dΔ (dNLoS − d1 ) ⎟⎟
θ ± Δθ = arccos⎜⎜⎝⎜ ⎟⎟.
⎟⎠ (A3)
2d1 dLoS

Since d2Δ is very small compared with d1 and dLoS , by the ﬁrst-order approximation we have
* +
d21 +d2LoS −(dNLoS −d1 )2 ∓2dΔ (dNLoS −d1 )
θ ± Δθ ≈ arccos 2d1 dLoS

d21 +d2LoS −(dNLoS −d1 )2 2d (d −d1 ) . (A4)

≈ arc cos( 2d1 dLoS ) ± Δ 2dNLoS √ 1
1 dLoS 1−cos2 θ
2dΔ (dNLoS −d1 )
= θ± 2d1 dLoS
√ 1
1−cos2 θ

Then we have
2dΔ (dNLoS − d1 ) 1
Δθ ≈ √ . (A5)
2d1 dLoS 1 − cos2 θ

Appendix B

Proof of Proposition 1.

Let z h, σ , the we have

Ez (z − ẑ)(z − ẑ)T ≥ J−1 . (A6)

Since h, σ are independent, the Fisher information matrix J is block diagonal, and can be presented as
⎡ ⎤
⎢⎢ J 0 ⎥⎥⎥
J = ⎢⎢⎣ h,h ⎥⎦. (A7)
0 Jσ,σ

Then the inverse of matrix J is ⎡ ⎤

⎢⎢ J −1 0 ⎥⎥
J−1 = ⎢⎢⎣ h,h ⎥⎥.
⎦ (A8)
0 Jσ,σ −1

109
Electronics 2019, 8, 473

Because p(y, z) = p y z p(h α)p(α b)p(b)p(σ) , we have
2
∂ log p(y,z)
J = Ez − ∂z ∂z
2
i j
( ) 2
∂ log p(y|z) ∂2 log p(h α) ∂ log p(α|b)
= Ez − ∂z ∂z + Ez − ∂z ∂z + Ez − ∂z ∂z +
i j i j i j (A9)
2 2
∂ log p(b) ∂ log p(σ)
Ez − ∂z ∂z + Ez − ∂z ∂z
i j i j

Since we mainly focus on the MSE of h, we only need to analyze Jh,h . We discuss the above
formula part by part as follows:
2
∂ log p(y|z)
1) Let Jh,h (y) = Ez − ∂z ∂z , according to the Bayesian model in Figure 1, we have
i j

T T
A A A A
p y z ∼ Normal(y|Ah, σI)thenJh,h (y) = Ez { }= .
σ σ
( )
∂2 log p(h α) .
2N
2) Let Jh,h (h) = Ez − ∂z ∂z , and we have P h α = Normal(hi 0, αi ) , then we get Jh,h (h) =
i j
i=1
Ez α1i .
2 2 2
∂ log p(α|b) ∂ log p(b) ∂ log p(σ)
3) Because Ez − ∂z ∂z , Ez − ∂z ∂z and Ez − ∂z ∂z are independent with h, they are
i j i j i j
T
all 0. Then in summary, we get Jh,h = diag(E α1 ) + σ12 A A.
i

Since the priori support set information is used in our proposed algorithm, a three-layer model is
constructed for the elements belonging to the priori support
set, and a two-layer model is used for the
elements not belonging to the priori support set, so EZ α1 has diﬀerent expressions for the two cases.
i
EZ α1 in the two cases are discussed as follows:
i

1) When i belongs to the priori support set, according to the three-layer graph model we have

2N
p(α) = Gamma(αi a, bi ) , (A10)
i=1

p(bi ) = Gamma(bi c, di ) = Γ(c)−1 dci bc−1

i e
−di bi
. (A11)

Then we get /∞
p(αi ) = 0
p(αi bi )p(bi )dbi
/∞
= Γ(a)−1 bai αa−1 e−bi αi Γ(c)−1 dci bc−1 e−di bi dbi
0 i i . (A12)
Γ (a+c)
= Γ(a)−1 Γ(c)−1 αa−1 dci
i (ai +di )a+c

Accordingly, we have
/∞ −1 −1 a−1 c Γ(a+c)
E 1
αi = 0 αi Γ (a) Γ (c) αi di (ai +di )a+c dαi
1

/∞ −a−c , (A13)
1 Γ(a+c) αi a−1 αi α
= 0 αi Γ(a)Γ(c) d1 d1 +1 ddi
1

Γ (a+c)
α a−1 α −a−c
where Γ(a)Γ(c) d i di + 1
i
satisﬁes the probability density function of Beta prime
i
distribution. According to the properties of the Beta prime distribution, when −a < −1 < c,
we have

110
Electronics 2019, 8, 473

( )
ˆ H ˆ 1
E h−h h−h ≥ |S| · |S| 1+c
1
λ + (N − |S|) · 1
(N−|S|)
1
max(b) λi
i∈S a min(d)
++ σi iS a +σ
. (A14)
amin(d) F(snr1 ,β) F(snr2 ,β)
→ |S| 1+c 1− 4βsnr1 + (N − |S|) maxa(b) 1 − 4βsnr2

2) When i does not belong to the priori support set, according to the high-order moment properties
for the general gamma distribution, we have
( )
1 bi
E = . (A15)
αi a

Then in summary, we have

( ) ⎛* * + +−1 ⎞ ,
2 ⎜⎜ 1 1 T ⎟⎟ 1 , 1
E h − h ≥ tr⎜⎜⎝ diag(E ) + 2 A A ⎟⎟⎠ = + , (A16)
αi σ 1+c
+ λi bi
+ λi
i∈S adi σ iS a σ

T T
where S is the diagnosed support set, λi is the eigenvalues of A A,and A A ∈ R2M×2M .

2π
When overcomplete dictionary is as Dd = √1 e−j M kn , k ∈ {1, · · · , M}, n ∈ {1, · · · , N}, and A is
N n,k
1
Gaussian random matrix with each element is mean 0 and variance Td , then ADd is complex Gaussian
ρd
random matrix. Then A is Gaussian random matrix with mean 0 and variance 2T .
d
According to the random matrix theory, for N × K dimensional random matrix H with each element
K
is independent and is variable with mean 0 and variance 1/N, when K, N → ∞ and N → β ,then the
+
empirical distribution of eigenvalues of H H converges almost surely as fβ (x) = 1 − β1 δ(x) +
T
√
(x−a)+ (b−x)+ 0 2 0 2
2πβx , where (x)+ = max(0, x), a = 1 − β , b = 1 + β .
ρd
Since A ∈ R2Td ×2M , and its element is Gaussian random variable with mean 0 and variance 2Td By
applying the above results for the empirical distribution of eigenvalues of HT H, when Td , M → ∞ and
Td T
M = β, the empirical distribution of eigenvalues λ of A A converges almost surely as

* ++ (λ − a)+ (b − λ)+
1
fβ ( λ ) = 1 − δ(λ) + (A17)
β
2πβλ ρd
0 2 0 2
where a = ρd 1 − β , b = ρd 1 + β . When s , M → ∞ and s
M = μ, we have
( )
ˆ H ˆ 1
E h−h h−h ≥ |S| · |S| 1+c
1
λ + (N − |S|) · 1
(N−|S|)
1
max(b) λi
i∈S a min(d)
++ σi iS a +σ
, (A18)
amin(d) F(snr1 ,β) F(snr2 ,β)
→ |S| 1+c 1− 4βsnr1 + (N − |S|) maxa(b) 1 − 4βsnr2

* +2
amin(d)
√ 2 √ 2
where snr1 = (1+c)σ
, snr2 = a
σmax(b)
and F(x, z) = x 1+ z +1− x 1− z +1 .
Then the proofs are complete.

111
Electronics 2019, 8, 473

References
1. Lu, L.; Li, G.Y.; Swindlehurst, A.L.; Ashikhmin, A.; Zhang, R. An overview of Massive MIMO: Benefits and
challenges. IEEE J. Sel. Top. Signal Process. 2014, 8, 742–758. [CrossRef]
2. Rao, X.; Lau, V.K.N. Distributed compressive CSIT estimation and feedback for FDD multi-user massive
MIMO systems. IEEE Trans. Signal Process. 2014, 12, 3261–3271.
3. Vaswani, N.; Lu, W. Modified-CS: Modifying compressive sensing for problems with partially known
support. IEEE Trans. Signal Process. 2010, 9, 4595–4607. [CrossRef]
4. Borries, R.V.; Miosso, C.; Potes, C. Compressed sensing using priori information. In Proceedings of the 2nd
IEEE International Workshop on Computational Advances in Multi-Sensor Adaptive, St. Thomas, VI, USA,
12–14 December 2007; pp. 121–124.
5. Tseng, C.C.; Wu, J.Y.; Lee, T.S. Enhanced compressive downlink CSI Recovery for FDD Massive MIMO
systems using weighted Block l1 minimization. IEEE Trans. Commun. 2016, 3, 1055–1066. [CrossRef]
6. Lu, W.; Wang, Y.; Fang, Q.; Peng, S. Downlink compressive channel estimation with support diagnosis in
FDD massive MIMO. J. Wirel. Commun. Netw. 2018, 115, 1–12. [CrossRef]
7. Masood, M.; Afify, L.H.; Al-Naffouri, T.Y. Efficient coordinated recovery of sparse channels in massive MIMO.
IEEE Trans. Signal Process. 2015, 1, 104–118. [CrossRef]
8. Cheng, X.; Sun, J.; Li, S. Channel estimation for FDD multi-user massive MIMO: A variational Bayesian
inference-based approach. IEEE Trans. Wirel. Commun. 2017, 11, 7590–7602. [CrossRef]
9. Dai, J.; Liu, A.; Lau, V.K.N. FDD massive MIMO channel estimation with arbitrary 2D-Array Geometry.
IEEE Trans. Signal Process. 2018, 10, 2584–2599. [CrossRef]
10. Xie, H.; Gao, F.; Jin, S.; Fang, J.; Liang, Y. Channel Estimation for TDD/FDD Massive MIMO Systems with
Channel Covariance computing. IEEE Trans. Wirel. Commun. 2018, 6, 4206–4218. [CrossRef]
11. Shen, W.; Dai, L.; Shi, Y.; Shim, B.; Wang, Z. Joint Channel Training and Feedback for FDD Massive MIMO
Systems. IEEE Trans. Veh. Technol. 2016, 10, 8762–8767. [CrossRef]
12. Tauböck, G.; Hlawatsch, F.; Eiwen, D.; Rauhut, H. Compressive Estimation of Doubly Selective Channels in
Multicarrier Systems: Leakage Effects and Sparsity-Enhancing Processing. IEEE J. Sel. Top. Signal Process.
2010, 2, 255–271. [CrossRef]
13. Tipping, M.E. Sparse Bayesian learning and the relevance vector machine. J. Mach. Learn. Res. 2001, 1,
211–244.
14. Christopher, M. Bishop. Bayesian Linear Regression. In Pattern Recognition and Machine Learning;
Springer-Verlag: Heidelberg, Germany, 2006; pp. 152–160.
15. Fang, J.; Shen, Y.; Li, F.; Li, H.; Chen, Z. Support knowledge-aided sparse Bayesian learning for compressed
sensing. In Proceedings of the 2015 IEEE International Conference on Acoustics, Speech and Signal Processing,
Brisbane, Australia, 19–24 April 2015; pp. 3786–3790.
16. Ji, S.; Xue, Y.; Carin, L. Bayesian compressive sensing. IEEE Trans. Signal Process. 2008, 6, 2346–2356. [CrossRef]
17. Lu, W.; Wang, Y.; Fang, Q.; Peng, S. Compressive Channel Estimation Based on Weighted IRLS in FDD
Massive MIMO. Wirel. Pers. Commun. 2018, 2, 1–10. [CrossRef]
18. Candes, E.J.; Romberg, J.K.; Tao, T. Stable signal recovery from incomplete and inaccurate measurements.
Commun. Pure Appl. Math. 2006, 8, 1207–1223. [CrossRef]

112
electronics
Article
Multiple-Symbol Non-Coherent Detection for
Differential QAM Modulation in Uplink Massive
MIMO Systems
Hieu Trong Dao and Sunghwan Kim *
School of Electrical Engineering, University of Ulsan, 93 Daehak-ro, Nam-gu, Ulsan 44610, Korea;
[email protected]
* Correspondence: [email protected]; Tel.: +82-52-259-1401

Received: 21 May 2019; Accepted: 17 June 2019; Published: 20 June 2019

Abstract: In this paper, we propose a novel multiple-symbol detector based on maximum likelihood
metric for differential quadrature amplitude modulation in massive multiple-input multiple-output
(MIMO) systems. While current research on differential modulation in massive MIMO has focused
on two consecutive symbols, our proposed detector is based on multiple-symbol, which is larger
than or equal to two. Moreover, we derive new distance based on the proposed detector. To encode
and decode data, we apply existing look-up table algorithm using the proposed distance, which is
known as optimum encoding algorithm for differential modulation. Simulation results show the
improvement based on the bit-error-rate performance since the proposed detector and distance vary
according to the channel statistic information.

Keywords: 5G wireless networks; massive MIMO; non-coherent detection; QAM

1. Introduction
Massive multiple-input multiple-output (MIMO) transmission technique has gained a lot of
attention in recent decades [1–13], since it can achieve significant improvement in terms of the energy
and spectral efficiency while using simple signal processing [1–4]. Massive MIMO systems prefer
operating in the time division duplex (TDD) mode in which users must synchronously send mutually
orthogonal pilot signals to the corresponding base station (BS) so that the BS can estimate the channels.
This method uses the estimated channels to perform signal processing [5–13], so that pilot signals
account for a significant part of the total coherent interval, which decreases the spectral efficiency.
In addition, when the number of users is large, the orthogonal pilot set has to be reused in every cell,
which leads to pilot contamination problems; this is considered as a performance bottleneck in massive
MIMO systems.
The authors of [8] investigate the power allocation to improve the spectral efficiency, this require
a large information exchange in backhaul of system between BSs or between BSs and users. Besides,
the algorithm to optimize the power is quite complex. Some semi-blind and blind channel estimation
methods in uplink massive MIMO have been proposed. In [9], the authors proposed an eigenvalue
decomposition-based method to blindly estimate the uplink channel from the data signal. However,
they assumed that the number of antennas was very large such that the channel vectors become
mutually orthogonal. The authors of [10] derived a new channel estimator based on subspace projection.
However, this channel estimation algorithm relies heavily on the eigenvalues of the channel matrix.
Interestingly, the authors of [11] proposed an energy detection scheme in which data symbols could
be detected without relying on estimated channels. The scheme used in [11] requires designing
unique modulated signal constellation for each user in the system. Another promising technique
that does not require estimated channels is differential modulation, but it has not received much

Electronics 2019, 8, 693; doi:10.3390/electronics8060693 113 www.mdpi.com/journal/electronics

Electronics 2019, 8, 693

attention in the massive MIMO research field until now. In [12], differential quadrature amplitude
modulation (DQAM) was proposed for massive MIMO systems. The modulation scheme used in [12]
was based on the asymptotic behavior of the channel when the number of BS antennas goes to
infinity; however, the authors of [12] did not show the detector when the number of BS antennas is
finite. The authors of [13] generalized the QAM detector in [12] and proposed a new detector and
non-coherent distance with better performance when the number of BS antennas is not very large.
The differential encoding part of [13] was done via the look-up table algorithm used in [14,15]; this is
known as the optimum encoding algorithm for differential modulation. Besides, both [12,13] can only
detect two consecutive symbols at a time. Recently, the authors of [16] developed a new differential
detector based on multiple-symbol differential detection (MSDD) and the generalized likelihood ratio
test (GLRT) criterion. However, the authors of [16] only consider the case of M-ary Phase Shift Keying
modulation (M-ary PSK).
In this paper, we propose a novel multiple-symbol detector for DQAM based on the maximum
likelihood metric, which can detect more than two symbols at a time and varies following channel
condition to adapt better with the change in environment. In addition, we propose a novel distance
which can be used to encode and decode data by using the look-up table algorithm in [14,15] for DQAM
encoding. Since the proposed scheme varies following the change in channel statistic information
while the schemes in [12,13] are unchanged, they adapt better to the change of environment and show
significantly better performance when compared to previous works [12,13].

2. System Model and Previous Works

Similar to [12,13], we also consider an uplink massive MIMO system consisting of a single-antenna
user and a base station equipped with a large number of antennas M ( M 1). We consider the
TDD mode and the block fading model which are popular used in research on massive MIMO
system [1,3,5,7,8], in which the channel is unchanged in one coherent interval T. The received signal
vector at the mth BS antenna is modeled as [1]
√
ym = ρhm x + nm , (1)

where ρ presents the average signal-to-noise ratio (SNR); x = [ xt , xt+1 , ..., xt+ L−1 ] T is the transmit signal
vector with length L( L ≤ T ) and E[||x||2 ] = L, where elements of x are taken from the conventional
QAM constellation as shown in Figure 1a, nm is the additive white Gaussian noise vector at the mth BS
antenna whose entries follow CN (0, 1); and hm is the channel coefﬁcient with CN (μh , σh2 ). An example
of the block fading model and signal vector is illustrated as in Figure 1b.

(a) (b)

Figure 1. Conventional 16-QAM constellation and illustration of block fading. (a) 16-QAM constellation;
(b) an illustration of block fading length T = 7 and signal vector length L = 3.

For simplicity, we normalize the channel so that μ2h + σh2 = 1. Since Rayleigh and Rician fading
models are very popular in evaluating system performance on both massive and regular MIMO

114
Electronics 2019, 8, 693

system [17,18], we focus on these two models with μh = Kr

1 + Kr and σh = 1
1 + Kr [12]. Thus,
the channel vector from a user to a BS now can be modeled as

h = μh hLOS + σh hNLOS , (2)

where Kr represents the Rician factor; In a special case, when Kr = 0, the channel becomes a Rayleigh
fading channel. Additionally, hLOS = [1, exp− jπ sin(θ ) , ..., exp− jπ ( M−1) sin(θ ) ] T ∈ C M×1 with the arrival
angle θ; is the light-of-sight (LOS) component when the antenna spacing is a half of wavelength.
hNLOS ∈ C M×1 denotes the non-light-of-sight (NLOS) component whose elements follow i.i.d
Gaussian variables with zero mean and unit variance.
In [12,13], the authors considered two consecutive tth and (t − 1)th instants, with channel vectors
ht ; and ht−1 , and assumed ht ≈ ht−1 . The received signal vector at the tth instant is given as
√
yt = ρht xt + nt . (3)

where xt is taken from a 16-DQAM constellation based on [19]. With a very large number of BS
antennas M, they have
1 H 1 H
lim ht ht−1 = 1, lim nt ht−1 = 0,
M→∞ M M→∞ M
(4)
1 H 1 H
lim ht nt−1 = 0, lim nt nt−1 = 0,
M→∞ M M→∞ M

Eventually, the signal symbol at the tth instant can be detected as

1 T ∗
rt = y y = ρxt xt∗−1 for very large M, (5)
M t t −1
in which xt xt∗−1 can be mapped back to the information symbol by the encoding rule of [19]. However,
the authors of [12] did not propose a detector for when the number of BS antennas M is ﬁnite.
The authors of [13] generalized the detector in [12] as ([13], Equation (6)), which can be applied for any
value of M. After that, they proposed a new two consecutive-symbol detector based on the conditional
probability and a new non-coherent distance as ([13], Equations (9) and (10)). The new distance in [13]
is used for look-up table algorithm in [14,15] for differential encoding.
Particularly, the authors of [14] had already proved that any differential encoding techniques can
be transform equivalently to a differential encoding via a look-up table. Using the algorithm in [14,15]
to create look-up table for encoding and decoding 16-DQAM signal, a brief explanation of a look-up
table is as follows. The readers should refer to [14,15] for the details of the algorithm.

• Sort all possible codeword pair c x and cy in descending order based on their proposed distance.
• Arrange all groups: Take codeword pair one by one from the sorted list L. Put two codewords
of a pair into the same group if possible; Otherwise, put them into two different groups and the
two groups are written into L, which is called the list of near group pairs (near groups means
the distance between members in two groups are small, which means there is a big chance that
these two groups may be wrongly estimated as each other). Two codewords cannot be in the same
group if their ﬁrst symbols are identical.
• Assign modulated symbol to each group G1 , G2 . . . , G16 based on list L: Take group pairs one by
one in order from list L. The number of different bits assigned for the considered two groups
taken from L should be as small as possible (since two near groups have big change to be wrongly
estimated as each other, this step’s goal is to minimize the bit error).

Some properties of the look-up table are listed up below.

115
Electronics 2019, 8, 693

• The number of rows in the table are numbers of groups of codewords.

• Codewords belong to the same group will encode for the same information.
• Number of codewords in each group are the same.
• The important metric to generalize the look-up table is the non-coherent distance between any
codewords, which is one of the new propose in this paper.
• General rules of generating the look-up table is that two codewords with small non-coherent distance
should be placed in the same group. Otherwise, they should be putted into different groups.

Moreover, this look-up table can be optimized by using algorithms proposed in [14,15]. In this
paper, we propose new non-coherent detector, new non-coherent distance and apply the algorithm
in [14,15] to generate the look-up table. We compare the performance of the proposed detector and
distance to the existing detectors and distances proposed in [12,13]. Due to the limitation of length,
we would like to skip the detail of the look-up table algorithm and refer interested readers to [14,15].

3. New Differential Detector and Non-Coherent Distance

Consider the received signal at mth BS antenna as in Equation (1), the conditional probability of
the received signal vector ym , given transmitted signal vector x, is calculated as

1 1
p(ym |x) = exp − (ym − ȳm ) H Ry−1 (ym − ȳm ) (6)
2π det( Ry ) 2

where ȳm is the mean of ym and is given as

√
ȳm = E{ym } = ρμh x, (7)

where det( Ry ) is the determinant of Ry , and Ry is the covariance matrix of ym , which can be
calculated as
Ry = E (ym − ȳm )(ym − ȳm ) H

√
= E ρ(hm − μh )xx H (hm − μh ) H + ρ(hm − μh )xnm
H
(8)

√
+ ρ( hm − μh ) H nm x H + nm nm
H
= (ρσh2 xx H + IN ).

Since the proposed detector aims to maximize the summation of the conditional probability
of received signal vector ym at all BS antennas 1 ≤ m ≤ M, given transmitted signal vector x,
the estimated signal vector x̂ can be calculated as

M
x̂ = arg max
x∈χ
∑ p ( y m | x ), (9)
m =1

where χ presents the vector space of all possible transmitted signal vectors x. Since the natural
logarithm function is monotonically increasing, maximizing p(ym |x) is equivalent to maximize
ln p(ym |x). Finally, the proposed detector is given as
M
x̂ = arg max ∑ −(ym − ȳm ) H Ry−1 (ym − ȳm ) − ln(det( Ry )) , (10)
x∈χ
m =1

Specially, when the channel is Rayleigh fading, we have ȳm = 0, and the proposed
detector becomes M
x̂ = arg max ∑ −ymH R−y 1 ym − ln(det( Ry )) ,
x∈χ,xt ∈ Q1
(11)
m =1

116
Electronics 2019, 8, 693

where Q1 is the first quadrant. This means the number of decision values that need to be calculated is
L
reduced by a factor of four, from N L to N4 , with N-QAM. The reduction occurs since when ȳm = 0,
for any codeword x = [ xt , xt+1 , ..., xt+ L−1 ] T with the first symbol xt belongs to the first quadrant of
the constellation, there are also three other codewords which have the first symbol xt belong to three
other quadrants, that have the same estimated vector x̂ as x. In other words, we only need to calculate
the decision values of codewords which have the first symbol xt belongs to the first quadrant of the
constellation in case of Rayleigh fading. To calculate the proposed non-coherent distance, we propose
the distance from x1 to x2 and the distance from x2 to x1 as in Equations (12) and (13) based on the
proposed detector, Equation (10), as below:

d(x1 → x2 ) = ||[−ρ(ch x1 ) H R− H −1
x1 ( c h x1 ) − ln(det( R x1 ))] − [− ρ ( c h x1 ) R x2 ( c h x1 ) − ln(det( R x2 ))]||;
1
(12)

d(x2 → x1 ) = ||[−ρ(ch x2 ) H R− H −1
x2 ( c h x2 ) − ln(det( R x2 ))] − [− ρ ( c h x2 ) R x1 ( c h x2 ) − ln(det( R x1 ))]||.
1
(13)

R x1 , R x2 andch in Equations (12) and (13) are calculated as

R x1 = ρσh2 x1 x1H + IL ; R x2 = ρσh2 x2 x2H + IL ; ch = σh + μh ; (14)

The Equation (12) is the non-coherent distance from x1 to x2 , which is based on the assumption
that we did send the codeword x1 but the detector wrongly estimated that x2 was sent. Inversely,
the Equation (13) is the non-coherent distance from x2 to x1 in which, the x2 was actually sent but
the detector wrongly estimated that x1 was sent. In other words, Equations (12) and (13) can be
used by likelihood estimator as a distance between two codewords x1 and x2 . The larger the values
of Equations (12) and (13) are, the less chance the detector wrongly estimates between x1 and x2 .
Eventually, the proposed non-coherent distance is calculated as

d(x1 , x2 ) = min(d(x1 → x2 ), d(x2 → x1 )). (15)

In the differential encoding part, we apply the look-up table algorithm for DQAM as in [14,15] by
using the proposed non-coherent distance in Equation (15).
The main contribution of these above steps and equations are summed up as follow.

• Equation (10) is our proposed detector which is derived based on the conditional probability
of received signal and contains channel statistical information, so that it adapts better with the
change in environment.
• Equation (11) is a simpliﬁed detector when the channel is Rayleigh fading.
• Equations (12) and (13) are derived based on Equation (10), and they are used to evaluate the
distance between two signal vectors (a small distance means that it is easy to wrongly decode
between two signal vectors).
• d(x1 → x2 ) is distance from x1 to x2 which is used to evaluate how likely it is to wrongly estimate
x1 as x2 . Similar explanation is applied to d(x2 → x1 ).
• Eventually, we use Equation (15) to take the minimum value between d(x1 → x2 ) and d(x2 → x1 )
as the distance between x1 and x2 .

4. Numerical Results
In simulations, we use the conventional 16-QAM constellation and apply the look-up table
algorithm in [14,15] to differentially encode the information. Particularly, one 4-bit information symbol
is encoded into two consecutive 16-QAM points; thus, the non-coherent distance as in Equation (15)
calculated with the length of transmitted signal vectors x1 , x2 is 2. Finally, the look-up table for
16-DQAM has 16 rows presenting 16 different groups; each group contains 16 different vectors x,
and all transmitted signal vectors in the same group correspond to the same information symbol.
The look-up table for the proposed 16-DQAM scheme with Rician fading channel Kr = 1 and an

117
Electronics 2019, 8, 693

average SNR = −4 dB is given in Table 1 as an example. Notice that, in [14,15] after generating the
look-up table, there is one more step that maps information bit to each group. This step adds a little
more improvement in bit error rate performance since groups with small non-coherent distance are
mapped to information bit symbols with a small difference in the number of bits. However, in this
simulation, we focus on comparing the performance of detector and distance between our proposed
ones and previous ones in [12,13] so that we skip this step and add the information bit symbols
sequentially from ﬁrst group to last group in the look-up table.

Table 1. Look-up table for 16-DQAM using proposed distance, L = 2.

G1 (0; 0) (1; 1) (2; 2) (3; 0)

G2 (0; 1) (1; 6) (2; 0) (3; 1)
G3 (0; 2) (1; 0) (2; 13) (3; 2)
G4 (0; 3) (1; 3) (2; 3) (3; 3)
G5 (0; 4) (1; 5) (2; 6) (3; 4)
G6 (0; 5) (1; 10) (2; 4) (3; 5)
G7 (0; 6) (1; 4) (2; 1) (3; 6)
G8 (0; 7) (1; 7) (2; 7) (3; 7)
G9 (0; 8) (1; 9) (2; 10) (3; 8)
G10 (0; 9) (1; 14) (2; 8) (3; 9)
G11 (0; 10) (1; 8) (2; 5) (3; 10)
G12 (0; 11) (1; 11) (2; 11) (3; 11)
G13 (0; 12) (1; 13) (2; 14) (3; 12)
G14 (0; 13) (1; 2) (2; 12) (3; 13)
G15 (0; 14) (1; 12) (2; 9) (3; 14)
G16 (0; 15) (1; 15) (2; 15) (3; 15)

Figure 2 illustrates the simulation results of the proposed 16-DQAM scheme with different lengths
of estimated signal vector L = 2 and 3, as well as the 16-DQAM schemes in [12,13] where the number
of BS antennas are M = 128 and 500, and the coherent length T = 7. Since the authors of [12]
did not show the detector when M is ﬁnite, we suppose that the 16-DQAM scheme in [12] uses the
generalized detector as in [13], Equation (6), and the corresponding non-coherent distance in [13].
As previously shown in [12,13], the 16-DQAM scheme of [12] shows an error ﬂoor when M is not very
large. The proposed scheme significantly outperforms the schemes in [12,13] for both M = 128 and 500.

Figure 2. Performance comparison between the proposed 16-DQAM scheme and the previous works
of References [12,13] under Rayleigh fading.

118
Electronics 2019, 8, 693

Since the schemes in [12,13] can only detect two consecutive symbols at a time, we simulate our
proposed scheme when the length of signal vector is L = 2. We can clearly see that, with the same
channel condition and signal vector’s length, the proposed scheme outperforms the other schemes
for M = 128 with nearly 3 dB when BER = 10−4 . With not so large number of BS antenna M = 128,
the significant improvement of the proposed scheme shows a huge potential that it can be deployed in
real system. When M increases to 500, the BER performance increases much more further with nearly
5 dB at BER = 10−4 , which also shows the advantages of massive MIMO with very large number of
BS antennas.
Noticeably, the performance of the proposed scheme is improved significantly when the length
of the estimated signal vector L increases, regardless of the value of M. The gain is nearly 1.5 dB at
BER = 10−5 when the length L increases from 2 to 3. When M = 500, the scheme of [13] only performs
better than [12] at low BER (≤ 10−4 ) while the performance of the proposed scheme is remarkably
better than both [12,13]; the gain is approximately 3 dB at BER = 10−5 in comparison with [13].
Figure 3 shows the simulation results for the aforementioned schemes under Rician fading with
T = 7, L = 2, M = 128 and different values of the Rician factor Kr = 0, 1, and 10. Notice that when
Kr = 0, the Rician channel becomes a Rayleigh channel. The error floor still happens when Kr = 1
in the case of [12]. However, when the LOS component of the Rician channel becomes stronger with
Kr = 10, the error floor seems to disappear and the performance of [12] is improved much more
than [13]; the performance gap is nearly 1 dB at BER = 10−5 . The performances of scheme of [13] are
nearly the same with different values of Kr , this is because the detector in [13] cancels out the channel
coefficient between two consecutively received symbols. The performance of the proposed scheme
is the best among three schemes. Even with Kr = 0 (i.e., Rayleigh fading), the performance of the
proposed scheme remains better than the other schemes with Kr = 1 or 10. When Kr is increased from
0 to 1 and 10, the performance of the proposed scheme is improved significantly with gains 5.5 dB
and 9.5 dB, respectively. In summary, we conclude that the proposed scheme shows much better
performance in Rician channels than in Rayleigh channels.

Figure 3. Performance comparison between the proposed 16-DQAM scheme and the previous works
of References [12,13] under Rician fading with different Rician factors.

5. Conclusions
In this paper, we propose a new detector and non-coherent distance for differential QAM
modulation in massive MIMO systems. We also apply the well-known look-up table algorithm

119
Electronics 2019, 8, 693

for DQAM encoding using the proposed non-coherent distance. The proposed detector can detect
multiple symbols (≥ 2) at a time. The proposed scheme varies following the change in channel
information statistics, allowing them to adapt better to the change in environment. Additionally, they
can be applied in a wide class of channels with a not too large number of base station antennas. This
paper focuses on massive MIMO system with single cell. Therefore, as a future work, it will be very
attractive to investigate the performance and how to improve the proposed scheme in the multiple-cell
environment where there is the presence of interference between users in nearby cells.

Author Contributions: All authors discussed the contents of the manuscript and contributed to its presentation.
H.T.D. designed and implemented the proposed scheme, analyzed the simulation results and wrote the paper
under the supervision of S.K.
Funding: This research was funded by the Research Program through the National Research Foundation of Korea
(NRF-2016R1D1A1B03934653, NRF-2019R1A2C1005920).
Conﬂicts of Interest: The authors declare no conﬂict of interest.

Abbreviations
The following abbreviations are used in this manuscript:

MIMO Multiple-input multiple-output

BS Base station
TDD Time Division Duplex
DQAM Differential Quadrature Amplitude Modulation
SNR Signal-to-noise
LOS Light-of-Sight
NLOS non Light-of-Sight
AWGN Additive white Gaussian noise

References
1. Marzetta, T.L. Noncooperative cellular wireless with unlimited numbers of base station antennas. IEEE Trans.
Wirel. Commun. 2010, 9, 3590–3600. [CrossRef]
2. Kim, K.J.; Choi, K.J.; Lee, S.R.; Kim, K.S. Multi-user massive MIMO for next-generation WLAN systems.
IET Electron. Lett. 2015, 51, 792–794. [CrossRef]
3. Lu, L.; Li, G.Y.; Swindlehurst, A.L.; Ashikhmin, A.; Zhang, R. An overivew of massive MIMO: Beneﬁts and
challenges. IEEE J. Sel. Top. Signal Process. 2014, 8, 742–758. [CrossRef]
4. Hassan, N.; Fernando, X. Massive MIMO Wireless Networks: An Overview. Electronics 2017, 6, 63. [CrossRef]
5. Trong Dao, H.; Kim, S. Worst Cell Based Pilot Allocation in Massive MIMO Systems. Electronics 2018, 7, 197.
[CrossRef]
6. Nyarko, J.K.N.; Mbom, C.A. A Performance Study of Massive MIMO Heterogeneous Networks with
Ricean/Rayleigh Fading. Electronics 2018, 7, 79. [CrossRef]
7. Zhu, X.; Dai, L.; Wang, Z.; Wang, X. Weighted-graph-coloring-based pilot decontamination for multicell
massive MIMO systems. IEEE Trans. Veh. Technol. 2017, 66, 2829–2834. [CrossRef]
8. Dao, H.T.; Kim, S. Pilot power allocation for maximising the sum rate in massive MIMO systems.
IET Commun. 2018, 12, 1367–1372. [CrossRef]
9. Ngo, H.Q.; Larsson, E.G. EVD-based channel estimation in multicell multiuser MIMO systems with very
large antenna arrays. In Proceedings of the 2012 IEEE International Conference on Acoustics, Speech and
Signal Processing (ICASSP 2012) , Kyoto, Japan, 25–30 March 2012; pp. 3249–325.
10. Muller, R.R.; Cottatellucci, L.; Vehkapera, M. Blind pilot decontamination. IEEE J. Sel. Top. Signal Process
2014, 8, 773–786. [CrossRef]
11. Manolakos, A.; Chowdhury, M.; Goldsmith, A. Energy-Based Modulation for Noncoherent Massive SIMO
Systems. IEEE Trans. Wirel. Commun. 2016, 15, 773–786. [CrossRef]
12. Kong, D.; Xia, X.; Jiang, T. A Differential QAM Detection in Uplink Massive MIMO Systems. IEEE Trans.
Wirel. Commun. 2016, 15, 6371–6383. [CrossRef]

120
Electronics 2019, 8, 693

13. Wei, R.; Wang, X. Differential 16-QAM and 16-APSK for Uplink Massive MIMO Systems. IEEE Wirel.
Commun. Lett. 2017, 7, 170–173. [CrossRef]
14. Wei, R.-Y. Differential encoding by a look-up table for quadrature amplitude modulation. IEEE Trans. Commun.
2011, 59, 84–94. [CrossRef]
15. Wei, R.-Y.; Chen, L.-T. Further results on differential encoding by a table. IEEE Trans. Commun. 2012, 60,
2580–2590. [CrossRef]
16. Wang, Y.; Tian, Z. Multiple Symbol Differential Detection for Noncoherent Communications With Large-Scale
Antenna Arrays. IEEE Wirel. Commun. Lett. 2018, 7, 190–193. [CrossRef]
17. Angelis, C.T.; Chronopoulos, S.K. System Performance of an LTE MIMO Downlink in Various Fading
Environments. In International Conference on Ambient Media and Systems; Springer: Berlin, Germany, 2011;
pp. 36–43.
18. Chronopoulos, S.K.; Christoﬁlakis, V.; Tatsis, G.; Kostarakis, P. Preliminary BER Study of a TC-OFDM system
operating under noisy conditions. J. Eng. Sci. Technol. Rev. 2016, 9, 13–16. [CrossRef]
19. Egri, R.G.; Horrigan, F.A. A ﬁnite group of complex integers and its application to differentially coherent
detection of QAM signals. IEEE Trans. Inf. Theory 1994, 40, 216–219. [CrossRef]

121
electronics
Article
Throughput Enhancement in Downlink MU-MIMO
Using Multiple Dimensions
Jong-Gyu Ha, Jae-Hyun Ro and Hyoung-Kyu Song ∗
Department of Information and Communication Engineering, uT Communication Research Institute,
Sejong University, Gunja-dong 98, Gwangjin-gu, Seoul 05006, Korea
* Correspondence: [email protected]; Tel.: +82-2-3408-3890

Received: 13 May 2019; Accepted: 4 July 2019; Published: 5 July 2019

Abstract: This paper focuses on the throughput performance enhancement in the single cell multi-user
MIMO (MU-MIMO) downlink system model. For better quality of service, this paper proposes the
scheme that increases system throughput and improves the spectral efﬁciency. Speciﬁcally, the signal
transmission and detection schemes are proposed by using multiple dimensions. At the transmitter
side, two dimensions (power and space) are adopted at the same time. To achieve multiple access
(MA), the space domain is exploited by using a block diagonalization (BD) precoding technique,
and the power domain is exploited to transmit more data symbols. At the receiver, the signal detection
structure corresponding to a transmitter is also proposed. In the simulation results, comparisons
of throughput performance are presented in various aspects. As a result, the proposed scheme
outperforms the conventional schemes using only one dimension in terms of throughput. This paper
shows strong performance in MU-MIMO senarios by adopting multiple dimensions.

Keywords: multi-user MIMO; space division multiple access (SDMA); block diagonalization (BD);
non-orthogonal multiple access (NOMA); broadcast channel

1. Introduction
In the future, in order to handle explosive data traffic, studies will probably aim to increase
channel capacity and data rate in the overall wireless communication system [1]. Multiple-input
multiple-output (MIMO) has been studied in wireless systems since it has dramatic gains in channel
capacity [2]. Also, multiuser MIMO (MU-MIMO) has been studied widely as a potential for improving
the overall throughput [3–5]. In downlink broadcasting (BC) channel, MU-MIMO is accomplished by
multiuser beamforming that eliminates the multiuser interference (MUI) completely [6–9]. A number
of users can be served by one base station (BS) simultaneously and the spectral efficiency can be
increased. Therefore, the use of space-division multiple access (SMDA) in the downlink channel
provides a considerable gain in system capacity. The sum rate of the MU-MIMO broadcast channel
is achieved by dirty paper coding (DPC). However, the critical drawback of DPC is extreme high
complexity to implement in practice [10]. Another promising technique in an MU-MIMO system is
block diagonalization (BD). In this paper, the proposed scheme considers BD as a generalization of the
channel inversion [11,12]. BD supports the multiple data stream with low complexity and approaches
the sum capacity of DPC using user selection algorithms [13].
On the other hand, non-orthogonal multiple access (NOMA) is one of the most promising
techniques for improving the overall spectral efficiency [14,15]. NOMA shares the same resources
with multiple users by exploiting the domains. The well-known NOMA schemes can be divided
into power-domain and code-domain NOMA. Power-domain NOMA multiplexing is achieved by
different allocated power for users according to the channel conditions. Symbols are superposed
to each user and receivers perform successive interference cancellation (SIC) [16]. Furthermore,

Electronics 2019, 8, 758; doi:10.3390/electronics8070758 123 www.mdpi.com/journal/electronics

Electronics 2019, 8, 758

MIMO-NOMA is another popular technique to increase sum capacity. In MIMO-NOMA work,

the signal processing techniques are investigated and two main routes exist: single-cluster and
multi-cluster MIMO-NOMA [17–19]. Using MIMO-NOMA, a full beneﬁt of system capacity and
user fairness is achieved by well-chosen power allocation, user clustering, and beamforming [20,21].
Most of these papers consider the user grouping and clustering. However, the proposed scheme
fundamentally does not consider user grouping and clustering. Due to the fundamental difference,
the proposed scheme has additional advantages. In this paper, the proposed signal transmission and
detection scheme use superposition coding (SC) and SIC in conjunction with MU-MIMO to improve
the system spectral efﬁciency. With the SDMA scheme, a number of data symbols can be served to
each user by SC and SIC. The power and space domain are fully exploited at the same time and the
overall system throughput can be improved in single cell MU-MIMO scenario. The main contributions
of this paper are summarized as follows:

1. First, the proposed scheme is written in a different view from the existing MU-MIMO papers.
Recently, many researches in MU-MIMO and NOMA have been mainly focused on increasing
spectral efﬁciency by using user clustering, power allocation, and beamforming. User clustering
means that close and far users are clustered within each beam and result in intra-beam interference.
Intra-beam interference is a major cause of performance degradation. In existing papers,
the resource reuse approach is highly affected by intra-beam interference and is critical to overall
system performance. However, the proposed scheme does not need to consider intra-beam
interference. The proposed scheme makes a single user in each beam and power domain is
exploited in a single user. In conclusion, power and spatial domains are exploited within a
single user.
2. Second, the proposed scheme improves performance through joint design that uses dimensions
appropriately according to wireless communication systems. In a wireless communication system,
it is important to decide which technique to use according to the system requirements and
characteristics along with cost and complexity constraints. While there are various combinations
of techniques that make up the wireless communication system, it is difﬁcult to get a clear
answer as to the best scheme for a complex multiuser system under a range of typical operating
conditions. This paper proposes a technique that can appropriately bring out the merits of various
combinations of techniques.
3. Finally, the proposed idea can be described from a different perspective and provides insights into
other system implementations. The key idea here is to use multiple dimensions appropriately. The
beauty of this approach is that it can even be applied to other multiple access systems. The various
system models can be implemented by using additional dimensions. This paper demonstrates
that the system throughput can be increased by using multiple dimensions. This concept can
be extended further. If the additional dimensions are used without degradation or with a little
tradeoff, the system throughput can be increased. The idea presented in this paper provides a
multitude of interesting avenues for future research.

The remainder of this paper is organized as follows. Section 2 describes the problems of the
existing scheme and explains the solution and the reasonability of the proposed scheme. Section 3
describes the overall system model and the proposed scheme. Also, this section presents the
performance of the proposed scheme. Section 4 presents simulation results and compares with
the conventional scheme under various conditions. Section 5 considers some extensions about the
implementation of the proposed scheme. Finally, conclusion is provided in Section 6.

2. Motivation
The novelty of the proposed scheme is to use multiple dimensions as much as possible.
Figure 1a,b represent examples of resource usage in NOMA and BD schemes using only one dimension,
respectively. Figure 1c represents example of resource usage for the proposed scheme. In Figure 1a,b

124
Electronics 2019, 8, 758

which represent existing techniques, the two techniques cannot be simply combined since all existing
schemes utilize each technique for multiple access. However, in the proposed scheme, the SC is used to
separate each symbol by utilizing the power domain. In conclusion, it is possible to utilize both power
and space domain at the same time and improve performance by changing usage in SC application.
In addition, through proper combination, the proposed scheme compensates the shortcomings of
existing schemes and has advantages. First, overhead of the BS is reduced. A dynamic user scheduling
and grouping strategy needs the feedback information at the BS. The proposed scheme can reduce the
feedback overhead signiﬁcantly by not considering intra-beam interference. Second, proper fairness
among users is assured. The interference is treated as noise to users and does not guarantee fairness
among users. In the proposed scheme, a certain error probability is ensured for all users. Third,
the complexity is reduced for each user. In the case of the existing NOMA scheme, it is necessary
to decode even if it is not the user’s own signal. However, in the proposed scheme, all the decoded
symbols become the user’s own data. Therefore, it is possible to reduce the complexity of the user in
demodulating unnecessary data. As a result, the proposed scheme allows more ﬂexibility in spreading
user signals over the multiusers.

(a)

(b)

Figure 1. Cont.

125
Electronics 2019, 8, 758

(c)

Figure 1. Examples of resource usage in various dimensions (a) Example of resource usage for
conventional NOMA; (b) Example of resource usage for conventional BD; (c) Example of resource
usage for proposed scheme.

By adopting the multiple dimensions, the same resources can be shared at each domain. In this
paper, both the power and space domain are exploited. Figure 2 shows an example of additional
dimension usage. Four dimensions (frequency, time, power, space) are used and the same resources
can be appropriately shared. Therefore, the more data symbols can be transmitted to more users.

Figure 2. The spectral efﬁciency using multiple dimensions.

This paper considers a downlink single-cell MU-MIMO system as shown in Figure 3. In the
proposed scheme, multiple access is accomplished by SDMA and the power domain is exploited to
transmit more data symbols to each user. As a result, the proposed scheme has signiﬁcant potential
to improve spectral efﬁciency and provide better wireless services to many users. Also, this paper
offers advantages in various design issues to meet the requirements and characteristics of the system
by using multiple dimensions.

126
Electronics 2019, 8, 758

Figure 3. The downlink single-cell MU-MIMO system.

3. Proposed Scheme
This section describes the proposed scheme from data transmission to signal detection. Using the
proposed system, more data symbols can be transmitted and more users can be served simultaneously.
First, the transmission model explains how two additional domains can be used to improve spectral
efﬁciency. Second, the signal detection model explains how each signal is detected and reliability can
be satisﬁed. Additionally, in the received SINR, some considerations for the proposed scheme are
presented. Finally, the sum throughput between the proposed scheme and conventional schemes is
compared. As a result, the throughput performance of the proposed scheme is superior to that of the
conventional scheme.

3.1. System Model

This paper considers a downlink MU-MIMO broadcast system which consists of one BS and
K users as shown in Figure 4. BS is equipped with Nt transmitting antennas and each user has Nr
receiving antennas. The MIMO channel of each user is assumed to be ﬂat fading, since frequency
selective fading channel can be easily overcome by using orthogonal frequency division multiplexing
(OFDM) modulation. The system model can be further extended to frequency selective fading MIMO
channel considering all subcarriers. In this system model, the transmit signal for the k-th user can be
denoted as follows,
xk = Wk sk . (1)

The received signal at the k-th user is given by

K
yk = Hk Wk sk + ∑ Hk W j s j +nk , k = 1, · · · , K, (2)
2 34 5 j=1,j =k
desired signal 2 34 5
undesired signals

where k and j are user indices, Wk is Nt × Nr precoding matrix for user k, sk is a Nr × 1 data symbol
vector, xk is a Nt × 1 precoded signal vector for the k-th user. yk is a received signal vector for the
k-th user and nk is Nr × 1 zero-mean additive white Gaussian noise (AWGN) vector with variance σ2 .
In the Equation (2), the ﬁrst term denotes signal in the intended direction (desired user sk ) and the
second term denotes multi-user interference caused due to undesired signals (undesired users s j )

127
Electronics 2019, 8, 758

Figure 4. The downlink MU-MIMO broadcasting model.

3.2. Data Transmission Model

To exploit both the power and space domain, BD and SC scheme are used. The overall
transmission model is shown in Figure 5. First, SC is used for exploiting power domain. The transmit
symbols are superposed with different powers in one signal. Therefore, the transmitter can transmit
more data stream at the same time. In the existing NOMA scheme, SC and SIC are used to suppress
the MUI by allocating the different powers to different users. However, the proposed scheme uses
SC for separating the symbols in one superposed signal on the same user which BS transmits. In the
proposed scheme, the transmit signals for the i-th receiving antenna can be written as follows,
- - -
si = P1 s̃i,1 + P2 s̃i,2 + · · · + PN s̃i,N , (3)

where i is a receiving antenna index, N is the number of symbols in one superposed signal. s̃ are
the symbols in one superposed signal. The proposed scheme allocates optional power to symbols to
detect each symbol, leading P1 < P2 < · · · < PN . Then, transmit signals for the k-th user are deﬁned
as follows,
⎡ ⎤ ⎡ √ √ √ ⎤
s1 ( k ) P1 s̃1,1 (k ) + P2 s̃1,2 (k) + · · · + PN s̃1,N (k )
⎢ ⎥ ⎢ √ √ √ ⎥
⎢ s2 ( k ) ⎥ ⎢ P1 s̃2,1 (k ) + P2 s̃2,2 (k) + · · · + PN s̃2,N (k ) ⎥
sk = ⎢
⎢ .. ⎥ =
⎥ ⎢
⎢ .. ⎥.
⎥ (4)
⎣ . ⎦ ⎣ . ⎦
√ √ √
s Nr (k ) P1 s̃ Nr ,1 (k) + P2 s̃ Nr ,2 (k) + · · · + PN s̃ Nr ,N (k )

Figure 5. The proposed signal transmission model.

128
Electronics 2019, 8, 758

For exploiting space domain, the precoding matrix W should be designed in SDMA. In the scheme,
BD beamforming is adopted to suppress the MUI since BD method shows a strong performance in
terms of capacity and has a good ﬂexibility. The objective of the BD method is to completely elminate
the MUI by employing the precoding matrix W. Then, the precoding matrix design for MUI elimination
can be deﬁned as follows,
H̃k Wk = 0, k = 1, · · · , K. (5)

where H̃k is represented as the channel matrix for all users except for user k,

H̃k = [H1T · · · HkT−1 HkT+1 · · · HKT ] T . (6)

With the help of singular value decomposition (SVD), the precoding matrix for eliminating the
MUI is designed. SVD is used to decompose a matrix into matrices representing rotation and scaling.
By applying the SVD, the H̃k is deﬁned as follows

(1) (0)
H̃k = Uk Λk [Vk Vk ] H , (7)

(0)
where Λk is the diagonal matrix of which the diagonal elements are singular value of H̃k . Vk contains
(1) (0)
vectors of the zero singular values, and contains vectors of the non-zero singular values. Vk is
Vk
an orthogonal basis for the null space of H̃k and the required precoding matrix. As a result, intended
user’s channel is projected on the null space in order to have the transmission under the constraint of
(0)
zero-interference (i.e.,H̃k Ṽk = 0 , k = 1, · · · , K.)
The received signal of the k-th user after eliminating MUI is deﬁned as follows,

yk = Hk W ⎡ k sk + nk ⎤= H⎡e f f ,k sk + n⎤
k
s1 ( k ) n1 ( k )
⎢ ⎥ ⎢ ⎥
⎢ s2 ( k ) ⎥ ⎢ n2 ( k ) ⎥
= He f f ,k ⎢
⎢ .. ⎥+⎢
⎥ ⎢ .. ⎥
⎥
⎣ . ⎦ ⎣ . ⎦
⎡ s Nr (k ) n (k) ⎤ ⎡ ⎤ (8)
√ √ √ Nr
P1 s̃1,1 (k ) + P2 s̃1,2 (k ) + · · · + PN s̃1,N (k) n1 ( k )
⎢ √ √ √ ⎥ ⎢ ⎥
⎢ P1 s̃2,1 (k ) + P2 s̃2,2 (k ) + · · · + PN s̃2,N (k) ⎥ ⎢ n2 ( k ) ⎥
= He f f ,k ⎢
⎢ .. ⎥+⎢
⎥ ⎢ .. ⎥.
⎥
⎣ . ⎦ ⎣ . ⎦
√ √ √
P1 s̃ Nr ,1 (k ) + P2 s̃ Nr ,2 (k) + · · · + PN s̃ Nr ,N (k ) n Nr (k )

where He f f ,k denotes the effective channel of the k-th user. According to the Equation (8), the MUI is
perfectly eliminated and the k-th user receives its own data. Finally, the users can be considered as
point-to-point MIMO.
As a result, by exploiting both the spatial and power domains additionally, more data symbols
can be transmitted in the same resource (frequency/time). In the proposed scheme, as the number of
symbols at each superposed signal is increased, sum throughput at each user is linearly improved.
Although the allocated power to each symbol is reduced, the total throughput is improved since the
number of transmit symbols is increased. Unlike other techniques that use only one dimension, such
as space or power domains, the proposed scheme can achieve signiﬁcant gains in overall system
throughput by using additional domains.

3.3. Signal Detection Model

The MUI is perfectly eliminated by using the precoding matrix. Since the k-th user receives its
own data without MUI, the appropriate receiver structure for each user is similar to the point-to-point
MIMO. In Equation (8), MIMO detection is performed. In MIMO detection algorithm, there exist linear
and non-linear algorithms. In the proposed scheme, linear detection algorithms such as zero-forcing

129
Electronics 2019, 8, 758

(ZF) and minimum mean squared error (MMSE) can be applied simply. If the ZF detection scheme is
used, the ﬁlter matrix for the k-th user is as follows,

Gk = HeHf f ,k (He f f ,k HeHf f ,k )−1 . (9)

The detected superposed signals of the k-th user using ZF MIMO detection can be represented
as follows,

ŝk = Gk yk = HeHf f ,k (He f f ,k HeHf f ,k )−1 (He f f ,k sk + nk )

−1 −1 (10)
= HeHf f ,k HeHf f ,k He f f ,k He f f ,k sk + HeHf f ,k (He f f ,k HeHf f ,k )−1 nk = sk + Gk nk .

In the Equation (10), sk can be detected since it satisﬁes Gk He f f ,k = I. However, if the linear
detection algorithms are used, the bit error rate (BER) performance is too poor since the power of
symbols in one superposed signal is low. Low performance detection techniques in terms of BER
can cause a negative effect on performing SIC. Therefore, the non-linear detection algorithms can
be applied such as maximum likelihood (ML), ordered successive interference cancellation (OSIC),
decision feedback equalizer (DFE), QRD-M.
From Equation (10), the estimated superposed signals of the k-th user after performing MIMO
detection can be reconstructed as follows,
⎡ ⎤ ⎡ √ √ √ ⎤
ŝ1 (k ) P1ˆ˜s1,1 (k) + P2ˆ˜s1,2 (k) + · · · + PNˆ˜s1,N (k )
⎢ ⎥ ⎢ √ √ √ ⎥
⎢ ŝ2 (k ) ⎥ ⎢ P1ˆ˜s2,1 (k) + P2ˆ˜s2,2 (k) + · · · + PNˆ˜s2,N (k ) ⎥
ŝk = ⎢
⎢ .. ⎥=⎢
⎥ ⎢ .. ⎥,
⎥ (11)
⎣ . ⎦ ⎣ . ⎦
√ √ √
ŝ Nr (k ) P1ˆ˜s Nr ,1 (k) + P2ˆ˜s Nr ,2 (k) + · · · + PNˆ˜s Nr ,N (k )

where ˆ˜s are the symbols at each estimated superposed signal. In Equation (11), the superposed signal
can be decoded by conducting SIC. Therefore, SIC is performed at each receiving antenna. In estimated
√
superposed signal, the strong symbol, i.e., p N s̃ Nr,N is ﬁrst decoded. The ﬁrst decoded symbol can be
represented as follows, ⎡ d ⎤
ˆ˜s (k)
1,N
⎢ ˆ˜sd k ⎥
⎢ ( ) ⎥
ˆ˜sdk = ⎢ 2,N. ⎥. (12)
⎢ . ⎥
⎣ . ⎦
ˆ˜sdN ,N (k )
r

The decoded symbol is then subtracted from the superposed signal.

⎡ √ √ √ ⎤ ⎡ d ⎤ ⎡ d ⎤
1,N −1 ( k )
P ˆ˜s (k) + P2ˆ˜s1,2 (k ) + · · · + PNˆ˜s1,N (k ) ˆ˜s (k ) ˆ˜s
⎢ √ 1 1,1 √ √ 1,N
⎥ ⎢ ˜ˆsd k ⎥ ⎢ ˜ˆsd ⎥
⎢ P2ˆ˜s2,1 (k) + P2ˆ˜s2,2 (k ) + · · · + PNˆ˜s2,N (k ) ⎥ ⎢ 2,N ( ) ⎥ ⎢ 2,N −1 (k ) ⎥
ŝk − ˆ˜sk = ⎢ ⎥−⎢ ⎥=⎢ ⎥.
d
⎢ .. ⎥ ⎢ .. ⎥ ⎢ .. ⎥ (13)
⎣ . ⎦ ⎣ . ⎦ ⎣ . ⎦
√ √ √
P1ˆ˜s Nr ,1 (k ) + P2ˆ˜s Nr ,2 (k) + · · · + PNˆ˜s Nr ,N (k ) ˆ˜s N ,N (k )
d
r
ˆ˜s
Nr ,N −1 ( k )
d

Finally, all the symbols in the superposed signal can be decoded by performing SIC. The receiver
model is described in Figure 6.

130
Electronics 2019, 8, 758

Figure 6. The proposed signal detection model.

3.4. Received SINR

In this subsection, signal to interference plus noise ratio (SINR) for the symbols in a superposed
signal is represented to consider the performance of proposed scheme. If it is assumed that the two
√ √
symbols are superposed in one superposed signal, i.e., si = P1 s̃i,1 + P2 s̃i,1 , the received signal at
the i-th receiving antenna for each user after eliminating MUI can be deﬁned as follows,
- -
yi = λi ( P1 s̃i,1 + P2 s̃i,2 ), (14)

where i is received antenna index, λi is a channel gain in one MIMO parallel channel. In this case,
P1 < P2 subject to total power constraint. Therefore, s̃i,1 is a weak symbol and s̃i,2 is a strong symbol.
Then the received SINR for the strong symbol is deﬁned as follows,

λi P2
SINRs̃2 = . (15)
σ2 + λi P1
The weak symbol can be decoded by employing SIC. If it assumed that perfect SIC decoding is
conducted, the received SINR for weak symbol is deﬁned as follows,

λi P1
SINRs̃1 = . (16)
σ2
In the proposed scheme, SINR shows two negative effects in terms of system throughput. First,
the weak symbol acts as an interference for received SINR of the strong symbol. As shown in Equation
(15), λi P2 term is considered as noise. Second, strong symbols is likely be decoded incorrectly.If a strong
symbol is not properly decoded, it can adversely affect the decoding of weak symbol. The negative
effects for the system throughput are restated in the simulation result. Although there are some
degradations in terms of throughput on exploiting the power domain, the system throughput is
increased since the number of transmit symbols is linearly increased at high SNR.

4. Simulation Results
This section shows simulation results to demonstrate the throughput gain of the proposed scheme
and compares the results with other conventional schemes. Simulation results also provide some
considerations for the proposed scheme. The simulations are performed in seven multi-path Rayleigh
fading and time-invariant channel model. OFDM overcomes the frequency selectivity of the wideband
channel and multiple carriers enable the high rate data transmission [22,23]. The OFDM symbol is
composed of 128 FFT size, four pilots and the 108 data subcarriers based on speciﬁcation for IEEE
802.11n. The remaining 16 subcarriers are zero padding and OFDM symbol duration is 4 microseconds.

131
Electronics 2019, 8, 758

All the simulation are simulated with quadrature phase-shift keying (QPSK) modulation (data bits per
subcarrier is 2).
Figure 7a,b compare the sum throughput between the proposed schemes and the conventional
schemes for the case of two users as a function of SNR. The throughput T is calculated as follows,

T = Nb × (1 − E) L × K ÷ Ts
Nb = Ns × O × Nr × s, (17)
L = Ns × O,

where Nb is the number of transmit data bits and L is the number of data bits in one OFDM symbol. E
is the BER for each user and Ns is the number of data subcarriers. O is the number of data bits per
subcarrier and s is the number of symbols in one superposed signal. Ts is the OFDM symbol duration.
In the simulation, proposed scheme and conventional BD are the MIMO system that BS and each user
have multiple antennas. On the other hand, the conventional NOMA has one antenna at the BS and
each user since multiple access is accomplished by the power domain. The proposed scheme and
BD scheme are simulated in the case of Nt = 4, Nr = 2, K = 2 (4, 2, 2) and NOMA scheme is Nt = 1,
Nr = 1 K = 2 (1, 1, 2). And in the proposed scheme, the number of symbols at each superposed signal
is two ( N = 2). Therefore, the total number of transmit symbols to each user is 8. The power of each
symbol is allocated at a ratio of 8:2 from the total power Pt = 1. In MIMO system, the ML MIMO
detection is applied before conducting SIC. The summary of the simulation parameters is shown in the
Table 1.

Table 1. Simulation Parameters.

Scheme Ns O Nr s K
Conventional BD 108 2 2 1 2
Conventional NOMA 108 2 1 1 2
Proposed Scheme 108 2 2 2 2

In the simulations, there are two types of simulation results: with and without SIC error.
Figure 7a,b show the simulation result with SIC error and Figure 7c,d show the simulation result
without SIC error. Both cases of the proposed scheme outperform the conventional schemes in terms
of maximum throughput since the proposed scheme exploits both the spatial and power dimensions.
The conventional BD shows better performance than the proposed scheme in terms of BER. However,
the proposed scheme has higher throughput since the proposed scheme transmits more symbols.
In Figure 7b, the BER performance of the proposed scheme is better than that of NOMA since the
effect of MIMO detection and SIC decoding is improved at high SNR. Figure 7c,d show the impact of
SIC error on the proposed scheme. Without SIC error, throughput and BER performance are better
than when there is SIC error. If the strong symbol is wrongly decoded at low SNR, the weak signal is
also wrongly decoded. As a result, the error propagation occurs. Therefore, the system needs to be
designed to avoid error propagation and reduce SIC error. As the proposed scheme minimizes the
impact of SIC error, the performance of the proposed scheme approaches the case where there is no
SIC error.

132
Electronics 2019, 8, 758

(a) (b)

Figure 7. Throughput and BER performance of conventional and proposed schemes: (a) Throughput
performance with SIC error; (b) BER performance with SIC error; (c) Throughput performance without
SIC error; (d) BER performance without SIC error.

Figure 8a, b show the performance difference of the proposed scheme according to the number
of users and antennas ( Nt , Nr , K ). The cases of (6, 3, 2) and (6, 2, 3) have the higher throughput
performance than (4, 2, 2) case since (6, 3, 2) and (6, 2, 3) transmit more symbols by exploiting both
space and power dimension. (6, 2, 3) case has lower throughput performance in low SNR than (6, 3, 2)
case since (6, 3, 2) case allocates more symbols to each user. On the other hand, in case of (6, 2, 3), one
more user can be serviced. In terms of BER, (6, 3, 2) has better BER performance than (6, 2, 3) since
(6, 3, 2) transmits more symbols to each user than (6, 2, 3). (4, 2, 2) has better BER performance than
(6, 2, 3) since more power is allocated to each symbol. However, (6, 2, 3) has higher throughput than
(4, 2, 2) since (6, 2, 3) transmits more symbols.

133
Electronics 2019, 8, 758

(a) (b)

Figure 8. Throughput and BER performance of proposed scheme according to the number of users and
antennas: (a) Throughput performance; (b) BER performance.

Figure 9a, b show the difference of the performance between the cases of linear MIMO detection
and non-linear MIMO detection. The method with ML detection outperforms the method with ZF
detection. The ML technique has better detection performance than the ZF technique before performing
SIC on each antenna. Additionally, the ML detection technique mitigates the error propagation
compared to the ZF detection scheme.
As a result, the overall simulation results show better performance compared to conventional
schemes by using two dimensions simultaneously. In the proposed scheme, by exploiting both power
and space domains at the same time, the transmitted symbol is increased. The results show that the
superiority of the proposed scheme, and the proposed scheme uses the dimensions appropriately.

(a) (b)

Figure 9. Throughput and BER performance of ML detection and ZF detection: (a) Throughput
performance; (b) BER performance.

134
Electronics 2019, 8, 758

5. Implementation Issue
This section presents some additional considerations for system application and limitation under
practical constraints. In addition, this section gives some ideas for additional performance gain and
presents some methods to reduce some negative effects in the proposed scheme.

5.1. Complexity
In the proposed scheme, the additional implementation complexity in typical NOMA is not
needed since SIC is performed for the own user’s data. However, even though the superposed signals
are the user’s own data, the symbols with small allocation power in superposed signal have poor
BER performance. To solve this problem, a detection scheme with better performance should be
used. However, a non-linear algorithm has high complexity. For this problem, a complexity-reduced
detection algorithm can be considered. The main consideration is achieving higher throughput with
lower complexity.

5.2. SIC-Error Propagation

SIC is often assumed to be successful with perfect decoding. However, for systems with actual
modulation and coding, decoding error inevitably occurs, causing error propagation and remarkable
performance degradation. As shown in the simulation results, there are performance differences with
and without SIC errors. Ultimately, well-designed system for the proposed scheme which avoids error
propagation and decoding error should be considered for optimum performance.

5.3. Power Allocation

The achievable throughput is affected by the transmit power allocation. If it is assumed that two
symbols are superposed in one signal, we should consider how much power should be allocated for
each symbol. Basically, allocating more power to strong symbols can reduce the error propagation.
However, allocating more power to strong symbols increases the error probability of weak symbols
since the power of a weak symbol is too low. As a result, power allocation to each symbol should be
considered according to the number of the symbols at the superposed signal subject to total power
constraint.

5.4. Optimal Parameters

In the proposed scheme, if the number of the symbols in a superposed signal is increased, the sum
throughput can be increased linearly. However, the sum throughput cannot be increased without limit
since there is a limitation that a receiver can detect the symbols. As more symbols are transmitted,
the power allocated to the symbols is reduced. Therefore, if too many symbols are transmitted,
the receiver can not detect each symbol. Furthermore, there is also the degradation of sum throughput
because of the interference in performing SIC. Therefore, the important issue is to ﬁnd a near-optimal
parameter between the number of users and the number of symbols in a superposed signal in the
overall system.

6. Conclusions
This paper suggests multi-dimensionality and a methodology to improve the throughput in an
MU-MIMO system. This paper also presents the transceiver structure of the proposed scheme. As a
result, the multiple dimensions (space and power) are exploited at the same time, and the overasll
system spectral efﬁciency is improved. If more dimensions are used without degradation or with a
little tradeoff in performance, the system throughput can be increased. Also, various system models
can be implemented by using additional dimensions.

Author Contributions: J.-G.H. proposed throughput enchancement scheme for MU-MIMO downlink channel
and processed the simulation; J.-H.R. analyzed the simulation results and made the ﬁgure; H.-K.S. reviewed

135
Electronics 2019, 8, 758

the algorithm and provided the experimental materials for better computational simulations and revised critical
errors of the manuscript
Funding: This research was supported by Institute for Information & communications Technology
Promotion(IITP) grant funded by the Korea government(MSIT) (No.2017-0-00217, Development of
Immersive Signage Based on Variable Transparency and Multiple Layers) and was supported by the
MSIT(Ministry of Science and ICT), Korea, under the ITRC(Information Technology Research Center) support
program(IITP-2018-2018-0-01423) supervised by the IITP(Institute for Information & communications Technology
Promotion).
Conﬂicts of Interest: The authors declare no conﬂict of interest.

References
1. Gupta, A.; Jha, R.K. A survey of 5G network: Architecture and emerging technologies. IEEE Access 2015, 3,
1206–1232. [CrossRef]
2. Goldsmith, A. Capacity Limits of MIMO Channels. IEEE JSAC 2003, 5, 684–702. [CrossRef]
3. Weingarten, H.; Steinberg, Y.; Shamai, S. The Capacity Region of the Gaussian MIMO Broadcast Channel.
IEEE Trans. Inf. Theory 2006, 52, 3936–3964. [CrossRef]
4. Caire, G.; Shamai, S. On the achievable throughput of a multi-antenna Gaussian broadcast channel. IEEE
Trans. Inf. Theory 2003, 43, 1691–1706. [CrossRef]
5. Angelis, C.T.; Chronopoulos, S.K. System Performance of an LTE MIMO Downlink in Various Fading
Environments. In Proceedings of the Ambient Media and Systems, Berlin, Germany, 27–28 January 2011;
pp. 36–43.
6. Choi, L.U.; Murch, R.D. A transmit preprocessing technique for multiuser MIMO systems using a
decomposition approach. IEEE Trans. Wirel. Commun. 2004, 3, 20–24. [CrossRef]
7. Spencer, Q.H.; Swindlehurst, A.L.; Haardt, M. Zero-forcing methods for downlink spatial multiplexing in
multiuser MIMO channels. IEEE Trans. Signal Process. 2004, 2, 461–471. [CrossRef]
8. Viswanathan, H.; Venkatesan, S.; Huang, H. Downlink capacity evaluation of cellular networks with
known-interference cancellation. IEEE J. Sel. Areas Commun. 2003, 21, 802–811. [CrossRef]
9. Votis, C.; Christoﬁlakis, V.; Raptis, V.; Tatsis, G.; Chronopoulos, S.K.; Kostarakis, P. Design and analysis of
a multiple-output transmitter based on DDS architecture for modern wireless communications. AIP Conf.
Proc. 2010 1203, 421–426.
10. Jindal, N.; Goldsmith, A. Dirty-paper coding versus TDMA for MIMO broadcast channels. IEEE Trans. Inf.
Theory 2005, 51, 1783–1794. [CrossRef]
11. Rim, M. Multi-user downlink beamforming with multiple transmit and receive antennas. Electron. Lett.
2002, 38, 1725–1726. [CrossRef]
12. Chen, R.; Andrews, J.G.; Heath, R.W. Multiuser Space-Time Block Coded MIMO System with Downlink
Precoding. In Proceedings of the 2004 IEEE International Conference on Communications, Paris, France,
20–24 June 2004
13. Shen, Z.; Chen, R.; Andrews, J.G.; Heath, R.W.; Evans, B.L. Low complexity user selection algorithms
for multiuser MIMO systems with block diagonalization. IEEE Trans. Signal Process. 2006, 54, 3658–3663.
[CrossRef]
14. Saito, Y.; Kishiyama, Y.; Benjebbour, A.; Nakamura, T.; Li, A.; Higuchi, K. Non-orthogonal multiple access
(NOMA) for cellular future radio access. In Proceedings of the 2013 IEEE 77th Vehicular Technology
Conference, Dresden, Germany, 2–5 June 2013
15. Ali, M.S.; Hossain, E.; Kim, D. Non-orthogonal multiple access (NOMA) for downlink multiuser MIMO
systems: User clustering, beamforming, and power allocation. IEEE Access 2017, 5, 565–577. [CrossRef]
16. Islam, S.M.R.; Avazov, N.; Dobre, O.A.; Kwak, K.-S. Power-domain non-orthogonal multiple access (NOMA)
in 5G systems: Potentials and challenges. IEEE Commun. Surv. Tutor. 2016, 19, 721–742. [CrossRef]
17. Zeng, M.; Yadav, A.; Dobre, O.A.; Tsiropoulos, G.I.; Poor, H.V. On the sum rate of MIMO-NOMA and
MIMO-OMA systems. IEEE Wirel. Commun. Lett 2017, 6, 534–537. [CrossRef]
18. Ding, Z.; Adachi, F.; Poor, H.V. The application of MIMO to nonorthogonal multiple access. IEEE Trans.
Wirel. Commun. 2016, 15, 537–552. [CrossRef]
19. Ding, Z.; Schober, R.; Poor, H.V. A general MIMO framework for NOMA downlink and uplink transmission
based on signal alignment. IEEE Trans. Wirel. Commun. 2016, 15, 4438–4454. [CrossRef]

136
Electronics 2019, 8, 758

20. Choi, Y.I.; Lee, J.; Rim, M.; Kang, C.G. On the performance of beam division non orthogonal multiple access
for FDD-based large-scale multi-user MIMO systems. IEEE Trans. Wirel. Commun. 2017, 15, 5077–5089.
[CrossRef]
21. Yang, Q.; Wang, H.-M.; Ng, D.W.K.; Lee, M.H. NOMA in downlink SDMA with limited feddback:
Performance analysis and optimization. IEEE J. Sel. Areas Commun. 2017, 35, 2281–2294. [CrossRef]
22. Van Nee, R.; Prasad, R. OFDM for Wireless Multimedia Communications; Artech House, Inc.: Norwood, MA,
USA, 2000
23. Chronopoulos, S.K.; Votis, C.; Raptis, V.; Tatsis, G.; Kostarakis, P. In depth analysis of noise effects in
orthogonal frequency division multiplexing systems, utilising a large number of subcarriers. AIP Conf. Proc.
2010, 1203, 967–972.

137
electronics
Article
LOS-Based Equal Gain Transmission and Combining
in General Frequency-Selective Ricean Massive
MIMO Channels
Qiuna Yan 1,2 , Yu Sun 1 and Dian-Wu Yue 1, *
1 School of Information Science and Technology, Dalian Maritime University, Dalian 116026, China;
[email protected] (Q.Y.); [email protected] (Y.S.)
2 State Key Laboratory of Integrated Services Networks, Xidian University, Xi’an 710071, China
* Correspondence: [email protected]; Tel.: +86-411-8472-4261

Received: 13 December 2018; Accepted: 7 January 2019; Published: 10 January 2019

Abstract: In general frequency-selective Ricean fading environments with doubly-ended spatial

correlation, this paper investigates the spectral efﬁciency of a broadband massive multiple-input
multiple-output (MIMO) system. In particular, in order to reduce overhead of channel estimation
effectively, it proposes a scheme of equal gain transmission and combining, which is only based on
line- of-sight (LOS) component and has low hardware complexity. With the scheme, several interesting
transmit power scaling properties without and with spatial correlation are derived when the number
of antennas at the transmitter or the number of antennas at the receiver grows in an unlimited way.
Furthermore, the asymptotical rate analysis is extended to the cooperative relaying scenarios with
decode-and-forward and amplify-and-forward protocols, respectively, and then two novel power
scaling laws are given.

Keywords: massive MIMO; beamforming; line-of-sight; Ricean fading; frequency-selective; power scaling

1. Introduction
Recently, massive multiple-input multiple-output (MIMO) has attracted great interest in both
academia and industry, and has been a promising solution to meet the demanding spectral efficiency
requirement of 5G systems [1]. Its promising benefits includes significant increase of both spectral
and energy efficiencies [2–4]. Interestingly, the two benefits of massive MIMO can be achieved
by maximum-ration transmission/maximum-ration combining (MRT/MRC) or zero-forcing (ZF)
precoding/detection [3,5].
With MRT/MRC and ZF linear processing, many scholars have given various asymptotic
performance analyses. In particular, the power scaling law in the limit of the large number of antennas
has been widely studied in order to quantify the power savings. For Rayleigh and Ricean fading
environments, with MRC and ZF detectors, authors in [3,6] analyzed uplink massive MIMO system
performance. If perfect channel state information (CSI) is available, they showed that, when the
number of base station (BS) antennas grows large and the transmit power of each user is scaled down
proportionally to it, the ergodic achievable rate can asymptotically be equal to a positive constant.
In order to obtain the needed CSI, channel estimation must be obviously carried on [7]. However,
the channel estimation will result in not only heavy overhead but also pilot contamination in multi-cells,
which will become a serious problem [2,8]. For a point to point massive MIMO system in Ricean
fading, to reduce the heavy overhead to estimate the CSI, we investigated a scheme with equal gain
transmission /equal gain combining (EGT/EGC), which is only based on the line-of-sight (LOS)
component (or say specular component) and has low hardware complexity [9]. It was showed that,
with this scheme, the ergodic achievable rate can converge to that of the corresponding MRT/MRC

Electronics 2019, 8, 79; doi:10.3390/electronics8010079 139 www.mdpi.com/journal/electronics

Electronics 2019, 8, 79

based on the perfect CSI as the two numbers of antennas at the transmitter and receiver go to infinity.
After that, we further considered the novel linear processing scheme for a downlink or uplink multiuser
massive MIMO system [10,11] and showed that each user in the downlink or uplink system can
have asymptotically the same rate as in the single-user case when the number of BS antennas goes
without bound.
It should be pointed out that the above-mentioned results with the novel scheme have only
considered uncorrelated Ricean frequency-flat fading channels without a relay [12]. Recently, we tried
to develop our analysis to frequency-selective Ricean fading channels [13], but only for a very simple
and special scenario [14]. The EGT/EGC linear transmission scheme is very attractive for massive
MIMO systems since it enables low-complexity and inexpensive hardware [15–18]. Motivated by
these facts, in this paper, we make use of a comparatively complicated and general frequency-selective
Ricean fading channel model [14,19,20] to investigate further the LOS-based EGT/EGC scheme.
We firstly derive several interesting power scaling properties for broadband massive MIMO systems
with and without spatial correlation. In particular, it is shown that the ergodic achievable rate of
LOS-based EGT/EGC scheme can have the same asymptotic value as the ergodic achievable rate of
the whole CSI-based MRT/MRC scheme if the two numbers of transmit and receive antennas go
without bound and with a fixed ratio. Then, we extend our asymptotical performance analysis to the
cooperative relaying scenarios with decode-and-forward (DF) protocol and with amplify-and-forward
(AF) protocol, respectively, and obtain two novel power scaling laws for the two scenarios. In particular,
it is also shown that the ergodic achievable rate of LOS-based scheme can have the same asymptotic
value as the ergodic achievable rate of the CSI-based scheme if the number of source antennas and
the two numbers of transmit and receive relay antennas go without bound and with two fixed
ratios, respectively.
The manuscript is organized as follows: in Section 2, the system model is introduced. In Section 3,
the proposed LOS-based transmission scheme is presented and its power scaling law without
correlation and with correlation is derived, respectively. Extension of our analysis to a cooperative
relaying system is given in Section 4. In Section 5, the analysis results are verified by simulation.
Finally, in Section 6, some concluding remarks are given.
Notation: boldface lower and upper case letters denote column vectors and matrices, respectively.
The superscripts (·)† and (·)T stand for conjugate-transpose and transpose operations, respectively.
The expectation operator is denoted by E{·}. α ∼ CN(0, δ2 ) stands for a circularly symmetric complex
Gaussian variable α which has zero mean and variance δ2 .

2. System Model
Since a set of parallel independent frequency flat MIMO channels can be used to describe a
frequency selective MIMO channel, we start with introducing the frequency flat channels [14,19].
For a point-to-point MIMO system over frequency-flat channels, we assume that it has N transmit
antennas and M receive antennas. Then, we can represent a M × 1 received signal vector as
√
y= pH0 x + z, (1)

where z denotes the additive white Gaussian noise (AWGN) vector that has zero-mean and covariance
matrix σ2 I M with I M being the M × M identity matrix, x denotes the transmitted signal vector,
M,N
H0 = [ hmn ]m,n =1 stands for the M × N channel matrix whose element hmn denotes the channel gain
between the m-th antenna at the receiver and the n-th antenna at the transmitter, and p is the average
transmitted power. The channel matrix H0 under Ricean fading consists of a LOS matrix and a scattered
matrix, i.e.,

140
Electronics 2019, 8, 79

√ √
H0 = κ̄0 H0 + 6 0,
κ̃0 H (2)

where κ̄0 = 1+κ0κ0 , κ̃0 = 1

1+κ 0 . Note that κ0 > 0 represents the Ricean K-factor. The LOS matrix H0 can
be written as
H0 = r0 tT0 . (3)

Here, r0 denotes the specular array response at the receiver and can be expressed as

r0 = [1, e j2πdr sin(θ ) , . . . , e j2π ( M−1)dr ,sin(θ ) ]T , (4)

where θ is the angle of arrival of the LOS component and dr is the antenna spacing normalized by
wavelength at the receiver. Similarly, t0 denotes the specular array response at the transmitter and can
be given by
t0 = [1, e j2πdt sin(φ) , . . . , e j2π ( N −1)dt sin(φ) ]T (5)

where φ is the angle of departure of the LOS component and dt is the antenna spacing normalized by
6 0 , [H
wavelength at the transmitter. The entries in the scattering matrix H 6 0 ]mn ∼ CN(0, 1), i.e., they are
circular complex Gaussian random variables with zero mean and unit variance. Furthermore,
we assume that they are independent and identically distributed (i.i.d).
Now, we are concerned with a broadband orthogonal frequency-division multiplexing
(OFDM)-MIMO system with K subcarriers, where ideal OFDM transmission with proper cyclic preﬁx
extension is assumed. For the k-th subcarrier, the input–output relationship is expressed as
√
y= pHx + z, (6)

where x is just the normalized signal vector, z is the AWGN vector, and H is the channel matrix.
The channel matrix can be given by as

L −1
k
H= ∑ ρ H exp(− j2π K ), (7)
=0

L −1 2
where L represents the channel delay spread, {ρ2 } is the power delay proﬁle satisfying ∑= 0 ρ = 1,
and H stands for the channel matrix at time delay . Furthermore, H , = 0, 1, · · · , L − 1 are mutually
uncorrelated, Ricean distributed, and can be expressed as in Label (2)
√ -
H = κ̄ H + 6 .
κ̃ H (8)

6 is also modeled as a random matrix consisting

In particular, H = r tT is just as in Label (3) and H
of i.i.d. elements.

3. LOS-Based EGT/EGC and Power Scaling Laws

3.1. The Scenario without Correlation

The scattered component of k-th subcarrier’s channel matrix can be described as

L −1 -
6 =
H ∑ ρ 6 exp(− j2π k ).
κ̃ H (9)
=0
K

6 is not available, but the LOS component

Now, it is assumed that H

141
Electronics 2019, 8, 79

L −1 √ k
H= ∑ ρ κ̄ H exp(− j2π )
K
(10)
=0

can be available. In what follows, by employing only H, we will present a linear processing scheme
with EGT/EGC and then compare it with the MRT/MRC scheme based on the perfect CSI.
6 is not available, a couple of the normalized weighting vectors wt and wr can be chosen in
Since H
such a way that the effective output signal-to-noise ratio (SNR) can become maximum. The largest
† †
eigenvalue of matrix H H is now denoted by λmax (H H) . Due to the fact that wr† zk ∼ CN(0, σ2 ),
the effective output signal-to-interference-plus-noise ratio (SINR) can be described as

†
(k) p|wr† HwTt † |2 pλmax (H H)
γS = = . (11)
6 T† | 2 + σ 2
p|wr† Hw p | w † 6 T† 2
r Hwt | + σ
2
t

We denote by RS the ergodic achievable rate of the LOS-based scheme. Then,

K −1 K −1
1 (k) 1 (k)
RS = E{
K ∑ log2 (1 + γS )} =
K ∑ RS , (12)
k =0 k =0

(k) (k)
where RS = E{log2 (1 + γS )}. We have the following results through the derivation.

(k) L −1 2
†
Lemma 1. Deﬁne λmax = λmax (H H) and κ̃S = ∑= 0 ρ κ̃ . Then,

(k) (k)
pλmax (k) pλ
log2 (1 + ) ≤ RS ≤ log2 (1 + max ). (13)
pκ̃S + σ2 σ2

Proof of Lemma 1. Regarding the ergodic achievable rate of the k-th subcarrier, it is easy for us to
derive its following lower bound with the help of the well-known Jensen’s inequality:

(k)
(k) 1 pλmax
RS ≥ log2 (1 + (k)
) = log2 (1 + L −1 2
), (14)
p ∑= 0 ρ E + σ
2
E(1/γS )

6 wT† |2 = κ̃ for 0 ≤ ≤ L − 1. Thus,

where E = E|wr† H t

(k)
pλmax
k
RC ≥ log2 (1 + ). (15)
pκ̃S + σ2

Moreover, we can obtain from Label (11) that

(k) (k)
(k) pλmax pλmax
RS = E log2 (1 + ) ≤ log2 (1 + ). (16)
† 6
p|wr HwTt † |2 + σ2 σ2

Thus, Lemma 1 holds.

L −1 2
Lemma 2. Let κ̄U = ∑= 0 ρ κ̄ . and κ̄ L = max{ ρ κ̄ , 0 ≤ ≤ L − 1}. Then,
2

(k)
λmax
κ̄ L ≤ lim ≤ κ̄U . (17)
MN →∞ MN

Proof of Lemma 2. For 0 ≤ ≤ L − 1 and 0 ≤ b ≤ L − 1, we deﬁne ϕb = 2πdr (sin(θb ) − sin(θ )),
and then have that

142
Electronics 2019, 8, 79

H̄† H̄b
= b Ub , (18)
MN
where
r† rb 1, for l = b;
b = = 1−e jMϕb (19)
M , for = b
M (1−e jϕb )

and
tT † tTb N,N
Ub = = [u an ] a,n =1 (20)
N
with u an = N1 e2πdt (( a−1) sin(θb )−(n−1) sin(θ )) . Now, suppose that M ≥ N. Noting that lim M→∞ b = 0
for = b and tr(U ) = 1, we can obtain that

† L −1 †
λmax (H H) H H
lim
MN →∞ MN
≤ ∑ ρ2 κ̄ MNlim
→∞
λmax (
MN
) = κ̄U . (21)
=0

Moreover, we also have

†
λmax (H H)
lim ≥ κ̄ L (22)
MN →∞ MN
since, for any , we can get when wt = √t and wr = √r
N M

λmax (H H)
† 6 T† | 2
|wr† Hw
lim ≥ lim t
= κ̄ . (23)
MN →∞ MN MN →∞ MN

Therefore, Lemma 2 holds when M ≥ N. When N ≥ M, we can also similarly prove that Lemma 2
† †
holds, based on the fact that λmax (H H) = λmax (HH ).

Proposition 1. If E = MN p be ﬁxed as MN → ∞, then we have

Eκ̄ L Eκ̄U
log2 (1 + )≤ lim RS ≤ log2 (1 + ). (24)
σ2 MN →∞ σ2

Proof of Proposition 1. If E = MN p is ﬁxed when MN → ∞, we readily show that Proposition 1

holds by using Lemmas 1 and 2.

Remark 1. This proposition gives the lower and upper bounds of the ergodic achievable rate of the LOS-based
scheme. In the following special cases, we can obtain further the exact expressions of the ergodic achievable rate.

Corollary 1. When N = 1, we have, if E = Mp be ﬁxed as M → ∞,

Eκ̄U
lim RS = log2 (1 + ). (25)
M→∞ σ2

Similarly, when M = 1, we also have if E = N p be ﬁxed as N → ∞

Eκ̄U
lim RS = log2 (1 + ). (26)
N →∞ σ2
†
λmax (H H)
Proof of Corollary 1. When M = 1 or N = 1, due to the fact that lim MN →∞ MN = κ̄U , it easily
follows that lim MN →∞ RS = log2 (1 + Eσκ̄2U ).

143
Electronics 2019, 8, 79

Corollary 2. If E = MN p be ﬁxed as M → ∞ and N → ∞, then we have

Eκ̄ L
lim RS = log2 (1 + ). (27)
M,N →∞ σ2

Proof of Corollary 2. Without loss of generality, H can be rewritten as

L −1 √ k
H= ∑ ρ κ̄ r tT exp(− j2π ), (28)
=0
K
√ √ √
where ρ0 κ̄0 ≥ ρ1 κ̄1 ≥ · · · ≥ ρ L−1 κ̄ L−1 . We can rewrite H in a matrix form as

H = Ar DATt , (29)
√
where D is a L × L diagonal matrix, [D]ll = ρ MN κ̄ , and Ar and At are deﬁned as follows:

1
A r = √ [ r0 , r1 , . . . , r L −1 ] (30)
M

and
1 k k
At = √ [t0 , t1 exp(− j2π ), . . . , t L−1 exp(− j2π ( L − 1))]. (31)
N K K
Since both {r0 , r1 , . . . , r L−1 } and {t0 , t1 , . . . , t L−1 } are orthogonal vector sets when M → ∞ and
N → ∞ [21], Ar and At are asymptotically unitary matrices. For matrix H, thus we can form a singular
value decomposition (SVD) as follows

T
H = UΣV† = [Ar |Ar⊥ ]Σ[At |A⊥
t ] , (32)

where Σ is a diagonal matrix including all singular values on its diagonal, i.e.,
√
ρ MN κ̄ , for 0 ≤ l ≤ L − 1,
[Σ]ll = (33)
0, for l > L − 1.

Then,
(k)
λmax
lim = ρ20 κ̄0 = κ̄ L . (34)
M,N →∞ MN
Thus, we ﬁnally obtain the desired result.

On the other hand, suppose that the perfect CSI is known, i.e., both of the LOS and scattered
components are available at the transmitter and the receiver. Then, the weighting vectors wt and wr
should be chosen in such a way that the exact output SNR is maximized. Thus, the resulting output
SNR can be written as [11]
(k) p
γP = 2 λmax (H† H), (35)
σ
where λmax (H† H) stands for the largest eigenvalue of H† H. For the MRT/MRC scheme based on
(k)
the perfect CSI, let R P represent its ergodic achievable rate, i.e., R P = E{ K1 ∑kK=−01 log2 (1 + γP )}.
Now, we obtain the following power scaling law.

Proposition 2. When M → ∞ and N → ∞, suppose that E = MN p is ﬁxed and N/M → μ . We have that

lim RP = lim RS . (36)

M,N →∞ M,N →∞

144
Electronics 2019, 8, 79

6 we can have
Proof of Proposition 2. Due to the fact H = H + H,

1 1 6 )† (H + H6 )]
[H† H] = [(H + H
M M
† †6 6 †H H 6 †H
6
H H H H H
= + + + . (37)
M M M M
If we let
6
†
H H N,N
G= = [ guv ]u,v =1 , (38)
M
it follows that
M
1 † 6 ]kv .
guv =
M ∑ [H ]uk [H (39)
k =1
† 6 ]kv ∼ CN(0, δ2 ) with
With the aid of (9) and (10), we can have that |[H ]uk |2 ≤ κ̄S ≤ 1, and [H
δ2 6
= κ̃S ≤ 1. As [H]kv , 1 ≤ k ≤ M are independent each other, we know that

1
guv ∼ CN(0, σuv
2
), σuv
2
≤ . (40)
M
Thus, it can follow that, when M → ∞, G → Q, where Q denotes a matrix with all zero elements.
Similarly, we can have that, if M → ∞,

6 †H
H
G† = → Q. (41)
M
Now, M is assumed to be large enough. Then, we certainly have that

H† H
†
H H H6 †H
6
λmax ( ) = λmax ( + )
M M M
†
H H 6 †H
H 6
≤ λmax ( ) + λmax ( ). (42)
M M
When M → ∞, suppose that N/M → μ. Then, we easily derive from ([22], Theorem 2.37),
6 ]mn ∼ CN(0, κ̃S )
only noting that [ H

1 6† 6 √
λmax ( H H) → κ̃S (1 + μ)2 . (43)
M
Thus, we further get

1 1 † √
λmax ( H† H) ≤ λmax ( H H) + κ̃S (1 + μ)2 /N. (44)
MN MN
In addition, we can obtain

1 1 †
λmax ( H† H) ≥ λmax ( H H ). (45)
MN MN
When M → ∞ and N → ∞, we can get, by combining (44) with (45),

(k)
1 λmax
λmax ( H† H) → lim . (46)
MN M,N →∞ MN

It should be noticed that

145
Electronics 2019, 8, 79

K
1 p
RP =
K ∑ E log2 (1 + σ2 λmax (H† (k)H(k)))
k =1

1 K
pMN H† (k )H(k )
=
K ∑ E log2 (1 + σ 2
λmax (
MN
)). (47)
k =1

Therefore, when M → ∞ and N → ∞, if E = pMN is ﬁxed, we can have ﬁnally

lim RP = lim RS . (48)

M,N →∞ M,N →∞

Thus, Proposition 2 holds.

Remark 2. This proposition implies that, when the two numbers of antennas at the transmitter and the receiver
grow large with a ﬁxed ratio, the ergodic achievable rate of the LOS-based scheme has the same asymptotic value
as the ergodic achievable rate of the whole CSI-based scheme.

3.2. The Scenario with Correlation

Now, we consider extending the proposed LOS-based ECT/EGC without spatial correlation to the
scenario in which there exists doubly-ended spatial correlation. The MIMO system model presented
6 , = 0, 1, 2, · · · , L − 1 is now modeled as doubly-correlated
in Section 2 is necessarily modiﬁed. H
Rayleigh fading, with transmit and receive correlation matrices Ψ and Φ , i.e., [12],

H 6 ω [Ψ ]1/2 ,
6 = [Φ ]1/2 H (49)

where H6 ω is an i.i.d. matrix with each entry ∼ CN(0, 1). Since the scattered component of the channel

matrix remains unchanged, the needed weighting vectors wt and wr should also remain unchanged.
The ergodic achievable rate of the scenario with spatial correlation is denoted by

K −1
1 (k) (k) (k)
RC =
K ∑ RC , RC = E{log2 (1 + γC )}. (50)
k =0

With respect to RC , we have the following results by a similar derivation.

Lemma 3.
(k) (k)
pλmax (k) pλ
log2 (1 + ) ≤ RC ≤ log2 (1 + max ), (51)
pκ̃C + σ2 σ2
L −1 2 1 1
T† 2
where κ̃C = ∑= 0 ρ κ̃ wr [ Φ ] [ Ψ ] wt .
† 2 2 2

Proposition 3. If E = MN p is ﬁxed as MN → ∞, then

lim RC = lim RS . (52)

MN →∞ MN →∞

Remark 3. This proposition implies that the two ergodic achievable rates with and without spatial correlation
have the same asymptotic value when MN goes without bound.

4. Cooperative Relaying Systems

4.1. The Scenario with Decode-and-Forward Protocol

Ricean fading often happens in cooperative MIMO systems [23]. Therefore, we can use a
Ricean MIMO channel model to describe both the source-relay and relay-destination links. Still in

146
Electronics 2019, 8, 79

frequency-selective Ricean fading environments, we especially study a classical cooperative relay

system with a source node, a destination node and a relay node. The relay node can be equipped
with a large-scale antenna array while both the source node and the destination node can be also
equipped with a large-scale antenna array. The cooperative system with the three nodes is assumed
to operate in a half-duplex mode, and the replay node employs the DF protocol for transmission.
Each transmission for the system can be completed through two stages. Obviously, the cooperative
system is a composite of two massive MIMO subsystems: one subsystem working at the ﬁrst stage
and the other subsystem at the second stage. Therefore, the rate analysis results mentioned above can
be applied to the cooperative relay system.
We suppose that the relay makes use of M antennas to receive and transmit data and also
suppose that the source has N1 antennas and the destination has N2 antennas. We denote the average
transmitted power at the source and the relay by p1 and p2 , respectively. In addition, we still let R P
and RS represent the ergodic achievable rates for the whole CSI-based MRT/MRC scheme and the
LOS-based EGT/EGC scheme, respectively. Then, we can obtain the following power scaling property
for the cooperative system.

Proposition 4. When M → ∞ and N1 → ∞, let E1 = MN1 p1 be ﬁxed and N1 /M → μ1 for the source-relay
link. When M → ∞ and N2 → ∞, let E2 = MN2 p2 be ﬁxed and N2 /M → μ2 for the relay-destination
link. Then,
lim RP = lim RS . (53)
M,N1 ,N2 →∞ M,N1 ,N2 →∞

Proof of Proposition 4. From [24], it follows that the ergodic achievable rate with the perfect SCI-based
MRT/MRC is written as
(1) (2)
R P = min{ R P /2, R P /2}, (54)
(1) (2)
where R P and R P are the corresponding ergodic achievable rates of the source-relay and
relay-destination transmission links, respectively. Similarly, we also have that the ergodic achievable
rate with the only LOS-based EGT/EGC can be given by

(1) (2)
RS = min{ RS /2, RS /2}, (55)

(1) (2)
where RS and RS are the corresponding ergodic achievable rates of the source-relay and
relay-destination transmission links, respectively. Under the condition of Proposition 4, we get
by Proposition 2
(1) (1)
lim RP = lim RS (56)
M,N1 →∞ M,N1 →∞

and
(2) (2)
lim RP = lim RS . (57)
M,N2 →∞ M,N2 →∞

Thus, it is easy to obtain the desired result (53).

Remark 4. This proposition implies that, when the number of source antennas and the two numbers of relay
antennas at the transmitter and the receiver grow large with ﬁxed ratios, the ergodic achievable rate of the
LOS-based scheme also has the same asymptotic value as the ergodic achievable rate of the whole CSI-based scheme.

4.2. The Scenario with Amplify-and-Forward Protocol

The DF is a regenerative relaying transmission strategy. Now, we consider employing a
nonregenerative strategy involving AF to replace the DF. Then, we can have a power scaling law
as follows.

147
Electronics 2019, 8, 79

Proposition 5. Suppose that N1 = N2 = 1. When M → ∞, let E1 = Mp1 and E2 = MN2 p2 be ﬁxed. Then,

E1 κ̄U1 E2 κ̄U2 E κ̄ E κ̄
lim RS = log2 (1 + ( · )/( 1 2U1 + 2 2U2 + 1)). (58)
M→∞ σ12 σ22 σ1 σ2

(k) (k)
Proof of Proposition 5. We denote by γS1 and γS2 the output instantaneous SNR of the source-relay
and relay-destination links, respectively. From [25], we obtain that

K −1
1 (k)
RS =
K ∑ E{log2 (1 + γS )}
k =0
K −1
1 (k) (k) (k) (k)
=
K ∑ E{log2 (1 + (γS1 · γS2 )/(γS1 + γS2 + 1))}. (59)
k =0

Based on the proof of Lemma 2, we can have the following asymptotical SNR expressions

(k) E1 κ̄U1
lim γS1 = (60)
M→∞ σ12

and
(k) E1 κ̄U2
lim γS2 = (61)
M→∞ σ22
Thus, the power scaling law (58) holds.

5. Simulation Results
For OFDM-MIMO systems in frequency-selective Ricean fading channels, we in this section
provide our analytical results and simulation results. In all simulations, we assume that all of these
spacings between adjacent antennas at the transmitter and the receiver are 0.5. We set the number of
subcarriers K = 256, the channel delay spread L = 3, and the noise variance as σ2 = 1. In addition,
we let ρ20 = ρ21 = ρ22 = 1/3, θ0 = φ0 = π/6, θ1 = φ1 = π/4, and θ2 = φ2 = π/3. In Figures 1 and 2,
the Ricean K-factor κ is fixed and is equal to 5 dB.
In order to verify Propositions 1 and 3, we consider firstly the scenario with spatial correlation
when N = 3 and E = 20 dB. The spatial correlation among antennas is assumed to follow the
exponential model, i.e., the correlation magnitude between antenna p and q can be determined by
c( p, q) = g| p−q| , where g denotes the correlation coefficient [12]. Therefore, we represent the correlation
matrices (i, j)-th of Φ by [Φ ]ij = ( gr )|i− j| and (i, j)-th of Ψ by [Ψ ]ij = ( gt )|i− j| , respectively,
= 0, 1, 2. Moreover, we set gr0 = gr1 = gr2 = gr and gt0 = gt1 = gt2 = gt . For the correlation coefficients
gt = gr = g = 0, 0.3, 0.6, 0.9, as M increases from 6 to 60, Figure 1 provides a curve of the exact average
rate RC and two curves of the upper and lower bounds of RS . It can be observed that the exact ergodic
rate RC increases as the number of receive antenna M grows large, and is always between the two
bounds of RS . As both of the correlation coefficients ( gt , gr ) increase, RC is closer to the upper bound,
and becomes higher and higher than RS . This indicates that, compared to the uncorrelated scenario,
the presence of spatial correlation results in improving the rate performance under the LOS-based
EGT/EGC scheme. Therefore, if the LOS-based scheme can be employed, we can achieve performance
benefits from the spatial correlation, which is obviously different from the traditional point of view.
This implies that it would be practical if a large-scale antenna array is compactly arranged.

148
Electronics 2019, 8, 79

6.5

5.5
Average Rate (Bits/s/Hz)

4.5

4
gt=0 gr=0
3.5
gt=0.3 gr=0.3
3 g =0.6 g =0.6
t r
2.5 gt=0.9 gr=0.9
Upper Bound
2
Lower Bound
1.5
10 20 30 40 50 60
Number of Receive Antennas

Figure 1. The ergodic achievable rate versus the number of receive antennas for comparing the case
with correlation and the case without correlation.

E=30dB
Average Rate (Bits/s/Hz)

4
E=20dB

E=10dB
0
CSI -based BF
LOS-based BF
-2
10 15 20 25 30 35 40 45 50 55 60
Number of Receive Antennas
Figure 2. The ergodic achievable rate versus the number of receive antennas for comparing the
LOS-based scheme with the whole CSI-based scheme.

149
Electronics 2019, 8, 79

Next, we consider validating Proposition 2. For that, we need to compare the ergodic achievable
rate of the proposed LOS-based EGT/EGC scheme with that of the perfect CSI-based MRT/MRC
scheme. We set μ = 1/2 when the numbers of antennas at the transmitter and receiver grow large.
For the parameter E = 10, 20, 30 dB, as M increases from 6 to 60, Figure 2 plots the two ergodic
achievable rates, RS and R P . It can be found from Figure 2 that both of the ergodic achievable rates can
tend to the same limit results for the given values of E. However, with an increase of E, the speed of
rate convergence appears to be slower and slower.
Finally, we pay our attention to the classical DF cooperative relay system consisting of the
source-relay and relay-destination links and set the identical parameters mentioned above in the two
links. When N1 = N2 = 6, as M increases from 6 to 60, Figure 3 plots the two average rates RS
and R P for κ = 5, 15 dB. It can be found from Figure 3 that, with an increasing κ, both RS and R P
improve and RS is closer to R P . It should be noticed that R P denotes the average rate for the traditional
linear processing scheme based on the whole CSI as considered in [6]. For obtaining a comprehensive
comparison with the scheme based on the whole CSI in Rayleigh fading discussed in [24], Figure 3 also
includes a rate curve which corresponds to κ = −∞ dB. Interestingly, with κ = 5 dB, the LOS-based
scheme always obviously outperforms the scheme based on the whole CSI in Rayleigh fading.

1.6

1.5

1.4
Average Rate (Bits/s/Hz)

1.3

1.2

1.1

0.9
CSI: =15 dB
0.8 LOS: =15 dB
CSI: =5 dB
0.7 LOS: =5 dB
CSI: =- dB
0.6
10 15 20 25 30 35 40 45 50 55 60
Number of Receive Antennas
Figure 3. The ergodic achievable rate versus the number of relay antennas for comparing the case with
Ricean fading and the case with Rayleigh fading.

6. Conclusions
In this paper, we have developed the transmission scheme of LOS-based EGT/EGC for
point-to-point massive-MIMO systems in frequency-selective Ricean fading channels without and
with spatial correlation. In particular, we have derived expressions of the system achievable rate and
determined several power scaling laws. In addition, we have also generalized our analysis to the

150
Electronics 2019, 8, 79

cooperative relaying scenarios with DF and AF protocols, respectively. It is shown by our simulation
results that the spatial correlation can improve the system performance and thus is an advantage,
which is contrary to the traditional point of view. Compared to the Rayleigh fading environments,
deployment of large scale antenna arrays in Ricean fading environments would be more suitable.
For instance, massive MIMO can be applied in microwave backhaul links [26].

Author Contributions: Formal Analysis, Q.Y.; Validation, Y.S.; Writing—Original Draft Preparation, Q.Y., Y.S.;
Writing—Review and Editing, D.-W.Y.; Supervision, D.-W.Y.
Funding: This research was funded by the open research fund of State Key Laboratory of Integrated Services
Networks, the Fundamental Research Funds for the Central Universities Grant No. 3132016347, and the Natural
Science Foundation of Liaoning Province Grant No. 201602086.
Conﬂicts of Interest: The authors declare no conﬂict of interest.

References
1. Shafi, M.; Molisch, A.F.; Smith, P.J.; Haustein, T.; Zhu, P.; Silva, P.D.; Tufvesson, F.; Benjebbour, A.; Wunder, G.
5G: A tutorial overview of standards, trials, challenges, deployment, and practice. IEEE J. Sel. Areas Commun.
2017, 35, 1201–1221. [CrossRef]
2. Marzetta, T.L. Noncooperative cellular wireless with unlimited numbers of base station antennas. IEEE Trans.
Wirel. Commun. 2010, 9, 3590–3600. [CrossRef]
3. Ngo, H.Q.; Larsson, E.G.; Marzetta, T.L. Energy and spectral efficiency of very large multiuser MIMO
systems. IEEE Trans. Commun. 2013, 61, 1436–1449.
4. Liu, W.; Han, S.; Yang, C. Energy efficiency scaling law of massive MIMO systems. IEEE Trans. Commun.
2017, 65, 107–121. [CrossRef]
5. Yang, H.; Marzetta, T.L. Performance of conjugate and zero-forcing beamforming in large-scale antenna
systems. IEEE J. Sel. Areas Commun. 2013, 31, 172–179. [CrossRef]
6. Zhang, Q.; Jin, S.; Wong, K.-K.; Zhu, H.; Matthaiou, M. Power scaling of uplink massive MIMO systems with
arbitrary-rank channel means. IEEE J. Sel. Top. Signal Process. 2014, 62, 966–981. [CrossRef]
7. Rana, M.M.; Kim, J. Fundamentals of Channel Estimations for Mobile Communications-Existing and New
Techniques of a LTE SC-FDMA System; LAMBERT Academic Publishing: Saarbrucken, Germany, 2012;
ISBN 978-3-8454-3137-6.
8. Rusek, F.; Persson, D.; Lau, B.K.; Larsson, E.G.; Marzetta, T.L.; Edfors, O.; Tufvesson, F. Scaling up MIMO:
Opportunities and challenges with very large arrays. IEEE Signal Proc. Mag. 2013, 30, 40–60. [CrossRef]
9. Yue, D.-W.; Zhang, Y.; Jia, Y.-N. Beamforming based on specular component for massive MIMO systems in
Rician fading. IEEE Wirel. Commun. Lett. 2015, 4, 197–200. [CrossRef]
10. Yue, D.-W.; Yan, Q. Beamforming only based on specular component for downlink massive MIMO systems.
Wirel. Pers. Commun. 2017, 95, 1799–1810. [CrossRef]
11. Yue, D.-W.; Meng, Z. LOS component-based equal gain combining for Ricean links in uplink massive MIMO.
In Proceedings of the 2016 Sixth International Conference on Instrumentation & Measurement, Computer,
Communication and Control (IMCCC), Harbin, China, 21–23 July 2016; pp.476–481.
12. Yue, D.-W.; Zhang, Q.T. Generic approach to the performance anaysis of correlated transmit/receive diversity
MIMO systems with/without co-channel interference. IEEE Trans. Inf. Theory 2010, 56, 1147–1157. [CrossRef]
13. Yue, D.-W. Specular component-based beamforming for broadband massive MIMO systems with
doubly-ended correlation. Electron. Lett. 2016, 52, 1082–1084. [CrossRef]
14. McKay, M.R.; Collings, I.B. On the capacity of frenquency-flat and frequency-selective Rician MIMO channels
with single-ended correlation. IEEE Trans. Wirel. Commun. 2006, 5, 2038–2043. [CrossRef]
15. Dinis, R.; Montezuma, P. Iterative receiver based on the EGC for massive MIMO schemes using SC-FDE
modulations. Electron. Lett. 2016, 52, 972–974. [CrossRef]
16. Love, D.J.; Heath, R.W. Equal gain transmission in multiple-input multiple-output wireless systems.
IEEE Trans. Commun. 2003, 51, 1102–1110. [CrossRef]
17. Zheng, X.; Xie, Y.; Li, J.; Stoica, P. MIMO transmit beamforming under uniform elemental power constraint.
IEEE Trans. Signal Process. 2007, 55, 5395–5406. [CrossRef]

151
Electronics 2019, 8, 79

18. Fu, H.; Crussière, M.; Hélard, M. BER analysis for equal gain transmission in downlink multiuser MIMO
systems. IEEE Wirel. Commun. Lett. 2015, 4, 533–536. [CrossRef]
19. Jin, S.; Gao, X.; You, X. On the ergodic capacity of rank-1 Ricean fading MIMO channels. IEEE Trans.
Inf. Theory 2003, 53, 502–517. [CrossRef]
20. Bölcskei, H.; Borgmann, M.; Paulraj, A.J. Impact of the propagation environment on the the performance of
space-frequency coded MIMO-OFDM. IEEE J. Sel. Areas Commun. 2003, 21, 427–439. [CrossRef]
21. Chen, J. When does asymptotic orthogonality exist for very large arrays? In Proceedings of the Global
Communications Conference (GLOBECOM), Atlanta, GA, USA, 9–13 December 2013; pp. 4251–4255.
22. Tulino, A.M.; Verdu, S. Random matrix theory and wireless communications. Found. Trends Commun.
Inf. Theory 2004, 1, 1–182. [CrossRef]
23. Wang, C.-X.; Ge, X.; Cheng, X.; Zhang, G.; Thompson, J. Cooperative MIMO channel models: A survey.
IEEE Commun. Mag. 2010, 48, 80–87. [CrossRef]
24. Ngo, H.Q.; Suraweera, H.A.; Matthaiou, M.; Larsson, E.G. Multipair full-duplex relaying with massive
arrays and linear processing. IEEE J. Sel. Areas Commun. 2014, 32, 1721–1736. [CrossRef]
25. Suraweera, H.A.; Krikidis, I.; Yuen, C. Antenna selection in the full-duplex multi-antenna relay channel.
In Proceedings of the 2013 IEEE International Conference on Communications (ICC), Budapest, Hungary,
9–13 June 2013; pp. 4823–4828.
26. Durisi, G.; Tarable, A.; Camarda, C.; Devassy, R.; Montorsi, G. Capacity bounds for MIMO microwave
backhaul links affected by phase noise. IEEE Trans. Commun. 2014, 62, 920–929. [CrossRef]

152
electronics
Article
Hybrid Beamforming for Millimeter-Wave
Heterogeneous Networks
Mostafa Hefnawi
Department of Electrical and Computer Engineering, Royal Military College of Canada,
Kingston, ON K7K 7B4, Canada; [email protected]

Received: 23 December 2018; Accepted: 23 January 2019; Published: 28 January 2019

Abstract: Heterogeneous networks (HetNets) employing massive multiple-input multiple-output

(MIMO) and millimeter-wave (mmWave) technologies have emerged as a promising solution to
enhance the network capacity and coverage of next-generation 5G cellular networks. However,
the use of traditional fully-digital MIMO beamforming methods, which require one radio frequency
(RF) chain per antenna element, is not practical for large-scale antenna arrays, due to the high
cost and high power consumption. To reduce the number of RF chains, hybrid analog and digital
beamforming has been proposed as an alternative structure. In this paper, therefore, we consider
a HetNet formed with one macro-cell base station (MBS) and multiple small-cell base stations (SBSs)
equipped with large-scale antenna arrays that employ hybrid analog and digital beamforming.
The analog beamforming weight vectors of the MBS and the SBSs correspond to the the best-ﬁxed
multi-beams obtained by eigendecomposition schemes. On the other hand, digital beamforming
weights are optimized to maximize the receive signal-to-interference-plus-noise ratio (SINR) of the
effective channels consisting of the cascade of the analog beamforming weights and the actual channel.
The performance is evaluated in terms of the beampatterns and the ergodic channel capacity and
shows that the proposed hybrid beamforming scheme achieves near-optimal performance with only
four RF chains while requiring considerably less computational complexity.

Keywords: hybrid beamforming; massive MIMO; HetNets; mmWaves

1. Introduction
Recently, heterogeneous networks (HetNets) that use massive multiple-input multiple-output
(MIMO) and millimeter-wave (mmWave) technologies has emerged as a promising solution to
enhance the network capacity and coverage of next-generation 5G cellular networks [1–6]. Small cell
deployment in HetNets can achieve high signal to interference plus noise ratio (SINR) and dense
spectrum reuse, mmWave can address the current challenge of bandwidth shortage, and the large
number of antenna arrays [7–10] are essential for mmWaves to compensate for channel attenuation.
In Reference [11] we applied the concept of massive multiuser (MU)-MIMO to enhance both the access
and the backhaul links in HetNets, and it was shown that such a concept could significantly improve
the system performance in terms of link reliability, spectral efficiency, and energy efficiency. Traditional
MIMO-beamforming systems require a dedicated radio frequency (RF) chain for each antenna element,
which becomes impractical with massive MIMO systems due to either cost or power consumption.
To reduce the number of RF chains, hybrid beamforming (HBF), which combines RF analog and
baseband digital beamformers, has been proposed as a promising solution [12–17]. Figure 1 shows
a general hybrid configuration that connects Na antenna elements to Nd RF chains, where Nd < Na ,
using an analog RF beamforming matrix built from only phase-shifters. Two widely-used analog
beamformer architectures for hybrid beamforming are shown in Figure 2. The fully-connected hybrid
beamforming structure of Figure 2a provides a full beamforming gain per transceiver—but with

Electronics 2019, 8, 133; doi:10.3390/electronics8020133 153 www.mdpi.com/journal/electronics

Electronics 2019, 8, 133

high complexity—by connecting each RF chain to all antennas through a network of Nd × Na phase
shifters [12–15]. Figure 2b, on the other hand, shows a partially-connected structure, where each RF
chain is connected to Na /Nd number of sub-arrays. Such a structure has a lower hardware complexity
at the price of reduced beamforming gain.

Figure 1. Hybrid beamforming.

(a) (b)
Figure 2. The architecture of analog beamformers: (a) Fully-connected structure; (b) partially-
connected structure.

Previous studies on massive hybrid MIMO mainly focused on single-user systems [12–14]. On the
other hand, MU-MIMO cases were studied in References [15–17]. In Reference [15] a scheme called Joint
Spatial Division Multiplexing (JSDM) was proposed to create multiple virtual sectors which reduce the
overhead and computational complexity of downlink training and uplink feedback. In References [16,17]
it was shown that the required number of RF chains only needs to be twice the number of data
streams to achieve the same performance of any fully-digital beamforming scheme. These studies,
however, did not consider HBF in the context of HetNets and focused primarily on macro-cellular
systems. In this paper, we consider a HetNet where both the macro-cell base stations (MBSs) and
small-cell base stations (SBSs) are equipped with fully-connected massive hybrid antenna arrays,
while all mobile users have a single antenna. We propose a low-complexity HBF that is fully-based on
eigenbeamforming. The MBSs and the SBSs select the best-ﬁxed multi-beams by eigendecomposition
of the access and backhaul channels. The selected beams are then used by the digital beamformers,
which are based on the maximization of the receive SINR of the effective channels consisting of the
cascade of the analog beamforming weights and the actual channel [18,19].

2. System Model
We consider the access and backhaul uplinks in the HetNet of Figure 3, where K cognitive small
cells and their Ls small-cell users (SUs) are concurrently sharing the same frequency band with one
MBS and their L p macro-cell primary users (PUs). It is assumed that both the MBS and SBSs are
equipped with massive hybrid antenna arrays while the SUs and PUs are equipped with a single

154
Electronics 2019, 8, 133

antenna. The SBSs are acting as smart relays between the SUs and the MBS with Na − element
transmitting/receiving antenna arrays and Nd RF chains. The MBS is equipped with Ma − element
antenna arrays and Md RF chains. It is also assumed that both the SBS and the MBS perform
an OFDM-based transmission and that the analog beamformers are identical for all subcarriers while
adapting digital beamformers
! in each
" subcarrier. ! "
p p p
Let xs [ f n ] = x1s , x2s , · · · , x sLs and x p [ f n ] = x1 , x2 , · · · , x L p denote, respectively, the set of Ls
SUs signals and L p PUs signals transmitted on each subcarrier, and f n , n = 1, · · · , Nc , where Nc
denotes the number of subcarriers per OFDM symbol in the system. The analysis is done separately
on each subcarrier. For brevity therefore, we drop the frequency index f n .

Figure 3. System model: hybrid beamforming-based HetNet with one macro-cell and K small-cells.

2.1. Access Link

The Na × 1 received signal vector yk,SBS at the kth SBS is given by

yk,SBS = Gk,SU xs + Gk,PU x p + nk,SBS , (1)

where Gk,SU ∈ C Na × Ls is the channel matrix between the kth SBS and its Ls users, Gk,PU ∈ C Na × L p is
the channel matrix between the kth SBS and the L p PUs, xs ∈ C Ls ×1 is the transmitted signal vector of
Ls users in the kth small-cell, x p ∈ C L p ×1 is the transmitted signal vector of L p users in the HetNet,
and nk,SBS ∈ C Na ×1 is the received complex additive white Gaussian noise (AWGN) vector at the
kth SBS.
It should be noted that in Equation (1), the interference between small-cells was neglected.
This was justiﬁed by the fact that small-cell base stations are using a large number of antennas,
which enables sharp beamforming towards their users without harming neighboring small-cells.
The kth SBS received signal, yk,SBS , is ﬁrst applied to an Na × Nd receive analog beamforming
weight matrix, ASBS R,k,ls , whose output is directed to an Nd × Nd receive digital beamforming weight
vector DSBS
R,k,ls . If we denote the combined digital-analog receive beamformer for the lsth user as
wR,k,ls = AR,k,ls DR,k,ls , then the detection of the lsth user signal by its kth SBS can be expressed as
SBS SBS

rk,ls = wH
R,k,ls yk,SBS = w R,k,ls Gk,SU xs + w R,k,ls Gk,PU x p + w R,k,ls nk,SBS
H H H
Ls (2)
= wR,k,l
H g x s + wR,k,l
s k,ls ls
H
s
∑ gk,ixs + wR,k,l
H
s
+ Gk,PU x p + wR,k,l
H n
s k,SBS
i
i =1,i =ls

155
Electronics 2019, 8, 133

where gk,ls is the lsth column of Gk,SU that represents the channel between the kth SBS and its lsth user.
H
If we denote H AL,k,ls = ASBSR,k,ls gk,ls as the effective access channel between the kth SBS and its
H
lsth user, then for a set of selected beams, i.e. known ASBS R,k,ls , the SINR can be expressed in terms of
the digital beamformer, DSBS
R,k,ls , as

H H
DSBS
R,k,ls H AL,k,ls xlss xlss H H SBS
AL,k,ls D R,k,ls
k,ls =
γSBS H , (3)
DSBS
R,k,ls B AL D SBS
R,k,ls

where B AL is the covariance matrix of the interference-plus-noise given by

Ls H
H
B AL = ∑ (ASBS s s H H SBS
R,k,i ) gk,i xi ( xi ) gk,i A R,k,i + (A R,k,ls ) Gk,PU x p x p Gk,PU A R,k,ls + σn (A R,k,ls )A R,k,ls , (4)
2
SBS
34
H H SBS
5
2 SBS SBS
i =1,i =ls
2 34 5 Interference from L p PUs
Interference from Ls −1 SUs

2.2. Backhaul Link

The hybrid beamforming weights at the backhaul link are obtained based on orthogonal pilot
signals transmitted from each SBS to the MBS. The kth SBS applies its pilot signal sk ∈ C Nd ×1 to
p

an Nd × Nd transmit digital beamforming weight vector DT,k followed by an Na × Nd transmit analog

SBS

beamforming matrix ASBST,k . If we denote the combined digital-analog transmit beamformer for the k
th

SBS as wT,k = AT,k DT,k , then the array output of the MBS can be written as
SBS SBS

K
∑ Hk,MBS wT,k sk + HPU,MBS x p + n MBS ,
p p
yMBS = (5)
k =1

p
where yMBS is the Ma × 1 vector containing the outputs of the Ma − element antenna array of the
MBS, Hk,MBS is the Na × Ma channel matrix representing the transfer functions from the Na − element
antenna array of the kth SBS to the Ma − element antenna array of the MBS, HPU,MBS is the Ma × Lp
channel matrix from the L p PUs to the MBS’s Ma − element antenna array, and n MBS is the received
Ma × 1 complex AWGN vector at the MBS.
p
The MBS detects the kth SBS signal by applying the output of the array yMBS to the Ma × Md receive
analog weight matrix AR,k followed by the Md × Md receive digital beamforming weight vector Dk,R
MBS MBS .

If we denote the combined digital-analog receive beamformer for the k SBS as ck = Ak,R Dk,R ,
th MBS MBS

then the detection of the kth SBS signal by the MBS can be expressed as
p p
x̂k = ckH yMBS = Sk + S Ik + S I p + ckH n MBS , (6)

p p p
where Sk = ckH Hk,MBS wT,k sk is the kth SBS signal, S Ik = ckH ∑iK=1,i =k Hk,MBS wT,k sk is the interference
from K − 1 other SBSs, and SIp = ckH HPU,MBS x p is the interference from L p PUs.

p p
Assuming that sk are complex-valued random variables with unit power, i.e., E sk 2 = 1,
H H
and denoting HBL,k = AR,kMBS Hk,MBS ASBS
T,k as the effective channel between the kth SBS and the
MBS, the SINR at the MBS for the kth SBS can be expressed as
H H
MBS
DR,k HBL,k DSBS DSBS H MBS
T,k T,k H BL,k D R,k
γkMBS = , (7)
ckH BBL ck

156
Electronics 2019, 8, 133

where BBL = ∑iK=1,i =k Hi,MBS wT,i wT,i

H HH
i,MBS + H PU,MBS x p x p H PU,MBS + σn I Ma is the covariance
H H 2

matrix of the interference-plus-noise at the backhaul link.

2.3. End-to-End SINR and Channel Capacity

Once the hybrid beamforming weights of the backhaul link are obtained, they can be used to
forward the SUs signals to the MBS. The kth SBS applies the received lsth user signal, wR,k,l
H g x s , to the
s k,ls ls
hybrid beamformer, wT,k . The NT,a × 1 transmitted signal sk,ls at the output of the antenna array can
then be expressed as
sk,ls = wT,k wR,k,l
H
g xs ,
s k,ls ls
(8)

and the expression for the array output of the MBS can be written as

K
y MBS = HPU,MBS x p + yk,MBS + ∑ yi,MBS + n MBS , (9)
i =1, i =k

where yk,MBS = Hk,MBS wT,k wR,k,l

H y is the array output of the MBS from the kth SBS.
s k
Using Equation (2), yk,MBS can be expressed in terms of the lsth user signal, xlss , as follows:

yk,MBS = Hk,MBS wT,k wR,k,l

H g xs
s k,ls ls
+ Hk,MBS wT,k wR,k,ls ∑iL=s 1,i =ls gk,i xis
H
(10)
+ Hk,MBS wT,k wR,k,l
H G
s k,PU
xp
+ Hk,MBS wT,k wR,k,ls nk,SBS ,
H

When the MBS applies the output of the array, y MBS , to the hybrid weight, ckH , the detection of
the lsth
user signal of the kth SBS by the MBS can be expressed as

x̂k,ls = ckH yMBS = ckH Sk,ls + SSBSs
I + SSU
k,I + S I + N MBS
PU
, (11)

where

Sk,ls = Hk,MBS wT,k wR,k,l

H g x s is the lsth user signal of the kth SBS,
s k,ls ls
Ls
k,I = Hk,MBS wT,k w R,k,ls ∑i =1,i =ls gk,i xi is the interference from the Ls − 1 other SUs of k SBS,
SSU H s th

SSBS
I = ∑iK=1, i =k Hi,MBS wT,i wR,i,l
H y
s i,SBS
is the interference from the K − 1 other SBSs.

I = Hk,MBS wT,k w R,k,ls Gk,PU x p + ck H PU,MBS x p

SPU H H

N MBS = Hk,MBS wT,k wR,k,ls nk,SBS + n MBS
H

The end-to-end SINR at the MBS for the lsth user of the kth SBS can be expressed as
H
ckH Hk,MBS wT,k wR,k,l
H g x s xlss gk,l
H w H H
s k,ls ls s R,k,ls wT,k H k,MBS ck
MBS
γk,l = , (12)
s
ckH B AL− BL ck
H H H
where B AL− BL = SSU SU
k,I Sk,I + SSBS
I SSBS
I + SPU
I SPU
I + N MBS (N MBS ) H is the covariance
matrix of the interference-plus-noise for the lsth user end-to-end link.
The ergodic channel capacity for each user, ls , is given by [19]
H
#
ckH Hk,MBS wT,k wR,k,l
H g xs x Hs gk,l
s k,ls
H w H
R,k,ls wT,k H k,MBS ck
C = E log2 1 + s
, (13)
ckH B AL− BL ck

where E(.) denotes the expectation operator.

157
Electronics 2019, 8, 133

2.4. Channel Model

For the access and backhaul links, we consider mmWave propagation channels with limited
scattering which can be modelled at each subcarrier by the clustered channel representation [13].
We assume a scattering environment with Ncl scattering clusters randomly distributed in space and
within each cluster, there are Nray closely located scatterers.
For the backhaul link, the channel matrix at subcarrier f n between the kth SBS and the MBS can be
expressed as
7
Na Ma
a∗k,SBS, f n ∅i,l
N N
Hk,MBS, f n =
Ncl Nray ∑i cl ∑l=ray1 αil, fn a MBS, fn ∅ri,l , θi,l
r t t
, θi,l , (14)

αil e− j2πi f n /Nc are the complex gains of the jth ray in the ith scattering cluster and 6
where αil, f n = 6 αil are
assumed i.i.d CN(0, σ6α2,i ). With σ6α2,i representing the average power of the ith cluster, ∅ri,l and ∅i,l t are

the azimuth angles of arrival and departure, respectively, θi,j r and θ t are the elevation angles of arrival
i,j
and departure, respectively, and a MBS, f n ∅ri,l , θi,l r and ak,SBS, f n ∅i,l
t , θt
i,l represent, respectively,
the normalized receive and transmit array response vectors of the MBS and the kth SBS.
For the access link, the channel matrix at subcarrier f n between the kth SBS and its Ls users can be
written as
7
Ls Ma
a∗k,SU, f n ∅i,l
Ncl Nray
Ncl Nray ∑i ∑l =1 il, f n SBS, f n
Gk,SU, f n = α a ∅ri,l , θi,l
r t t
, θi,l , (15)

where ak,SU, f n ∅i,l
t , θt
i,l represents the spatial signature vector of the Ls single antenna users.
The Nray azimuth and elevation angles, ∅r,t r,t
i,l and θi,l , within the cluster i are assumed to be
randomly distributed with a uniformly-random mean cluster angle of ∅r,t r,t
i and θi , respectively,
and a constant angular spread of σ∅r,t and σθ r,t , respectively.
For simplicity, the access links between the MBS and its L p users (HPU,MBS ) and between PUs and
the kth SBS (Gk,PU ) are modeled by convetional i.i.d MIMO channels.
Note that in this per-subcarrrier representation, it is assumed
that foreach subcarrierf n , the carrier

frequency f c is much larger than f c ± f n and that a MBS, f n ∅ri,l , θi,lr and ak,SBS, f n ∅i,lt , θt
i,l can
approximately be considered equal for all subcarriers. Consequently, the channel covariance
matrices are approximately similar with the same set of eigenvectors for all subcarriers and can be
Nc
AL,k,ls H AL,k,ls = Nc ∑n=1 H AL,k,ls , f n H AL,k,ls , f n ,
1
replaced by the average of the covariance matrices, i.e., H H H

N N
BL,k = f n H BL,k, f n , and H PU,MBS H PU,MBS =
H H 1 H H 1 H
HBL,k Nc ∑n=c 1 HBL,k, Nc ∑n=c 1 HPU,MBS, f n H PU,MBS, f n .

3. Proposed Hybrid Beamforming

3.1. Access Link

The kth SBS communicates with each SU through a set of selected beams that corresponds to a set
of weight vectors. These weight vectors are obtained using the eigenbeamforming scheme and can be
expressed as
R,k,ls = a R,k,ls ,1 , a R,k,ls ,2 , · · · , a R,k,ls ,Nd
ASBS SBS SBS SBS
2 , (16)

subject to ASBS R,k,ls (i, j ) = 1

R,k,ls ,i denote the i selected Ma × 1 eigenvector corresponding to the i maximum eigen

where aSBS th th
H
value of g k,ls gk,ls .

158
Electronics 2019, 8, 133

Since the analog beamforming matrix ASBS R,k,ls is realized using phase shifters only, its elements,
2
SBS
AR,k,ls (i, j), satisfy AR,k,ls (i, j) = 1. It should be noted that each SBS is using a different analog matrix
SBS

for each user and that the system model shown in Figure 1 focuses on the detection of the lsth user of the
kth SBS and shows the analog beamformer and the RF chains for one user only. The analog beamformer
can be implemented using the Butler matrix as shown in Figure 4, where four users ( Ls = 4) and
four RF chains per user ( Nd = 4) are assumed. Depending upon which 4 ports are activated, 4 beams
are produced in speciﬁed directions. Since we are assuming 4 different channels, we should expect
4 different ports for each user.

Figure 4. Hybrid beamforming based on Butler matrix for the access link.

Once the analog beams are selected, the received optimal digital weights, DSBS R,k,ls , are obtained
based on the maximization of the access link receive SINR, γSBS
k,ls , given by Equation (3):

−1
R,k,ls = B AL,k,ls V AL,k,ls ,
DSBS (17)

where V AL,k,ls denote the eigen vector corresponding to the maximum eigenvalue of the effective access
channel, H H
AL,k,ls H AL,k,ls .

3.2. Backhaul Link

The transmit analog weights of the kth SBS are based on the eigenbeamforming scheme and are
given by
T,k = aT,k,1 , aT,k,2 , · · · , aT,k,Nd
ASBS SBS SBS SBS
2 , (18)

T,k (i, j ) = 1
subject to ASBS

T,k,i denote the i selected Na × 1 eigenvector corresponding to the i maximum eigenvalue

where aSBS th th
H
of H k,MBS Hk,MBS .
Assuming channel reciprocity with Na = Ma , the receive analog weight vectors of the MBS are
MBS = ASBS . It should also be noted that the MBS is using a different analog matrix for each
given by AR,k T,k
SBS, which can be implemented using the Butler matrix of Figure 4, where mobile users are replaced
by SBSs.

159
Electronics 2019, 8, 133

For ﬁxed analog beamforming weights, ASBS MBS

T,k and A R,k , the transmit optimal digital weight vector
th SBS MBS , are obtained base
of the k SBS, DT,k , and the receive optimal digital weight vector of the MBS, DR,k
on the maximization of the backhaul link receive SINR and are given by

−1
T,k = D R,k = B BL,k HBL,k V BL ,
DSBS MBS
(19)

where VBL is the eigenvector corresponding to the maximum eigenvalue of HBL,kH H

BL,k , with H BL,k
H H
representing the effective channel given by HBL,k = AR
MBS SBS
Hk,MBS AT .
k k

4. Simulation Results
In our simulation setups, we considered a HetNet organized into four SBSs (K = 4) and one
macro-cell. The SBSs and the MBS used the same number of antennas, Na = Ma = 64, and the same
number of RF chains, Nd = Md = 2 or 4. Each SBS is serving Ls = 4 users and the macro-cell is
serving 4 users, each transmitting with a single antenna. We assumed QPSK modulation. For the
OFDM conﬁgurations, we assumed the 256-OFDM system (Nc = 256), which is widely deployed in
broadband wireless access services.
Figure 5 shows the beampattern of the proposed HBF with four RF chains and the optimal
fully-digital one for the access link. It is noted that the optimal beamformer has about ﬁve dominant
beams, three of which are similar to the selected beams of the proposed HBF. This beampattern means
that the data streams can be successfully transmitted through those three beams using the proposed
HBF and that near optimal performance could be achieved if we were to bring the number of RF chains
close to the number of dominant beams of the optimal beamformer. For the backhaul link, Figure 6
shows very similar beampatterns with more dominant beams.

(a) (b)
Figure 5. Beampattern of the access link: (a) Proposed HBF, 4 RF chains; (b) fully-digital
beamforming (optimal).

(a) (b)

Figure 6. Beampattern of the backhaul link: (a) Proposed HBF, 4 RF chains; (b) fully-digital
beamforming (optimal).

160
Electronics 2019, 8, 133

Figure 7, on the other hand, compares the ergodic channel capacity of the proposed HBF and the
optimal fully-digital one. It is observed that for both cases the optimal beamformer is outperforming
the proposed HBF. However, as we increase the number of RF chains, the performance gap between
the two schemes was reduced, and a near-optimal solution was achieved by the proposed HBF using
four RF chains. On the other hand, for the single cell MU-MIMO case presented in References [12–14],
near optimal performance was achieved with only ﬁve RF chains, and for the MU-MIMO case in [16,17],
it was shown that the required number RF chains could be reduced to two to achieve fully digital
beamforming performance. However, unlike our case, where we have assumed a HetNet with a macro
cell and multiple small cognitive cells, these studies focused primarily on macro-cellular systems and
did not consider HBF in the context of HetNets.

Figure 7. Ergodic channel capacity of the proposed HBF for different number of RF chains.

5. Conclusions
In this paper, we employed hybrid beamforming at the access and backhaul links of a mmWave
HetNet system. We proposed a low-complexity HBF that was fully-based on MRT/MRC Eigen-
beamforming schemes. The performance evaluation in terms of the beam patterns and the ergodic
channel capacity showed that the proposed HBF scheme achieved near-optimal performance with
only four RF chains and required considerably less computational complexity.

Funding: This research received no external funding.

Acknowledgments: The author would like to thank the Canadian Microelectronics Corporation (CMC) for
providing the Heterogeneous Parallel Platform to run the computationally-intensive Monte-Carlo Simulations.
Conﬂicts of Interest: The authors declare no conﬂict of interest.

References
1. Siddique, U.; Tabassum, H.; Hossain, E.; Kim, D.I. Wireless backhauling of 5G small cells: Challenges and
solution approaches. IEEE Wirel. Commun. 2015, 22, 22–31. [CrossRef]
2. Gao, Z.; Dai, L.; Mi, D.; Wang, Z.; Imran, M.A.; Shakir, M.Z. MmWave Massive MIMO Based Wireless
Backhaul for 5G Ultra-Dense Network. IEEE Wirel. Commun. 2015, 22, 13–21. [CrossRef]
3. Tabassum, H.; Hamdi, S.A.; Hossai, E. Analysis of massive MIMO-enabled downlink wireless backhauling
for full-duplex small cells. IEEE Trans. Commun. 2016, 64, 2354–2369. [CrossRef]
4. Shariat, M.; Pateromichelakis, E.; Quddus, A.; Tafazolli, R. Joint TDD Backhaul and Access Optimization in
Dense Small-Cell Networks. IEEE Trans. Veh. Technol. 2015, 64, 5288–5299. [CrossRef]
5. ElSawy, H.; Hossain, E.; Kim, D.I. HetNets with cognitive small cells: User ofﬂoading and Distributed
channel allocation techniques. IEEE Commun. Mag. 2013, 51, 28–36. [CrossRef]

161
Electronics 2019, 8, 133

6. Yan, Z.; Zhou, W.; Chen, S.; Liu, H. Modeling and Analysis of Two-Tier HetNets with Cognitive Small Cells.
IEEE Access. 2016, 5, 2904–2912. [CrossRef]
7. Marzetta, T.L. Noncooperative cellular wireless with unlimited numbers of base station antennas. IEEE Trans.
Wirel. Commun. 2010, 9, 3590–3600. [CrossRef]
8. Rusek, F.; Persson, D.; Lau, B.; Larsson, E.; Marzetta, T.L.; Edfors, O.; Tufvesson, F. Scaling up MIMO:
Opportunities and challenges with very large arrays. IEEE Signal Process. Mag. 2013, 30, 40–60. [CrossRef]
9. Ngo, H.Q.; Larsson, E.G.; Marzetta, T.L. Energy and spectral efﬁciency of very large multiuser MIMO
systems. IEEE Trans. Commun. 2013, 61, 1436–1449.
10. Hoydis, J.; ten Brink, S.; Debbah, M. Massive MIMO in the UL/DL of cellular networks: How many antennas
do we need? IEEE J. Sel. Areas Commun. 2013, 31, 160–171. [CrossRef]
11. Hefnawi, M. Capacity-Aware Multi-User Massive MIMO for Heterogeneous Cellular Network. In Proceedings
of the IEEE International Conference on Selected Topics in Mobile and Wireless Networking, Tangier,
Morocco, 20–22 June 2018.
12. Sohrabi, F.; Yu, W. Hybrid beamforming with ﬁnite-resolution phase shifters for large-scale MIMO systems.
In Proceedings of the IEEE Workshop Signal Processing Advances in Wireless Communications, Stockholm,
Sweden, 28 June–1 July 2015.
13. El Ayach, O.; Rajagopal, S.; Abu-Surra, S.; Pi, Z.; Heath, R. Spatially sparse precoding in millimeter wave
MIMO systems. IEEE Trans. Wirel. Commun. 2014, 13, 1499–1513. [CrossRef]
14. Alkhateeb, A.; El Ayach, O.; Leus, G.; Heath, R. Channel estimation and hybrid precoding for millimeter
wave cellular systems. IEEE J. Sel. Top. Signal Process. 2014, 8, 831–846. [CrossRef]
15. Sohrabi, F.; Yu, W. Hybrid digital and analog beamforming design for large-scale MIMO systems.
In Proceedings of the IEEE International Conference on Acoustics, Speech, Signal Process (ICASSP), Brisbane,
Australia, 19–24 April 2015.
16. Liang, L.; Dai, Y.; Xu, W.; Dong, X. How to approach zero-forcing under RF chain limitations in large
mmwave multiuser systems? In Proceedings of the IEEE/CIC International Conference on Communications
in China, Shanghai, China, 13–15 October 2014.
17. Liang, L.; Xu, W.; Dong, X. Low-complexity hybrid precoding in massive multiuser MIMO systems.
IEEE Wirel. Commun. 2014. [CrossRef]
18. Kang, M. A comparative study on the performance of MIMO MRC systems with and without cochannel
interference. IEEE Trans. Commun. 2004, 52, 1417–1425. [CrossRef]
19. Sulyman, A.I.; Hefnawi, M. Adaptive MIMO Beamforming Algorithm Based on Gradient Search of the
Channel Capacity in OFDMSDMA System. IEEE Commun. Lett. 2008, 12, 642–644. [CrossRef]

© 2019 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access
article distributed under the terms and conditions of the Creative Commons Attribution
(CC BY) license (http://creativecommons.org/licenses/by/4.0/).

162
electronics
Article
Spatio-Radio Resource Management and Hybrid
Beamforming for Limited Feedback
Massive MIMO Systems
Hedi Khammari 1 , Irfan Ahmed 2, *, Ghulam Bhatti 3 and Masoud Alajmi 1
1 Department of Computer Engineering, College of Computers and Information Technology, Taif University,
Taif 21974, Saudi Arabia; [email protected] (H.K.); [email protected] (M.A.)
2 Department of Electrical Engineering, Higher Colleges of Technology, Ruwais 12389, UAE
3 Department of Computer Science, College of Computers and Information Technology, Taif University,
Taif 21974, Saudi Arabia; [email protected]
* Correspondence: [email protected]

Received: 5 August 2019; Accepted: 17 September 2019; Published: 20 September 2019

Abstract: In this paper, a joint spatio–radio frequency resource allocation and hybrid beamforming
scheme for the massive multiple-input multiple-output (MIMO) systems is proposed. We consider
limited feedback two-stage hybrid beamformimg for decomposing the precoding matrix at the
base-station. To reduce the channel state information (CSI) feedback of massive MIMO, we utilize the
channel covariance-based RF precoding and beam selection. This beam selection process minimizes
the inter-group interference. The regularized block diagonalization can mitigate the inter-group
interference, but requires substantial overhead feedback. We use channel covariance-based
eigenmodes and discrete Fourier transforms (DFT) to reduce the feedback overhead and design
a simpliﬁed analog precoder. The columns of the analog beamforming matrix are selected based on
the users’ grouping performed by the K-mean unsupervised machine learning algorithm. The digital
precoder is designed with joint optimization of intra-group user utility function. It has been shown
that more than 50% feedback overhead is reduced by the eigenmodes-based analog precoder design.
The joint beams, users scheduling and limited feedbacK-based hybrid precoding increases the
sum-rate by 27.6% compared to the sum-rate of one-group case, and reduce the feedback overhead
by 62.5% compared to the full CSI feedback.

Keywords: hybrid beamforming; massive MIMO; resource allocation

1. Introduction
The scarcity of available frequency band for wireless communications has led to the inclusion
of millimeter Wave (mmWave) frequencies in cellular communications. This has opened the doors
for massive multiple-input multiple-output (MIMO) systems. Due to high transmission frequencies,
fabrication of large number of antennas with small form factor has become possible. MmWave band
has inherent hindrances, like, high path-loss and absorption-loss. It has been known that MIMO
systems advantages (spatial multiplexing or diversity gain) are scaled-up with the number of antennas.
In summary, one can enjoy the beneﬁts of the large bandwidth available at mmWave frequencies by
combating high path and absorption losses with massive MIMO directional beamforming. Future
mmWave massive MIMO-based cellular networks will be as shown in Figure 1. Due to the high
pathloss on one hand and high directional gain on the other hand, the inter-cell interference and cell
boundaries will become meaningless. The ﬁxed area size cell boundaries of traditional cellular will
probably no longer exist in the future mmWave massive MIMO systems. Narrow beams can serve
distant user equipment (UE) without interfering other UEs provided that there is no obstacle between
BS and intended UE, whereas a closely located UE may deprive of connection due to the obstacles.

Electronics 2019, 8, 1061; doi:10.3390/electronics8101061 163 www.mdpi.com/journal/electronics

Electronics 2019, 8, 1061

ĂƐĞ^ƚĂƚŝŽŶ

ĂƐĞ^ƚĂƚŝŽŶ ĂƐĞ^ƚĂƚŝŽŶ

ĂƐĞ^ƚĂƚŝŽŶ

Figure 1. MmWave massive MIMO based cellular system.

The cost of massive MIMO is in terms of excessive feedback overhead for channel estimation
along with the hardware complexity of RF chains (increased number of radio-frequency (RF) chains).
The feedback overhead has been tackled separately for frequency division duplex (FDD) and time
division duplex (TDD) systems. In FDD systems, the uplink channel estimation consists of fewer
overheads compared to the downlink channel estimation, because generally, the number of transmit
antennas Nt is larger than the number of users K, and the number of receive antennas per user nk
(Nt K and Nt nk ). The most common technique to reduce the downlink channel estimation
overhead is joint spatial division multiplexing (JSDM) [1]. The JSDM uses two-stage precoding:
second order channel statistics (covariance)-based user grouping and the traditional MU-MIMO
linear precoding (zero-forcing) for the inter-user interference mitigation based on the low-dimensional
effective channel. In TDD, only uplink channel estimation is done and the downlink channel estimates
are obtained by the transpose of the uplink channel using the channel reciprocity principle. The TDD
massive MIMO systems suffer from pilot contamination when the BS receives non-orthogonal pilot
signals from the neighboring cells. This pilot contamination degrades the channel estimation and
hence, affects both uplink combining and downlink precoding.
In traditional MIMO systems, a separate RF chain (analog-to-digital converter/digital-to-analog
converter, serial-to-parallel/parralel-to-serial converter, up/down converter etc) is required for each
antenna, but the high power consumption makes it infeasible for the case of massive MIMO systems.
Hybrid beamforming technique resolves this problem by dividing the precoding/combining into
baseband digital processing and RF analog processing. The hybrid precoding and combining
offer extra degrees of freedom in space domain with a large number of antennas and analog
beamforming [2]. The hybrid beamforming can be realized by using MU-MIMO precoding as
baseband digital precoding and the statistical channel state information-based pre-beamforming as RF
analog precoding. This limited feedback (due to average CSI) conﬁguration is particularly suited for
massive MIMO mmWave systems with a large number of antennas but relatively small number of RF
chains [3]. It has been shown [4], that the covariance-based limited feedback works well for mmWave
massive MIMO systems, where the number of users is small with respect to the number of BS antennas
and the channels are formed by a few multi path components (MPCs) with small angular spread.

164
Electronics 2019, 8, 1061

Limited work has been done on the joint multiuser massive MIMO resource allocation and
hybrid beamforming design. Although mmWave massive MIMO system has a potential of tremendous
increase of spectral efficiency. However, the cost and power consumption of power-hungry radio
frequency chains (analog-to-digital converter (ADC)/digital-to-analog converter (DAC), parallel to
serial converter, serial to parallel converter, up converter/down converter) make it impractical to build
a complete RF chain for each antenna. A promising solution to this problem is hybrid beamforming,
where the precoder at the transmitter is divided into two parts: analog precoder and digital precoder.
The analog precoder (usually a network of phase shifters) at the RF stage reduces the number of
RF chains required for the digital precoder. In order to configure these precoders, the transmitter
requires channel state information in the form of uplink feedback from users, but in the presence
of massive antennas, this feedback becomes a huge load on the wireless uplink, especially in FDD
mode. JSDM [4] is a technique used to reduce the feedback overhead. It uses slowly varying average
channel statistics to implement the analog precoder; then, the digital precoder is realized by using a
low-dimensional effective channel. Till now, different variants of the JSDM have been proposed. Li et
al. [5] generalize the JSDM scheme to support non-orthogonal virtual sectorization and with multiple
RF chains at both link ends. It uses the Kronecker channel model to decouple the transmit and
receive beamforming. Under this channel, the analog beamformer is obtained by stacking strongest
eigenbeams of the channel covariance matrix and then the digital beamformer is based on a weighted
minimum mean squared error (MMSE) with effective channel. However, the Kronecker model does
not characterize the mmWave channel where transmitter and receiver have coupling effects due to
highly directional transmission. In [6], the authors apply JSDM using a geometrical channel model and
find hybrid precoder and combiner at transmitter and receivers, respectively. Hybrid beamforming
with switches (HBwS) has been introduced in [7], where, L × Nt analog beamformer is controlled by
NRF × L instantaneous CSI based switches. Nt is the number of transmit antennas, NRF is the number
of RF chains, and NRF < L < Nt . Another switch-based analog beamforming is proposed in [8] but it
requires instantaneous CSI for both switching network and the phase shifter network. Also it contains
L = Nt . The JSDM implementation also requires the training in the downlink to estimate the channel
covariance matrix. Most of the work assumes that the CSI is known at both ends. In [9], authors
consider the joint optimization of the training resource allocation and channel-statistics-based analog
beamformer design by using user centric virtual sectorization. There are different structures for the
phase shifter-based analog beamformer, namely, fully connected, sub-connected, and dynamically
connected [10]. Park [11] investigate JSDM with these analog beamformer architectures. The dynamic
architecture gives better result at the cost of added complexity. In [12], authors propose a hybrid
beamforming method with unified analog beamformer by Subspace Construction (SC) based on
partial CSI in massive MIMO OFDM system. In [13], statistical CSI based analog beamformer uses
regularized block diagonalization to mitigate the inter-group interference and instantaneous CSI based
digital beamformer utilizes the weighted MMSE to suppress intra-group interference. Jiang et al. [14]
jointly optimize the user selection and beam selection during analog beamforming design. They use
Lyapunov-drift optimization framework to obtain the optimal solution. Their work only focuses on the
design of statistical CSI based analog precoder and user/beam selection. Our previous work [15] on
resource allocation for transmit beamforming develops digital and analog precoders which maximize
the sum rate with total power and desired number of RF chains constraints. The provided solutions
require full instantaneous CSI at the transmitter and receiver, which, in case of the massive MIMO,
consists of large number of pilot transmission in downlink and channel information feedback in
the uplink. In this work we exploit the channel similarities by grouping (K-Mean machine learning)
the users based on the location information. Low complexity DFT matrix based analog precoder is
derived using statistical CSI. This greatly reduces the feedback overhead for the design of zero-forcing
digital precoder.
Machine learning (ML) applications for the physical layer of wireless communication systems
have been widely reported in [16]. Most of the conventional transmitter and receiver blocks can be

165
Electronics 2019, 8, 1061

replaced by an ML-based auto encoder as suggested by the authors. The large number of antennas
in massive MIMO leads to the challenging issue of channel estimation in mmWave communications.
A common practice in TDD massive MIMO systems is to utilize the channel reciprocity to get the
downlink CSI from uplink channel information estimates. However, in FDD, the channel reciprocity is
not applicable and the downlink CSI estimation is very difﬁcult. The downlink channel estimation is
known to be hampered by the pilot contamination effect (user to base-station). The quality of channel
estimates is deteriorated by the mutual interference caused by the non-orthogonal pilots in a cell. In
[17], a supervised learning-based pilot decontamination scheme for massive MIMO uplink is reported.
In the proposed ML-based solution, the users’ locations in all cells and the pilot assignments stand
for the input features and output labels, respectively. In [18], a deep learning network CsiNet is
used to learn the CSI-to-codeword transformation (codebook approach is usually adopted to reduce
the feedback overhead) at users’ terminals and inverse CsiNet at base-station. The authors of [19],
suggest a learning-based antenna selection for massive MIMO systems. It uses a multiclass K-NN and
support vector machine (SVM) for data-driven optimal antenna selection. Wang et al. [20] employs
K-nearest neighbor (K − NN ) supervised learning for the N beams allocation among K users. In [21],
a reinforcement learning based framework for radio resource management in radio access networks
has been proposed. In our previous work [22], we used neural networks to reduce the execution time
of the computationally intensive resource allocation part of the joint resource allocation and hybrid
beamforming design in [15]. However, in this work, we use K-mean based unsupervised machine
learning scheme to group the users based on their spatial locations. To the best of our knowledge,
there is no research work that jointly consider the spatio–radio resource management and the hybrid
beamforming in massive MIMO systems.
In this work, we use spatial channel covariance matrices for the analog beamforming design.
We also consider the users to RF beam mapping. This mapping requires channel state information
and a search over all possible beam combinations at the base-station. This search is exponential in the
number of users [23]. Due to this exponential increase in complexity, we use DFT-based eigenmode
beams with RF switches.
Contribution: In this paper, we develop joint spatio–radio resource and hybrid precoding
algorithms for limited feedback wideband massive MIMO systems. The contributions of this paper are
summarized as follow.

• First, we consider the problem of joint hybrid precoder design with limited feedback and
user-beam selection to maximize the sum proportional rate under the total power constraint.
The formulated mixed integer programming problem is then transformed to the relaxed-convex
optimization problem.
• Second, a low complexity suboptimal solution is provided for the optimization problem.
The algorithm generates the analog beamforming matrix, digital beamforming matrix, and the set
of users in each group. The DFT/eigenmodes-based analog beamforming is formed using limited
statistical CSI feedback from the users. Then, the digital precoder design with users selection is
done iteratively.
• Finally, we develop a K-Mean algorithm based unsupervised machine learning scheme for users
grouping. These users groups are used to form the limited feedback (statistical channel state
information) based analog beamforming matrix. The proposed machine learning based analog
beamforming along with the zero-forcing digital precoding and user scheduling gives better
performance than the DFT/eigenmodes-based solution.

The rest of the paper is organized as follows. System, signal, and channel model along with the
problem formulation are described in Section 2. Section 3 introduce the relaxed-convex transformation
of the formulated mixed integer optimization problem. Suboptimal solution to the joint resource
allocation and hybrid beamforming based on eigenmodes and discrete Fourier transform is given
in Section 4. Section 5 proposes machine learning based users grouping and beam selection for

166
Electronics 2019, 8, 1061

joint optimization problem. Simulation results are given in Section 6, followed by the conclusions
in Section 7.
Notations: Bold upper and lower case letters denote vectors and matrices, respectively.
The notations X−1 , X† , X T , X H , and tr (X) denote the inverse, pseudo-inverse, transpose, Hermitian
transpose, and trace of a matrix X. vec{·} is a vector operator, diag{ x1 , ..., xn } is diagonal matrix,
and ⊗ is the Kronecker product. · F denotes the Frobenius norm. The n × n identity matrix is
denoted by In . E{·} represents the expectation with respect to the random variable within the brackets.

2. System Model
Consider a FDD MU-MIMO downlink system where a base station (BS) with Nt antennas is
located at the cell center and transmits to K single antenna users as shown in Figure 2. There are G
groups of users such that the group g ∈ G = {1, ..., G }. Each group contains K g users.

Őсϭ

ŐсϮ

Őс'

Figure 2. System Model.

Assume that the BS and users have the knowledge of the channel. We consider multi-carrier
OFDM transmission with narrow-band blocK-fading channel. The BS is equipped with Nt antennas in
linear antenna array (ULA) conﬁguration. The information signal block S ∈ CK × N f at the input of the
BS transceiver for the user k is given as

s k = [ s1 , s2 , . . . , s N f ], ∀k, (1)

and for the subchannel n,

s n = [ s1 , s2 , . . . , s K ] T , ∀n, (2)

where N f and Ns are the number of subchannels and the number of symbols per subchannel,
respectively. In a subchannel n, the information symbol vector is s ∈ C Ns ×1 . We assume Ns = K,
such that the transmit signal per subchannel n satisfying E{sn snH } = PKn IK , where Pn = PT /N f is the
transmit power per subchannel and PT is the total transmit power of the BS. The transmit signal vector
X is obtained from F B S, where F B ∈ C Nt × Ns is the precoding matrix. The hybrid beamforming divides
the precoding matrix into baseband digital precoding matrix F DB ∈ C NRF × Ns and RF analog precoding

167
Electronics 2019, 8, 1061

matrix F AB ∈ C Nt × NRF , where NRF is the number of RF chains as shown in Figure 3. The transmit
signal X ∈ C Nt × N f is given by

X = FB S
= F AB F DB S (3)

Also, the precoding matrix must satisfy

E{tr (XX H )} ≤ PT
H DB H AB H
E{tr (F AB DB
F SS F F )} ≤ PT
H H
tr (F AB F DB E{SS H }F DB F AB ) ≤ PT (4)

PT
since E{SS H } = N f K IKN f , therefore,

H H
tr (F AB F DB F DB F AB ) ≤ N f K (5)

The transmit signal in subchannel n is xn ∈ C Nt ×1 . Thus, the received signal vector yn ∈ CK ×1 at

K users in subchannel n is given by

yn = HnH xn + wn
= HnH F AB FnDB sn + wn (6)

where Hn [h1,n , ..., hK,n ] ∈ C Nt ×K is the channel matrix with hk,n = [ h1,k , ..., h Nt ,k ] H being the channel
vector from BS to user k in subchannel n, xn = FnDB F AB sn , and wn ∼ CN (0, σ2 IK ) be the additive white
Gaussian noise (AWGN) in subchannel n at the users. The RF beamforming F RF is performed in time
domain and the same beamforming is applied on all subchannels, whereas, the digital beamforming
FnDB is performed in frequency domain on the per subchannel basis [11]. In the nth subchannel,
the jth UE receives the sum of all transmitted signals for K UEs over its MIMO channel H j,i as

K
y j,n = ∑ h j,n
H
xk,n + w j,n (7)
k =1

where h j,n is the Nt × 1 channel vector. We denote the rank of the channel matrix H j,n by r j,n , where
0 ≤ r j,n ≤ min(K, Nt ), ∀n. In matrix form, the above equation is given as

y j,n = h j,n
H
xn + w j,n (8)

The 1 × N f received signal at the kth UE is given by

yk = [hk,1
H AB DB H
F F1 s1 , ..., hk,N f N f s N f ] + wk ,
F AB F DB (9)

Combining the signals for all UEs in a K dimensional received signal vector y = [y1 , ..., yK ] H ,
we get the system equation as
Y = H H F AB F DB S + W, (10)

where Y, W ∈ CK × N f .

168
Electronics 2019, 8, 1061

DŝǆĞƌ

DŝǆĞƌ
п ^ͬW &&d Ǉϭ

/&&d Wͬ^ п

^ĐŚĞĚƵůŝŶŐ
E^ ŝŐŝƚĂů Eƚ
ƚŽƐĞůĞĐƚ< ŶĂůŽŐ
WƌĞĐŽĚĞƌ
DhͲD/DK < &
EZ&
WƌĞĐŽĚĞƌ&
<
ĂƚĂ
ƵƐĞƌƐ
ƐƚƌĞĂŵƐ

/&&d Wͬ^ п DŝǆĞƌ

п ^ͬW &&d Ǉ<

Figure 3. Block diagram of mmWave massive MIMO BS and UEs.

2.1. Channel Model

Generally, massive MIMO channel models are categorized in two types (i) analytical models and
(ii) physical models [4]. Analytical models are commonly used for the theoretical analysis of wireless
communication systems. The most commonly used analytical model is Kronecker channel model.
It is a correlation-based model and characterizes the MIMO channel matrix in terms of the separate
transmit and receive side spatial correlation matrices [24],

R H = E{vec{H}vec{H H }}
= Rtx ⊗ Rrx (11)

under the above assumptions, the channel model H is simpliﬁed to Kronecker model,

1/2
H = R1/2
rx KRtx (12)

where K ∼ CN (0, 1) is an i.i.d. unit variance MIMO channel matrix, Rtx and Rrx are the transmit
and receive corrrelation matrices, respectively. The transmit and receive correlation matrices are
given as [24],
Rtx = E{H H H}, Rrx = E{HH H } (13)

The physical models explicitly model wave propagation parameters like the complex amplitude,
DoD, DoA, and delay of an MPC [24,25]. MmWave propagation leads to limited spatial scattering
due to the high free-space pathloss. In addition, the large tightly packed antenna arrays lead to high
levels of antenna correlation. The sparse scattering and antennas spatial correlation makes many of the
commonly used statistical fading distributions inaccurate for mmWave channel modeling. Therefore,
we use extended Saleh-Valenzuela model, which accurately describes the mathematical structure
present in mmWave channels [26,27]. For simplicity, we assume that each scattering cluster around the
transmitter and receiver contributes a single propagation path [28].
In general, the mmWave MIMO channel matrix between the BS with Nt transmit antennas and
a user k with nr receive antennas in subchannel n, can be modeled as double directional channel,
7
Nt nr L
ρk,n L l∑
Hk,n = αk,n,l a(φk,n,l )b H (θk,n,l ), (14)
=0

where L is the total number of multipaths, αk,n,l is the complex gain of the l th path with i.i.d. CN (0, 1),
and ρk,n is the distance dependent pathloss between the BS and user k [29]. The LOS path is included
with l = 0. Moreover, a and b are the receive and transmit steering vectors, respectively. The variables
φk,n,l ∈ [0, 2π ) and θk,n,l ∈ [0, 2π ) are the l th path’s azimuth angles (boresight angles in the receive
array and transmit array) of arrival and departure, respectively. The steering vectors are given by
1
a(φk,n,l ) = √ [ a1 (φl ), ..., anr (φk,n,l )] T (15)
Nr

169
Electronics 2019, 8, 1061

1
b(θk,n,l ) = √ [b1 (θl ), ..., b Nt (θk,n,l )] T (16)
Nt
The elements of transmit and receive steering vectors are given by
dt
bi (θk,n,l ) = e− jωτi,l = e− j2π (i−1) λ sin(θk,n,l )
, i = 1, 2, ..., Nt (17)
dr
ai (φk,n,l ) = e− jωτi,l = e− j2π (i−1) λ sin(φk,n,l )
, i = 1, 2, ..., nr (18)

where λ is the wavelength, ω = 2π

λ , τi
is the beamforming delay, and dt and dr are the antenna spacing
at the transmitter and receiver, respectively.
The channel matrix in (14) can also be written in more compact form as
√
Hk,n = H
νAk,n Dk,n Bk,n (19)

where ν = NρL t nr
and, Ak,n and Bk,n consist of stacked steering vectors of AoA and AoD, respectively,
i.e., Ak,n = [a(φk,n,1 ), a(φk,n,2 ), . . . , a(φk,n,L )] and Bk,n = [b(θk,n,1 ), b(θk,n,2 ), . . . , b(θk,n,L )]. The matrix
Dk,n is a diagonal matrix, given as Dk,n = diag{αk,n,1 , αk,n,2 , . . . , αk,n,L }. The small scale fading at user
k in subchannel n in multipath component (MPC) is given by αk,n,l with zero mean and variance σk,n,l 2 .

Assume that each MPC is i.i.d. such that ∑lL=1 σk,n,l

2 = 1. We can express the channel model in (19) as
√
Hk,n = H
νAk,n Σk,n D̄k,n Bk,n , (20)

αk,n,l
where Σk,n = diag{σk,n,1
2 , σk,n,2
2 . . . , σk,n,L
2 } and D̄k,n = diag{ᾱk,n,1 , ᾱk,n,2 . . . , ᾱk,n,L } with ᾱk,n,l = σk,n,l
such that E{ᾱk,n,l } = 0 and E{ᾱ2k,n,l } = 1.
Substituting (20) in (13) and averaging over small scale fading, we get the transmit and receive
correlation matrices for user k in the subchannel n as

Rtx,k,n = νBk,n Σ2 Bk,n

H
, (21)

Rrx,k,n = νAk,n Σ2 Ak,n

H
, (22)

For mmWave massive MIMO systems with large number of antennas, the steering vectors are
asymptotically orthogonal to each other [6]:

a H (φk,n )a(φk,n ) ≈ 0,

b H (θk,n )b(θk,n ) ≈ 0. (23)

Moreover, in mmWave massive MIMO, acquisition of the instantaneous full CSI is not practical.
Instead, an average CSI in terms of [Ak,n ], [Bk,n ], and [Σk,n ] is a practical solution for the beamforming
design because the coherence time of the channel statistics based CSI is of the order of few seconds or
more as compared to the small scale of the order of milli-second [6].

2.2. Problem Formulation

The hybrid beamforming divides the beamforming matrix into two parts: covariance-based
pre-beamforming matrix F AB realized by analog beamformers and the reduced dimension MU-MIMO
digital precoding based on the effective channel H H F AB (omitting the subchannel subscript for
simplicity). We assume that K users are divided into G groups, such that, the group g contains
K g number of users. Since users are near the ground level and surrounded by the scatterers compared
to the scatterer-free elevated base-station, we assume one-ring model [1] and all users in group g

170
Electronics 2019, 8, 1061

experience the same azimuth center angle (θ g ) and angular spread (Δ g ). In this case, Rrx = I in (12),
therefore, the channel covariance matrix of each user in group g is given by [30]

R g = E{h gk h gHk } (24)

for which the eigenvalue decomposition gives

R g = U g Λ g U gH (25)

where U g ∈ C Nt ×r g is a tall unitary matrix (U g U gH = I) comprises the eigenvectors of R g and Λ g ∈

Rrg ×rg is diagonal matrix with r g nonzeros positive eigenvalues along the diagonal. The i, j − th
element of covariance matrix R g represents the correlation between the channel coefﬁcients antenna
element i and j as
θ g +Δ g
1
e− j2π λ (i− j) sin θ dθ,
d
[R g ]i,j = (26)
2Δ g θ g −Δ g
where d is the distance between antenna elements of ULA and λ is the wavelength of carrier frequency.
Using the Karhunen-Loeve representation, the channel vector of user k in group g is given as

h gk = U g Λ1/2
g z g = U g h̃ gk (27)

where z g ∈ C Nt ×1 ∼ CN (0, Ir g ) and h̃ gk is beam domain channel. For large Nt , U g tends to discrete
Fourier transform (DFT) matrix Δ Nt ∈ C Nt × Nt [31]. Each column of U g represents one direction of
angle-of-departure (AoD), i.e., a beam.
Alternatively, for the case, when dominant eigenvalues r̂ g ≤ r g , then, the channel matrix can be
written as ([13], Equation (5))
h gk = Nt1/2 R1/2
g z gk (28)

The limited feedback-based hybrid beamforming consists of analog pre-beamforming matrix

F gAB ∈ C Nt × NRF,g responsible for spatial group formation and inter-group interference mitigation;
NRF,g ×Sg
g ∈C
and the digital multi-users precoding matrix F DB for spatial multiplexing inside the group
and inter-user interference mitigation. Here, NRF,g is the number of RF chains for group g such that
Sg < NRF,g < r̂ g and Sg is the number of multi-carrier information symbols vectors for group g with
NRF = ∑G AB ∈ C Nt × NRF is
g=1 NRF,g and S = ∑ g=1 S g . The overall analog pre-beamforming matrix F
G

given by
F AB = [F1AB , ..., FG
AB
], (29)

and the overall digital beamforming matrix F DB ∈ C NRF × Ns is given by

F DB = diag[F1DB , ..., FG
DB
] (30)

and the overall channel matrix

H = [H1 , ..., HG ], (31)

where the channel matrix of group g is deﬁned as H g [h g1 , ..., h gKg ].

The analog pre-beamforming F gAB is based on the slowly varying channel covariance matrix R g
and can be implemented by the DFT matrix (when Nt is large), whereas, the digital beamformer F DB
g is
based on the instantaneous channel information of the reduced dimension effective channel H gH F gAB .
The overall effective channel is given by

171
Electronics 2019, 8, 1061

⎡ ⎤
H1H F1AB H1H F2AB · · · H1H FGAB
⎢ H AB
⎢ H2 F1 H2H F2AB · · · H2H FGAB ⎥⎥
HeHf f = H H F AB =⎢
⎢ .. .. .. .. ⎥
⎥ (32)
⎣ . . . . ⎦
HG F1AB
H HG F2AB
H · · · HG FG
H AB

The excessive pilot transmission in downlink and feedback in uplink of FDD system can be
reduced by only sending the group-wise average CSI based channel estimates in uplink. This is
accomplished by using the diagonal elements H gH F gAB as feedback information with the size of
K g × NRF,g for g = 1, ...G. The analog pre-beamforming is designed in such a way that the other
elements of matrix (32) H gH F gAB ≈ 0 for all g = g. This group-wise division creates virtual sectors,
each group corresponds to a virtual sector [30].
The second order channel statistics-based RF beamformer F AB remains the same across multiple
coherence blocks which gives the effective instantaneous channel between BS and user k as

H
hn,g k ,e f f
hn,g
H
F AB ,
k g
(33)

with hn,gk ,e f f ∈ C NRF,g ×1 . Therefore, channel statistics-based CSI sufﬁciently reduces the feedback
overhead on each user, otherwise, for instantaneous CSI, each user have to send the Nt × 1 size
of channel estimate on the uplink channel. The covariance of effective channel hn,g H is given by
k ,e f f
using (13) as,

H
E{hn,gk ,e f f hn,g
H
k ,e f f
} = E{F gAB hn,gk hn,g
H
F AB }
k g
H
= νF gAB Bk,n Σ2 Bk,n
H AB
Fg (34)

The analog beamformer consists of columns of the DFT matrix, which can be easily implemented
AB can be obtained by eigenvalue decomposition of channel
by phase shifter network. Therefore, Fn,g
covariance matrix. With the group-wise hybrid beamforming, the received signal y g,n for group g in
subchannel n becomes

y g,n = H g,n F g F g,n s g,n + H g,n

H AB DB H
∑ g ,n s g ,n + w g,n
F gAB F DB (35)
g =g

and the received signal of user k in group g in subchannel n is given by

y gk ,n = h gHk ,n F gAB f DB
gk ,n s gk ,n + h gk ,n
H
∑ F gAB f DB
g ,n s gk ,n
k
k =k
2 34 5
Inter-user interference
+ h gHk ,n ∑ ∑ FgAB fDB
g j ,n s g j ,n + w gk ,n (36)
g =g j
2 34 5
Inter-group interference

The received signal to interference and noise ratio (SINR) at the user k in group g and subchannel
n is given by

The spectral efﬁciency of user k in group g and subchannel n is expressed as

R gk ,n = Ψ gk ,n log2 (1 + SI NR gk ,n ), (38)

172
Electronics 2019, 8, 1061

where Ψ gk ,n is the binary variable such that it is equal to 1 if user k is selected in group g in the
subchannel n. In order to achieve balance tradeoff between throughput and fairness [32], we use
proportional fairness (PF) based throughput maximization. We deﬁne per user proportional fairness
metric as
R gk ,n (t)
U (F gAB , f DB
gk ,n ) = , ∀n, gk , (39)
R̄ gk ,n (t)
where R̄ gk ,n (t) is average throughput (moving average) over a past window of length Tw = 1/α [33], as

R̄ gk ,n (t) = αR gk ,n (t − 1) + (1 − α) R̄ gk ,n (t − 1), (40)

The large number of antennas in massive MIMO systems enable the use of the eigenmodes of the
channel covariance matrix, i.e., Bk,n comprises of the columns of the DFT matrix [6]. DFT-based beams
with Nt = 16 and Nt = 64 are shown in Figure 4a,b, respectively.

(a) (b)
Figure 4. DFT-based beams in a 120 sector. (a) DFT-based beams in a 120 sector with Nt = 16;
(b) DFT-based beams in a 120 sector with Nt = 64.

The beam steering matrix Bk,n consists of selected columns of Nt × Nt DFT matrix Δ Nt such that

F gAB = Bk,n = Δ Nt Υn (41)

where Δ Nt = [b1 , b2 , ..., b Nt ] consisting of all eigenmodes and Υn is an Nt × r R binary beam selection
matrix, with r R is the rank of the channel covariance matrix. The selection matrix Υn ∈ C Nt ×rR
with only a single one on each row and column such that ∑i [Υn ]i,j = 1 ∀ j. Now we formulate our
optimization problem for joint spatio–radio resource allocation and precoders design with the objective
to maximize the utility function as

G Nf Kg
max
F gAB ,f DB
∑ ∑ ∑U F gAB , f DB
gk ,n (42)
g ,n
k g =1 n =1 k =1
subject to
H H
C1 : tr (FnDB F AB F AB FnDB ) ≤ Pn , ∀n
C2 : rank (F AB F DB ) ≤ NRF ,
Kg
C3 : ∑ Ψk,n ≤ Kg , ∀ g, n,
k =1
C4 : tr (Υn ) ≤ G, ∀n,
C5 : [Ψk,n ], [Υn ] ∈ {0, 1}.

173
Electronics 2019, 8, 1061

The above optimization problem is a mixed integer programming (MIP) problem with coupling
between the digital and RF precoders in the power constraint. This MIP problem is NP-hard [14].

3. Relaxed-Convex Transformation
Though the above MIP optimization problem is NP-hard, it can be transformed to a relaxed
convex optimization problem by (i) relaxing the binary integer constraints to real number between 0
and 1 [14], and (ii) decoupling the digital and analog precoders. For decoupling purpose, we make
H
use of change of variables FnDB = (F AB F AB )− 2 F̃nDB , where F̃nDB is the equivalent digital precoder [34].
1

Thus, the problem in (42) can be written as

G Nf Kg
max
F gAB ,f̃ DB
∑ ∑ ∑ Ũgk ,n F gAB , f DB
gk ,n (43)
g ,n
k g =1 n =1 k =1
subject to
DB H DB
C1 : tr (F̃n F̃n ) ≤ Pn , ∀n
C2 : rank (F̃ DB
) ≤ NRF ,
Kg
C3 : ∑ Ψk,n ≤ Kg , ∀ g, n,
k =1
C4 : tr (Υn ) ≤ G, ∀n,
C5 : 0 ≤ [Ψk,n ] ≤ 1, 0 ≤ [Υn ] ≤ 1,

For a given RF precoder F AB and the knowledge of perfect CSI at the base-station, the digital
precoder can be obtained by conventional MU-MIMO techniques, e.g., the zero-forcing and block
diagonalization [15].
For the digital precoder, we adopt the ZF precoder for no multiuser interference among the
users in each groups. The beamforming vector of user k is chosen to be orthogonal to the effective
channel vectors of all the other users in the group. Zero-forcing is a suboptimal but low complexity
approach within the linear precoders’ class. ZF precoder is asymptotically optimal among all downlink
beamforming techniques in high SNR region. It guarantees high spectral efﬁciency for large-scale
antennas with low-complexity linear processing [35]. For Nt Nr , it has shown that zero-forcing
beamforming can achieve up to 98% of the non-linear dirty paper coding (DPC) capacity [36].
In order to make this paper self-contained, we describe the block diagonalization brieﬂy. Since digital
precoder is used to mitigate the multiuser interference within a groups and all groups are independent,
we omit the subscript g. First we consider the downlink transmission over one subchannel n with
the general case of BS with Nt antennas and Kn users with nk antennas each, such that ∑kK=1 nk = Nr .
The downlink channel on the subchannel n is expressed as Nr × Nt matrix,

Hn,e f f = [H1,n,e
T
f f , ..., HKn ,n,e f f ]
T T
(44)

For user k, we deﬁne the following ( Nr − nk ) × Nt channel matrix

Hk,n,e f f = [H1,n,e
T
f f , ..., HK −1,n,e f f , Hk +1,n,e f f , ..., HKn ,n,e f f ]
T T T T
(45)

Let the rank of Hk,n,e f f be denoted by rk,n , then the nullspace of Hk,n,e f f has dimension
Nt − rk,n ≥ nk . Performing the SVD of each user’s channel matrix in subchannel n leads to the following

(1) (0)
Hk,n,e f f = Uk,n Σk,n Vk,n
H
= Uk,n Σk,n [Vk,n Vk,n ] H , (46)

where Uk,n and Vk,n are the unitary matrices. The columns of Uk,n are the left singular vectors of
Hk,n,e f f , the columns of Vk,n are the right singular vectors of Hk,n,e f f , and Σk,n is a diagonal matrix in

174
Electronics 2019, 8, 1061

(1)
which the diagonal entries are the singular values of Hk,n,e f f . In the last equality of (46), Vk,n holds the
(0)
ﬁrst rk,n right singular vectors of Hk,n,e f f and Vk,n contains the Nt − rk,n singular vectors of Hk,n,e f f
(0)
which are in the nullspace of Hk,n,e f f . The columns of Vk,n are best suited for user k beamforming
DB , because they will provide zero interference at other UEs. Usually V (0)
matrix Fk,n k,n contains more
(0)
number of columns than the nk , therefore we use some linear combinations of the columns of Vk,n to
make at most nk columns.

(0) Σk,n 0 (1)
(0) H
Hk,n,e f f Vk,n = Uk,n Vk,n Vk,n , (47)
0 0

(0) (0)
where Hk,n,e f f Vk,n gives the matrix with columns as the linear combinations of the columns of Vk,n .
(0)
The right hand side of the equation is the SVD of Hk,n,e f f Vk,n , where Σk,n is rk,n × rk,n diagonal
(1) (0)
matrix and represents the rk,n singular vectors with nonzero singular values of Hk,n,e f f Vk,n .
Vk,n
The Equation (47) can also be written as,

(0) (1) H
Hk,n,e f f Vk,n = Uk,n Σk,n Vk,n
(1) H (0) H
Hk,n,e f f = Uk,n Σk,n Vk,n Vk,n (48)

The transmit beamforming matrix that maximizes the user k throughput without any inter-user
interference is obtained as,
(0) (1)
DB
Fk,n = Vk,n Vk,n (49)

The transmit digital beamforming matrix for subchannel n is deﬁned as

FnDB = [F1,n
DB
, ..., FKDB
n ,n
]P1/2
n , (50)

HB = I,
B
where Fk,n Fk,n 1 ≤ k ≤ Kn and Pn is a block diagonal matrix whose elements scale the power
allocated to each interference-free virtual subchannel for all UEs. The receive combining matrix for this
user is Uk,n [37].
In the case of single antenna users, complete diagonalization is achieved entirely at the BS by
channel inversion, i.e., FnDB = (Hn,e
H
f f ) , where ( Hn,e f f ) is the pseudo-inverse of Hn,e f f [38].
† H † H

†
FnDB = β n Hn,eH
ff
†
[f1,n
DB DB
, ..., fK,n ] = β n [h1,n , ..., hK,n ] H F AB (51)

where β n is a normalization factor chosen to satisfy the power constraint and is given by

K
β2n = (52)
F AB FnDB 2F

Using the deﬁnition of the pseudo-inverse, we get,

−1
FnDB = β n Hn,e f f Hn,e
H
f f + NRF I NRF Hn,e f f (53)

175
Electronics 2019, 8, 1061

where is the regularization parameter, = 0 for ZF precoding and = Ns

NRF η for regularized ZF,
with η = PT,n /σ2 . Lastly, introducing the group subscript again, the SINR of user gk is given by

and the PF sum rate is calculated as

G Nf Kg
Û = ∑ ∑ ∑ Ũgk ,n . (55)
g =1 n =1 k =1

4. Suboptimal Solution
Joint optimization of analog and digital beamformers is challenging because they use different
channel information for the design of analog and digital beamformers. Hybrid beamforming methods
consider decoupled designs of analog and digital beamforming to reduce the complexity of joint
optimization, but the main challenge remains the use of different channel information. To approximate
the optimal solution to this mixed integer programming problem, we summarize our proposed
algorithm below:
The analog precoder is formed by selecting K g columns of DFT matrix of eigenvectors of channel
covariance matrix R g of users’ group g in (41) to minimize the inter-group interference I g ,

min I g (Υ) (56)

Υ
subject to
tr (Υn ) ≤ G, ∀n,

where I g = ∑iG=1 ∑G
j=1 | Hi F j | . To solve the MIP problem, we divide the solution into two parts.
H AB 2

j =i
In the first part, we get the analog precoder using the selected columns of the DFT matrix which
maximize the PF sum-rate. The inherent benefit of the DFT matrix is its constant modulus which enables
the use of analog phase shifters and RF switches to realize the analog beamforming. In the second part,
for the given analog precoder, intra-group users scheduling is performed and a ZF digital precoder is
designed to maximize the sum-rate utility function. The decoupling of the analog and digital precoders
design makes the solution suboptimal but tractable [34]. The joint hybrid beamforming and user
scheduling Algorithm 1 takes K g , N f , NRF , Nt , and K. It generates the analog beamforming matrix F AB ,
digital beamforming matrix F DB , and the set of users in each group. The first part of the algorithm
(line 9 to line 19) forms DFT/eigenmodes-based analog beamforming using limited statistical CSI
feedback from the users. Beam and users pairing within each group is taken place in this part of the
algorithm. The while loop at line 12 executes till all the binary combinations in Nt × Nt are exhausted
with the condition that each column contain exactly one binary 1 and total number of 1s are equal to
the number of streams (or number of RF chains). The second part (starts from line 39) assigns the radio
resources to users to maximize the utility function.

176
Electronics 2019, 8, 1061

Algorithm 1 Joint Resource Allocation and Hybrid Beamforming Design Algorithm.

1: Inputs
2: K g : Number of users per group
3: N f : Number of subchannels
4: Kn : Number of UEs to be scheduled in subchannel n
5: NRF : Number of RF chains
6: Nt : Number of transmit antennas
7: Initialization
8: K g = {1, ..., K g }, U (FnDB )last = 0, Υlast
g = 0, Kn = {}, ∀n

{Average CSI-based Beam Selection and RF Precoder Design}

9: Evaluate channel covariance matrix R g from CSI at users: R g = E{h gk h gHk }
10: Find transmit steering Bk,n by Eigenvector based unitary matrix U g of R g or DFT matrix Δ Nt .
11: while g ≤ G do

12: while tr (Υ g ) ≤ NRF,g , set {[Υ g ]i,i , ..., [Υ g ] Nt ,Nt } = {0, 1} do

13: Calculate U (Υ g ) , employing zero forcing Fn,g

14: if U (Υ g ) ≥ U (Υlast
g ) then

15: Υlast
g ← Υg
16: end if
17: end while
18: end while
19: Get F gAB = Δ Nt Υlast
g

{Instantaneous CSI-based Users Selection and Digital Precoder Design}

20: while n ≤ N f do

21: while b ≤ NRF,g do

22: while k ≤ K g do

23: Compute U (FnDB ) = U (FnDB ) + U (fkDB,n ), ∀ k ∈ Kg

24: k ∗ = arg max{U (FnDB )}
k
25: Ψk ∗ ,n = 1
26: Update FnDB and U (FnDB )update with user k ∗
27: if U (FnDB )updated ≥ U (FnDB )last then

28: Kn = Kn ∪ {k ∗ }
29: K g = K g − {k ∗ }
30: U (FnDB )last = U (FnDB )updated
31: k++
32: else

33: break
34: end if
35: end while
36: b++
37: end while
38: n++
39: end while
40: Stack the beamforming matrices F DB = [F1DB , ..., F DB
N ] f
41: Output
42: F AB = [F1AB , ..., FG
AB ]

43: F DB = [F1DB , ..., F DB

N ] f
44: [K1 , ..., K N f ]

177
Electronics 2019, 8, 1061

The Algorithm 1 is illustrated in ﬂowchart Figure 5.

No
while n≤Nf
Initialization
Yes

Rg at users No
while b≤NRF,g

FAB=Bk,n Yes

No
while k≤Kg
No
while g≤G
Yes

Yes Compute U(FnDB)

k*’=arg maxk’ {U(FnDB)}
No Ψk*’,n=1
while tr(Υ )≤NRF,g Update U(FnDB)update with user k*’

Yes

No
U(Υg) If U(FnDB)update
≥U(FnDB)last

No Yes
If U(Υg)≥ U(Υg)last
Кn=КnU{k*’}
Кg=Кg-{k*’}
Yes U(FnDB)last=U(FnDB)update
k++
U(Υg)last ← U(Υg)
b++

FgAB=ΔNtΥg(Last) n++

FDB={F1DB,…,FNfDB}

Output:
FAB={F1AB,…,FGAB}
DB
F ={F1DB,…,FNfDB}
К={К1,…,КNf}

Figure 5. Flowchart illustration of Algorithm 1.

5. Machine Learning: K-Means Based Optimal Users Grouping for Analog Beamforming
In this section, we use machine learning technique to group the users. Then, the DFT based ﬁxed
switched-beams are used to realize the analog beamforming matrix. The joint users scheduling and
hybrid beamforming architecture with ML-based users grouping is shown in Figure 6.

178
Electronics 2019, 8, 1061

User 1
FDB NRF RF FAB Nt
Ns NRF
ZF Chains DFT

Kg Selected
users beams Υg

User K
K-Mean ML
Intra-group G groups of Kg grouping
Users scheduling users per group Location coordinates

Low dimensional effective channel

ch
h feedback

Base station

Figure 6. Joint users scheduling and hybrid beamforming architecture with ML-based users grouping.

Machine learning algorithms can broadly be divided into two main categories, namely supervised
learning and unsupervised learning algorithms. The former class of algorithms learn by training on
the input labeled examples, called training dataset, {( x (1) , y(1) ), ( x (2) , y(2) ), ( x (3) , y(3) ), ..., ( x (m) , y(m) )},
where the ith example ( x (i) , y(i) ) consists of the ith instance of feature vector x (i) and the corresponding
label y(i) . Given a labeled training dataset, these algorithms try to find the decision boundary that
separates the positive and negative labeled examples by fitting a hypothesis to the input dataset.
Unsupervised machine learning algorithms, on the other hand, are given an unlabeled input dataset.
These algorithms are used for extracting information or features from the dataset. These features might
be related, but not confined, to the underlying structures or patterns in the input data, relationships in
data items, grouping/clustering of data items, etc. Discovered features are meant to provide a deeper
insight into the input dataset that can subsequently be exploited for achieving specific goals. Clustering
algorithms make an important part of unsupervised learning where the input examples are grouped
into two or more separate clusters based on some features. The K-Means (KM) algorithm, is probably
the most popular clustering algorithm. It is an iterative algorithm that starts with a set of initial
centroids given to it as input. During each iteration, it performs the following two steps.

1. Assign Cluster: For every user, the algorithm computes the distance between the user and every
centroid. The user is then associated to the cluster with the closest centroid. During this step,
a user might change its association from one cluster to another one.
2. Recompute centroids: Once all users have been associated to their respective cluster,
the new position of centroid for every cluster is then calculated.

Figure 7a depicts how the cluster centroids keep moving across iterations until the system
stabilizes for an example network consisting of thirty users being grouped in five clusters. The system
becomes stable in only five iterations and the final cluster layout is shown in Figure 7b.
Let us define the following notations to be used later in this section.

K = Total number of clusters being formed.

x (i) = Location coordinates of user u(i) . In our case, x (i) ∈ IR2
c(i) = Cluster to which the user u(i) is currently associated.
μk = Centroid of kth cluster, μk ∈ IR2
μc(i) = Centroid of the cluster to which the user u(i) is currently associated.

179
Electronics 2019, 8, 1061

Now the cost function J can be deﬁned as

m
1
J (c(1) , c(2) , ..., c(m) , μ1 , μ2 , . . . , μK ) = ∑ ||x(i) − u(ci ) ||2 (57)
m i =1

with the following optimization objective function.

min J (c(1) , c(2) , ..., c(m) , μ1 , μ2 , . . . , μK )

c(1) ,,...,c(m) ,μ1 ,...,μK

It may be pointed out that Equation (57) allows us to compare multiple clustering layouts based
on their cost and select the one with the lowest cost. The above optimization objective function
constitutes a non-convex and NP-hard problem because it has many possible local minima and
integer optimization variable c(i) . The KM algorithm heuristically optimize this function by alternate
minimization method. It iterates between two steps (Assign cluster and Recompute centroids) as
described above.

(a) (b)

Figure 7. Change in position of centroids as K-Means clustering algorithm progresses. (a) shows
the transition of cluster centroids (shown as crosses) up to iteration 5, whereas, (b) shows only
the ﬁnal stable state after iteration 5. In the ﬁgures, cross-sign represents cluster centroids and the
colored-circle-sign represents the user associated with the same group or cluster.

In this section, we use the KM algorithm for optimal clustering of m users competing for resources
in a particular cell. The clustering is performed based on their geographic location, thus our input
dataset {u(1) , u(2) , u(3) , ..., u(m) } has m vectors u(i) , 1 ≤ i ≤ m, consisting of location coordinates,
of ith user. For the sake of simplicity, we assume these users are deployed in a two dimensional
(i ) (i )
area, i.e., a plane and so u(i) = ( x1 , x2 ), i.e., an ordered pair of location coordinates. Our clustering
algorithm is summarized in Algorithm 2.
The proposed algorithm takes the location coordinates of m users as input. It also takes
two numbers mink and maxk as additional input. The algorithm outputs the best number of clusters,
k, such that mink ≤ k ≤ maxk , and corresponding members of each cluster. It starts with k = mink
and randomly selects k user locations as the initial centroids (line 6). It assigns the closest centroid
to each user (line 8) and then computes new centroids by calculating the center/average location of
all nodes in each cluster (line 11). So, in effect, the location of centroids keeps moving in successive
iterations. It repeats the above two steps until the change in centroid positions is zero or negligible. We
repeat the test maxt times with a new set of randomly chosen initial centroids every time. During every
test, the discovered centroids, corresponding centroid assignment to users, and the cost are saved
(lines 14–16) for later comparison. After running the loop for maxt times, we select and store the best
k centroids resulting from the test with the lowest cost while discarding the remaining (lines 19–21).

180
Electronics 2019, 8, 1061

The same is repeated for the next value of k, i.e., k = k + 1, until k > maxk . At the end we have
cnt = maxk − maxk vectors μk , one for each value of k, the corresponding assignment vector ak and
cost ck . Finally, we choose the vector μ having the lowest cost and corresponding assignment vector
a among cnt stored cases. That is the best number of clusters and corresponding centroids that the
algorithm found.

Algorithm 2 K-Means based users grouping algorithm.

1: cnt = 0
2: for k = mink maxk do

3: cnt = cnt + 1
4: for t = 1 : maxt do

5: repeat

6: Randomly choose initial k centroids μ1 , μ2 , μ3 , ..., μk

7: for i = 1 : m do

8: a(i) = j, 1 ≤ j ≤ k, such that μ j is the centroid closest to u(i)

9: end for
10: for l = 1 : k do

11: μl = mean of all users/points u(i) assigned to lth centroid

12: end for
13: until converges
14: μ ( t ) = ( μ1 , μ2 , μ3 , . . . , μ k )
15: a ( t ) = ( a (1) , a (2) , a (3) , . . . , a ( m ) )
16: c(t) = cost(μ1 , μ2 , μ3 , . . . , μk )
17: end for
18: idx = argmin {c(t), 1 ≤ t ≤ maxt }
19: μk (k) = μ(idx) , 1 ≤ idx ≤ maxt
20: ak (k) = a(idx), 1 ≤ idx ≤ maxt
(k)
21: ck = c(idx) , 1 ≤ idx ≤ maxt
22: end for
(k)
23: index = argmin {ck , 1 ≤ k ≤ cnt}
24: μ = μk (index) , 1 ≤ index ≤ cnt
25: a = ak (index) , 1 ≤ index ≤ cnt
26: n = index

After the groups formation, BS sends this information to all users, where users use this information
to form reduced average statistical CSI. For example, a user in a group of 5, needs to send the average
statistical CSI only after 1/5 of regular feedback interval time.

6. Simulation Results
Consider the downlink of a multiuser massive MIMO single cell with three 120 degree sectors.
We neglect inter-sector interference and focus on a single 120 degree sector served by a ULA of Nt = 64
isotropic antennas at BS. The users grouping forms virtual sectors inside 120 physical sector.

181
Electronics 2019, 8, 1061

In simulation, the results are obtained by averaging over 100 drops. In each drop we randomly
generate spatial correlation matrices R g . For each realization of spatial correlation matrix R g ,
we simulate 1000 realizations of instantaneous channel H.
The joint spatio–radio scheduling and hybrid precoder scheme first forms the users groups and
then selects the beams that maximizes the sum-rate through downlink training process. Secondly,
it calculates the ZF based digital precoder using low dimensional effective channel feedback from
the users.
Figure 8 shows the CDF of the non-zero eigenvalues of channel covariance matrix. Notice that
approximately 50% of the non-zero eigenvalues are close to zero. The sum-rate increases as the number
of groups increases at the cost of increased feedback overhead as shown in Figure 9. Using machine
learning technique in Section 5 we can get optimal number of groups from channel covariance feedback.
This results in increased sum-rate with substantial reduced feedback. The optimal G = 3 gives 27.6%
increase in sum-rate compared to when G = 1 and 62.5% decrease in feedback overhead compared
to G = 8. The comparison of performance of ML-based users grouping with previous work cannot
be provided because there is no previous work that uses ML-based technique to reduce the CSI
feedback overhead in massive MIMO systems. Many papers use users grouping in massive MIMO
hybrid beamforming [3,5,39,40], but they do not utilize ML-based users grouping. Therefore, we have
compared our proposed solution with two benchmarks of full-CSI (G = K) and coarse-CSI (G = 1).
Figure 10 shows sum-rate with number of users at 10dB SNR. For a fixed number of groups G = 3,
the increase in number of users, increases number of users per group. Due to the fixed number of
groups, the feedback overhead remains the constant. Sum-rate is increasing with users because we
assumed NRF = Ns = K. If we fix the number of RF chains to some hardware limit, then the sum-rate
will saturate at specific number of users. It can be seen in Figure 10, that increasing number of users
per group decreases the slop of the sum-rate for limited CSI schemes. This decrease is due the increase
in intra-group interference.
Sum-rate also depends on the number of RF chains but this dependence is not linear as shown in
Figure 11. This figure shows sum-rate variation with number of RF chains NRF when Ns = 8, K = 8,
Nt = 64, and SNR = 10 dB. Sum rate increases with number of RF chains because it yields better
conditioned effective channel matrix. It can be seen that the spectral efficiency does not increase
monotonically with NRF and saturates at NRF = Nt where hybrid precoding is turned to the pure
digital precoding. The increase in spectral efficiency with the number of RF chains comes at the cost of
higher dimensional effective channel feedback overhead and power consumption in RF chains.
The spectral efficiency of the proposed scheme also varies with number of transmit antennas as
shown in Figure 12. In the figure, NRF = Ns = K = 8, SNR = 10 dB, and BS has 16, 64, 128 or 256 ULA
antennas. The performance gain increases with the increase in number of transmit antennas because
large antennas array increases the resolution of the transmit beams (also depicts in figure 4) and, hence,
decreases the potential of inter-beams interference.
In general, the spectral efficiency is a function of SNR and for the SNR = 10 dB, our ML-based
users grouping and hybrid beamforming scheme gives 27.6% increased sum-rate at the cost of 33.3%
extra feedback overhead as compared to the coarse-CSI case (G = 1). Our proposed scheme incurs 62.5%
reduced feedback at the cost of 25.2% reduction in sum-rate as compared to the full-CSI case (G = K).

182
Electronics 2019, 8, 1061

(PSLULFDO&')

&')RI(LJHQYDOXHV

(LJHQYDOXHV

Figure 8. CDF of non-zero eigenvalues of channel covariance matrix R g for Nt = 64.

*
* .
RSWLPDO*

6XPUDWH ESV+]

615 G%

Figure 9. Sum-rate Vs SNR with different number of groups, K = 8.

2SWLPDOJURXSLQJ/LPLWHG&6,
1RJURXSLQJ)XOO&6,
2QHJURXS/LPLWHG&6,

6XPUDWH ESV+]

8VHUV

Figure 10. Sum-rate Vs number of users with different number of groups and CSI, SNR = 10 dB.

183
Electronics 2019, 8, 1061

*
* .
RSWLPDO*

6XPUDWH ESV+]

1XPEHURI5)FKDLQV15)

Figure 11. Sum-rate Vs number of RF chains. Number of users K = 8, number of transmit antennas
Nt = 64, and SNR = 10 dB.

* ZLWK615 G%
* .ZLWK615 G%
6XPUDWH ESV+]

RSWLPDO*ZLWK615 G%
* .ZLWK615 G%
RSWLPDO*ZLWK615 G%

* ZLWK615 G%

1XPEHURIWUDQVPLWDQWHQQDV

Figure 12. Sum-rate Vs number of transmit antennas. The number of RF chains at BS is ﬁxed at 8,
number of users is 8 and the results are obtained for SNR = 10 dB and SNR = 20 dB.

7. Conclusions
This paper studied the limited feedback two-stage hybrid beamformimg for decomposing the
precoding matrix at the base-station. The huge channel state information feedback of massive MIMO
has been reduced by the channel covariance-based RF precoding and beam selection. The well-known
regularized block diagonalization can mitigate the inter-group interference, but requires substantial
feedback. We used K-mean algorithm based unsupervised machine learning technique for users
grouping and channel covariance-based eigenmodes/discrete Fourier transforms to reduce the
feedback overhead and designed a simpliﬁed analog precoder. The digital precoder is designed with
joint optimization of intra-group user utility function. It has been shown that more than 50% feedback
overhead is reduced by the eigenmodes-based analog precoder design. The spatio–radio resources
scheduling and limited feedback-based hybrid precoding increases the sum-rate by 27.6% compared to
the sum-rate of one-group case at the cost of 33.3% extra feedback overhead, and reduces the feedback
overhead by 62.5% at the cost of 25.2% reduction in sum-rate, compared to the full CSI feedback.

Author Contributions: H.K. and I.A., contributed the key idea and deﬁned the main system model. G.B. and
M.A. assisted with the system model and the mathematical analysis. I.A. developed the problem formulation.

184
Electronics 2019, 8, 1061

G.B. and H.K. developed the model for users grouping. H.K., I.A., G.B. and M.A. analysed all the results and
added the relevant discussions. All authors contributed to the paper write up.
Funding: This research was supported by King Abdul Aziz City for Science and Technology Project under Grant
PC-37-66.
Conﬂicts of Interest: The authors declare no conﬂict of interest.

Abbreviations
The following abbreviations are used in this manuscript:

ADC Analog-to-digital converter

AWGN Additive white Gaussian noise
BS Base station
CDF Cumulative distribution function
CSI Channel state information
DAC Digital-to-analog converter
DFT Discrete Fourier transforms
DoA Direction of arrival
DoD Direction of departure
DPC Dirty paper coding
FDD Frequency division duplex
HBwS Hybrid beamforming with switches
LOS Line-of-sight
MIMO Multiple-input multiple-output
MIP Mixed integer programming
ML Machine learning
MMSE Minimum mean squared error
MPCs Multi path components
MRT/MRC Maximum ratio transmission/Maximum ratio combining
MU-MIMO Multi-User MIMO
OFDM Orthogonal frequency division multiplexing
RBD Regularized block diagonalization
RF Radio frequency
SC Subspace construction
SINR Signal-to-interference-plus-noise ratio
SVD Singular value decomposition
SVM Support vector machine
TDD Time division duplex
UE User equipment
ULA Uniform linear array

References
1. Adhikary, A.; Nam, J.; Ahn, J.Y.; Caire, G. Joint spatial division and multiplexing-The large-scale array regime.
IEEE Trans. Inf. Theory 2013, 59, 6441–6463. [CrossRef]
2. Kutty, S.; Member, S.; Sen, D. Beamforming for millimeter wave communications: An inclusive survey. IEEE
Commun. Surv. Tutor. 2016, 18, 949–973. [CrossRef]
3. Nam, J.; Adhikary, A.; Ahn, J.Y.; Caire, G. Joint Spatial Division and Multiplexing: Opportunistic
Beamforming, User Grouping and Simpliﬁed Downlink Scheduling. IEEE J. Sel. Top. Signal Process. 2014,
8, 876–890. [CrossRef]
4. Adhikary, A.; Al Safadi, E.; Samimi, M.; Wang, R.; Caire, G.; Rappaport, T.; Molisch, A. Joint Spatial Division
and Multiplexing for Wave Channels. IEEE J. Sel. Areas Commun. 2014. [CrossRef]
5. Li, Z.; Han, S.; Sangodoyin, S.; Wang, R.; Molisch, A.F. Joint Optimization of Hybrid Beamforming for
Multi-User Massive MIMO Downlink. IEEE Trans. Wirel. Commun. 2018, 1276, 1–14. [CrossRef]

185
Electronics 2019, 8, 1061

6. Li, Z.; Han, S.; Molisch, A.F. Optimizing Channel-Statistics-Based Analog Beamforming for Millimeter-Wave
Multi-User Massive MIMO Downlink. IEEE Trans. Wirel. Commun. 2017, 16, 4288–4303. [CrossRef]
7. Ratnam, V.; Molisch, A.; Bursalioglu, O.Y.; Papadopoulos, H.C. Hybrid Beamforming with Selection for
Multi-user Massive MIMO Systems. IEEE Trans. Signal Process. 2018, 66, 4105–4121. [CrossRef]
8. Liu, A.; Lau, V. Phase only RF precoding for massive MIMO systems with limited RF chains. IEEE Trans.
Signal Process. 2014, 62, 4505–4515. [CrossRef]
9. Li, Z.; Han, S.; Molisch, A.F. User-Centric Virtual Sectorization for Millimeter-Wave Massive MIMO
Downlink. IEEE Trans. Wirel. Commun. 2018, 17, 445–460. [CrossRef]
10. Ahmed, I.; Khammari, H.; Shahid, A.; Musa, A.; Kim, K.S.; Poorter, E.D.; Moerman, I. A Survey on Hybrid
Beamforming Techniques in 5G: Architecture and System Model Perspectives. IEEE Commun. Surv. Tutor.
2018, 20, 3060–3097. [CrossRef]
11. Park, S.; Alkhateeb, A.; Heath, R.W. Dynamic Subarrays for Hybrid Precoding in Wideband mmWave
MIMO Systems. IEEE Trans. Wirel. Commun. 2017, 16, 2907–2920. [CrossRef]
12. Zhu, D.; Li, B.; Liang, P. A Novel Hybrid Beamforming Algorithm with Uniﬁed Analog Beamforming by
Subspace Construction Based on Partial CSI for Massive MIMO-OFDM Systems. IEEE Trans. Commun. 2017,
65, 594–607. [CrossRef]
13. Jeon, Y.; Song, C.; Lee, S.R.; Maeng, S.; Jung, J.; Lee, I. New Beamforming Designs for Joint Spatial Division
and Multiplexing in Large-Scale MISO Multi-User Systems. IEEE Trans. Wirel. Commun. 2017, 16, 3029–3041.
[CrossRef]
14. Jiang, Z.; Chen, S.; Zhou, S. Joint User Scheduling and Beam Selection Optimization for Beam-Based Massive
MIMO Downlinks. IEEE Trans. Wirel. Commun. 2018, 17, 2190–2204. [CrossRef]
15. Ahmed, I.; Khammari, H.; Shahid, A. Resource allocation for transmit hybrid beamforming in decoupled
millimeter wave multiuser-MIMO downlink. IEEE Access 2017, 5, 170–182. [CrossRef]
16. O’Shea, T.J.; Hoydis, J. An Introduction to Deep Learning for the Physical Layer. IEEE Trans. Cogn. Commun.
Netw. 2017, 3, 563–575. [CrossRef]
17. Kim, K.; Lee, J.; Choi, J. Deep Learning Based Pilot Allocation Scheme (DL-PAS) for 5G Massive
MIMO System. IEEE Commun. Lett. 2018, 22, 828–831. [CrossRef]
18. Wen, C.K.; Shih, W.T.; Jin, S. Deep Learning for Massive MIMO CSI Feedback. IEEE Wirel. Commun. Lett.
2018, 1–4. [CrossRef]
19. Joung, J. Machine Learning-Based Antenna Selection in Wireless Communications. IEEE Commun. Lett.
2016, 20, 2241–2244. [CrossRef]
20. Wang, J.B.; Wang, J.; Wu, Y.; Wang, J.Y.; Zhu, H.; Lin, M.; Wang, J. A Machine Learning Framework for
Resource Allocation Assisted by Cloud Computing. IEEE Netw. 2018, 32, 144–151. [CrossRef]
21. Calabrese, F.D.; Wang, L.; Ghadimi, E.; Peters, G.; Soldati, P. Learning Radio Resource Management in 5G
Networks: Framework, Opportunities and Challenges. arXiv 2016, arXiv:1611.10253.
22. Ahmed, I.; Khammari, H. Joint Machine Learning based Resource Allocation and Hybrid Beamforming
Design for Massive MIMO Systems. In Proceedings of the IEEE Globecom Workshops (GC Wkshps),
Abu Dhabi, UAE, 9–13 December 2018; pp. 1–6.
23. Rahman, M.S.; Josiam, K. Low complexity RF beam search algorithms for millimeter-wave systems.
In Proceedings of the 2014 IEEE Global Communications Conference (GLOBECOM 2014), Austin, TX, USA,
8–12 December 2014; pp. 3815–3820. [CrossRef]
24. Almers, P.; Bonek, E.; Burr, A.; Czink, N.; Debbah, M.; Degli-Esposti, V.; Hofstetter, H.; Kyölsti, P.; Laurenson,
D.; Matz, G.; et al. Survey of channel and radio propagation models for wireless MIMO systems. Eurasip J.
Wirel. Commun. Netw. 2007, 2007, 019070. [CrossRef]
25. Andrews, J.G.; Bai, T.; Kulkarni, M.; Alkhateeb, A.; Gupta, A.; Heath, R.W. Modeling and Analyzing
Millimeter Wave Cellular Systems. IEEE Trans. Commun. 2017, 65, 403–430. [CrossRef]
26. Ayach, O.E.; Rajagopal, S.; Abu-Surra, S.; Pi, Z.; Heath, R.W. Spatially sparse precoding in millimeter wave
MIMO systems. IEEE Trans. Wirel. Commun. 2014, 13, 1499–1513. [CrossRef]
27. Sohrabi, F.; Yu, W. Hybrid digital and analog beamforming design for large-scale MIMO systems.
In Proceedings of the IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP),
Brisbane, Australia, 19–24 April 2015; pp. 2929–2933. [CrossRef]

186
Electronics 2019, 8, 1061

28. Ayach, O.E.; Heath, R.W.; Abu-Surra, S.; Rajagopal, S.; Pi, Z. Low complexity precoding for large millimeter
wave MIMO systems. In Proceedings of the IEEE International Conference on Communications (ICC),
Ottawa, ON, Canada, 10–15 June 2012; pp. 3724–3729. [CrossRef]
29. Rappaport, T.S.; Gutierrez, F.; Ben-Dor, E.; Murdock, J.N.; Qiao, Y.; Tamir, J.I. Broadband millimeter-wave
propagation measurements and models using adaptive-beam antennas for outdoor Urban cellular
communications. IEEE Trans. Antennas Propag. 2013, 61, 1850–1859. [CrossRef]
30. Nam, J.; Ahn, J.Y.; Adhikary, A.; Caire, G. Joint spatial division and multiplexing: Realizing massive MIMO
gains with limited channel state information. In Proceedings of the 2012 46th Annual Conference on
Information Sciences and Systems (CISS 2012), Princeton, NJ, USA, 21–23 March 2012; Volume 1. [CrossRef]
31. Lu, C.; Wang, W.; Zhong, W.; Gao, X. User scheduling and beam allocation for massive MIMO systems with
two-stage precoding. In Proceedings of the IEEE International Symposium on Personal, Indoor and Mobile
Radio Communications (PIMRC), Valencia, Spain, 4–8 September 2016. [CrossRef]
32. Andrews, M. A survey of scheduling theory in wireless data networks. In Wireless Communications; Springer:
New York, NY, USA, 2007; pp. 1–17.
33. Escudero-Garzas, J.J.; Devillers, B.; Garcia-Armada, A. Fairness-Adaptive Goodput-Based Resource
Allocation in OFDMA Downlink with ARQ. IEEE Trans. Veh. Technol. 2014, 63, 1178–1192. [CrossRef]
34. Alkhateeb, A., Jr.; Heath , R.W. Frequency Selective Hybrid Precoding for Limited Feedback Millimeter
Wave Systems. IEEE Trans. Commun. 2016, 64, 1801–1818. [CrossRef]
35. Kang, J.; Lee, J.H.; Choi, W. Dimension-optimized Two-Stage Precoder Design for Massive MIMO Systems
with Limited Feedback. In Proceedings of the IEEE Military Communications Conference MILCOM,
Los Angeles, CA, USA, 29–31 October 2018; pp. 530–535. [CrossRef]
36. Tataria, H.; Smith, P.J.; Greenstein, L.J.; Dmochowski, P.A.; Shaﬁ, M. Performance and analysis of downlink
multiuser MIMO systems with regularized zero-forcing precoding in Ricean fading channels. In Proceedings
of the IEEE International Conference on Communications, Kuala Lumpur, Malaysia, 22–27 May 2016;
pp. 1–7.
37. Ho, W.; Liang, Y. Optimal Resource Allocation for Multiuser MIMO-OFDM Systems With User Rate
Constraints. IEEE Trans. Veh. Technol. 2009, 58, 1190–1203. [CrossRef]
38. Spencer, Q.; Swindlehurst, A.; Haardt, M. Zero-forcing methods for downlink spatial multiplexing in
multiuser MIMO channels. IEEE Trans. Signal Process. 2004, 52, 461–471. [CrossRef]
39. Li, Z.; Han, S.; Molisch, A.F. Hybrid beamforming design for millimeter-wave multi-user massive MIMO
downlink. In Proceedings of the 2016 IEEE International Conference on Communications (ICC 2016), Kuala
Lumpur, Malaysia, 22–27 May 2016. [CrossRef]
40. Kang, J.; Lee, J.H.; Choi, W. Machine Learning-Based Dimension Optimization for Two-Stage Precoder in
Massive MIMO Systems with Limited Feedback. Appl. Sci. 2019, 9, 2894. [CrossRef]

187
electronics
Article
Evaluation of Multi-Beam Massive MIMO
Considering MAC Layer Using IEEE802.11ac
and FDD-LTE
Fumiya Muramatsu 1 , Kentaro Nishimori 1, *, Ryotaro Taniguchi 1 , Takefumi Hiraguri 2
and Tsutomu Mitsui 2
1 Graduate School of Science and Technology, Niigata University, Niigata-shi, Niigata 950-2102, Japan;
[email protected] (F.M.); [email protected] (R.T.)
2 Faculty of Fundamental Engineering, Nippon Institute of Technology, 4-1 Gakuendai, Miyashiro-machi,
Minamisaitama-gun, Saitama 345-8501, Japan; [email protected] (T.H.); [email protected] (T.M.)
* Correspondence: [email protected]; Tel.: +81-25-262-7431

Received: 26 January 2019; Accepted: 14 February 2019; Published: 18 February 2019

Abstract: Massive multiple-input multiple-output (MIMO) transmission has attracted attention as

a key technology for use in ﬁfth-generation mobile communication systems. Multi-beam massive
MIMO systems that apply beam selection in analog components and blind algorithms in digital
components to eliminate the requirement for channel state information have been proposed as
a method for reducing overhead. In this study, we developed an adaptive modulation scheme
for implementing multi-beam massive MIMO and used computer simulation to compare it with
digital and analog–digital hybrid beam-forming methods. The effectiveness of the proposed system
was veriﬁed in a medium access control layer based on the IEEE802.11ac and frequency division
duplex-LTE representative wireless communication standards.

Keywords: massive MIMO; analog multi-beam; hybrid beam-forming; PHY layer; MAC layer

1. Introduction
Cellular network data traffic volumes have increased significantly with the advent of smart
devices. This trend will continue to grow as Internet of Things (IoT) equipment and big-data services
become more common. To meet this demand, recent research and development have focused on
achieving a 20-Gbps or more standard for future wireless communication [1,2].
Multiple-input multiple-output (MIMO) systems can be used to improve the transmission rate
within a limited frequency band [3]. To this end, multi-user MIMO (MU-MIMO) has been developed
to enable MIMO transmission to multiple users [4]. MU-MIMO technologies have been standardized
for LTE-advanced and IEEE802.11ac [5,6]. Accordingly, massive MIMO is viewed as a fifth-generation
(5G) technology, and it is expected to play an important role in achieving 5G targets [1,7].
In a MIMO system, a base station (BS) acquires channel state information (CSI) between itself and
a user terminal (UT) [8]. As CSI acquisition under massive MIMO involves numerous UT–BS antenna
pair channels, one of the most important challenges in implementing it is the acquisition of large
amounts of CSI with a small overhead. To achieve this, it is necessary to evaluate not only the physical
(PHY) layer but also the medium access control (MAC) layer. The MAC layer is a communication
protocol of a part of the data link layer corresponding to the second layer of the OSI reference model in
IEEE802, and it is located in a layer that is one layer above the physical layer.
Implicit beamforming (BF) has been proposed as an approach for eliminating CSI feedback in
MU-MIMO systems [9]. However, the communication efficiency of short-packet communication
decreases even if implicit BF is applied in a massive MIMO system [10].

Electronics 2019, 8, 225; doi:10.3390/electronics8020225 189 www.mdpi.com/journal/electronics

Electronics 2019, 8, 225

CSI estimation and initial UT tracking, which require a significant amount of overhead,
are necessary in hybrid BF massive MIMO systems [1]. A hybrid BF approach referred to as multi-beam
massive MIMO transmission has been proposed to solve this problem. CSI estimation is unnecessary
in this approach. The effectiveness of multi-beam massive MIMO has been demonstrated through
computer simulation [11–13]. In multi-beam massive MIMO systems, a number of analog multi-beams
are created for initial UT tracking and a subset of these beams with high received power are selected
for use. As multi-beams have narrow beam widths, interference signals can be mitigated and residual
interference can be cancelled by applying a blind array algorithm to digital beam components using
only the information pertaining to a received signal [14–16]. This paper proposes a multi-beam
massive MIMO approach that achieves high transmission efficiency and flexible communication using
asynchronous UT transmission. As the proposed system does not require timing synchronization
among UTs, all UTs can freely transmit signals while avoiding collision between their signals.
The diminished signal transmission time resulting from this configuration reduces the total power
consumption by the UTs.
In our previous studies, we proposed multi-beam massive MIMO beam-selection methods [13]
that utilized received-signal information such as the power difference and amplitude correlation
between beams. This approach was capable of appropriately performing beam-selection at a high
signal-to-interference noise ratio (SINR). Furthermore, this method could be performed through
signal processing using a simple configuration and was highly suitable for hybrid analog-digital
massive MIMO.
Channel quality indicator (CQI) values, which can serve as an index of modulation scheme
determination, cannot be obtained under multi-beam massive MIMO because it does not perform
CSI estimation. In this paper, we propose a simple adaptive modulation scheme for multi-beam
massive MIMO transmission based on amplitude correlation and received power. Under this method,
an appropriate modulation scheme can be simply determined based on the relationship between
amplitude correlation and the SINR. We assessed the proposed method by evaluating the transmission
rates it achieved using the IEEE802.11ac and frequency division duplex (FDD)-LTE standards.
In addition, we evaluated the throughput of the method under the MAC layer of each standard
to consider overhead.
The rest of this paper is organized as follows: The general massive MIMO transmission approach
is described in Section 2. In Section 3, we describe the use of amplitude correlation for modulation and
propose a simple adaptive modulation scheme for multi-beam massive MIMO. In Section 4, we present
our simulation model and describe its application in evaluating simple adaptive modulation schemes.
Then, we describe the results of the performance evaluation of the proposed multi-beam massive MIMO
method through the computer simulation of the IEEE802.11ac and FDD-LTE standard environments.

2. Massive MIMO Transmission and Proposed System

The bandwidth required by 5G systems is achieved using frequencies of 20 GHz or higher.
Massive MIMO utilizes BF technology to resolve propagation loss, which is one of the more serious
problems encountered under high-frequency bands. Typically, a technique known as digital BF (DBF)
is adopted, as shown on the right-hand side of Figure 1. In this technique, weight values are calculated
through digital signal processing. However, the massive MIMO systems that apply DBF experience
problems with respect to power consumption and implementation cost [1,17,18].
The left-hand side of Figure 1 shows the configuration of a typical hybrid BF massive MIMO
with sub-arrays [19]. In the figure, NL and NK indicate the number of elements and receivers in a
sub-array, respectively. This configuration produces a hybrid analog-digital BF that tracks a desired
signal through analog control while removing interference using digital signal processing, and it has
attracted significant interest [20–22]. Massive MIMO is known to be highly effective in addressing
the power consumption and implementation cost problems owing to its hybridization of analog and
digital approaches.

190
Electronics 2019, 8, 225

Multi-beam massive MIMO eliminates the requirement for CSI estimation by forming
numerous narrow directional beams and spatially separating the signals of each UT. As a result,
the synchronization between a BS and a UT, which is required for CSI estimation, becomes unnecessary.
Under standard MU-MIMO, UTs must transmit signals to a BS simultaneously to employ CSI feedback
prior to communication. As multi-beam massive MIMO does not require CSI feedback, a UT can
communicate with a BS at any time.
Figure 2 shows the conﬁguration of analog multi-beam massive MIMO under uplink
communication, along with its 16 multi-beam pattern. The proposed system generates N orthogonal
multiple beams in the analog component and uses a butler matrix circuit to achieve multi-beam
formation [23,24]. The butler matrix circuit, which was introduced in [24], forms an orthogonal beam
using a Fast Fourier Transform (FFT) on a spatially arranged array. This system is also effective from
the viewpoint of the pilot contamination problem in the multi-cell scenario because of the signals by
multiple UTs can be received without the CSI estimation by the multi-beam circuit.

Figure 1. Uplink conﬁgurations of typical analog–digital hybrid (left) and full-digital (right)
beamforming (BF) massive multiple-input multiple-output (MIMO) systems.

Figure 2. Uplink conﬁguration of multi-beam massive MIMO in uplink and 16 multi-beam pattern.

191
Electronics 2019, 8, 225

The proposed method performs beam selection using an amplitude detector and a processor.
Through beam selection, it is possible to spatially separate a specific UT using beam directivity. However,
interference signals from the other UTs, which are also received by side lobes, cannot be rejected
through only the multi-beam forming network. A method that uses only amplitude information, such as
amplitude correlation, must be used to achieve beam selection with low interference power.
The system developed in this study applies robust independent component analysis (ICA) using
a blind adaptive algorithm [16,25]. ICA is a blind signal processing technique that is commonly used
in image processing and medicine. In this technique, an observed random vector is decomposed into
statistically independent variables [16]. ICA does not require CSI because it utilizes only received
signals. The use of ICA allows for a hybrid conﬁguration for efﬁcient transmission in massive MIMO
systems in which multi-beams are applied to analog components and a blind algorithm is applied to
digital components.

3. Adaptive Modulation for Multi-Beam Massive MIMO

As discussed in the previous section, multi-beam massive MIMO does not require CSI estimation,
thereby eliminating the requirement for a process to obtain channel information at the BS side. As a
result, it is not possible to use a modulation and coding scheme (MCS) index to combine the modulation
scheme and coding rate from the CQI value, as is done under current wireless communication
standards [5,6]. Therefore, we developed an adaptive modulation scheme based on the received power
and amplitude correlation obtained from performing beam selection during uplink.
Here, we explain the beam-selection method and amplitude correlation in multi-beam massive
MIMO. Beam-selection is one of the major challenges in multi-beam massive MIMO. As a directivity
peak cannot be directed to a given UT, it is possible for one beam to receive signals from multiple
UTs. It has been confirmed that in such situations, signal separation cannot be perfectly performed in
digital components, even by a blind algorithm. To address this, the authors previously proposed a
beam-selection method that utilizes the power differences between the multi-beams and correlation
values obtained from amplitude information [13]. By employing this method, it is possible to achieve
beam selection with a high SINR by setting thresholds.
Under the proposed beam-selection method, amplitude correlation is calculated based on the
covariance between the amplitudes of received and adjacent beams. The amplitude of a received
signal is defined as a vector, x, whose number of elements is equivalent to the lengths of all
transmission-signal symbols. The correlation coefficient, ρ( x, y), with respect to the signal amplitudes,
x and y, is denoted as

N
∑ ( xi − x )(yi − y)
i =1
ρ( x, y) = 7 7 , (1)
N N
∑ ( x i − x )2 ∑ ( y i − y )2
i =1 i =1

where x is the mean of all elements in x.

Figure 3 shows the relationship between amplitude correlation and the SINR. The SINR of
the received signal tends to be high when the amplitude correlation is high. Based on this ﬁnding,
we examined a method of determining a simple adaptive modulation approach on the downlink using
the amplitude correlation and received power obtained on the uplink.

192
Electronics 2019, 8, 225

Amplitude correlation
0.8

0.6

0.4

0.2

0
-2 0 2 4 6 8 10 12
SINR [dB]

Figure 3. Relationship between amplitude correlation and signal-to-interference noise ratio (SINR).

4. Computer Simulation
The results of the computer simulation of the proposed system were verified in terms of the
Rayleigh fading and angular spread of quadrature amplitude modulation (QAM) signals [26,27].
The simulation conditions are listed in Table 1. Figure 4 shows the simulation model, which is a
scattering ring model [27] with a specific angular spread. The model applies 101 paths per UT with an
angular spread of 1.0 degree. We assume flat fading in narrowband signals and do not consider delay
spread. Sixty-four elements are arranged in the horizontal direction at intervals of 0.5 wavelengths,
and beam width is approximately 1.60 degrees. As the assumed transmission method involves a
single carrier for narrowband signals, delay waves have no influence on signal separation. The signals
for each sub-carrier are regarded as narrowband signals to apply them as actual broadband signals
through orthogonal frequency division multiplexing. In the UT distribution, the center direction of
the BS is set as 0 degrees and individual UTs are placed at random angles within a range of −60 to
60 degrees. A constant distance is set between the BS and all UTs so that a signal-to-noise ratio (SNR)
of 20 [dB] per UT is received at the BS.

Figure 4. Simulation model.

193
Electronics 2019, 8, 225

Table 1. Simulation conditions.

Number of receiving antennas 64

Number of UTs 20
Number of demodulated UTs 8
SNR per UT, before array gain acquisition 20 [dB]
Modulation scheme 4QAM (QPSK) – 1024QAM
Trial number 3000
Data length 3000
Data smoothing size 3000
Number of iterations 30

First, the processing of the uplink was simulated transmitting QPSK signals from all 20 UTs to the
BS, which recorded the amplitude correlation and received power for each beam. On the downlink,
the BS transmitted QPSK 1024-QAM signals to eight users with high reception power. Then, the UTs
received signals from the BS calculated bit error rate (BER). We assumed that a modulation scheme
yielding a BER of less than 10−2 could be applied [28,29]. We evaluated the applicable modulation
schemes in terms of the SINR and received power recorded on the uplink to simulate an adaptive
modulation scheme.

4.1. Adaptive Modulation

The results produced by the simulation procedure described in Section 4 are shown in Figure 5,
in which the left-hand and right-hand ﬁgures show the relationship between the applicable modulation
method and amplitude correlation and received power, respectively. Even though amplitude
correlation increases with the modulation order, it is not appropriate to select a modulation method
using only amplitude correlation because the standard deviation of the results is large. Therefore,
the received power relationship shown in the right-hand ﬁgure was used to develop an indicator for
determining the modulation method (Table 2). In the table, ρ and P represent amplitude correlation
and received power, respectively.

100
0.7
Amplitude correlation

80
0.6
CDF [%]

60
0.5

0.4 Median
40
16 QAM
32 QAM
Standard 20
0.3 deviation
64 QAM
128 QAM
0.2 0
0 2 4 6 8 10 20 22 24 26 28 30 32 34
Modulation order [bits/symbol] Received power [dB]
Figure 5. Relationships between applicable modulation scheme and amplitude correlation (left) and
received power (right).

194
Electronics 2019, 8, 225

Table 2. Adaptive modulation scheme for eight user terminals (UTs).

Modulation Scheme Conditions

None ρ < 0.4
16-QAM ρ ≥ 0.4 and P < 20 [dB]
32-QAM ρ ≥ 0.4 and 20 ≤ P < 24 [dB]
64-QAM ρ ≥ 0.4 and 24 ≤ P < 28 [dB]
128-QAM ρ ≥ 0.4 and P ≥ 28 [dB]

4.2. IEEE802.11ac
We evaluated the indicator shown in Table 2 and the downlink effectiveness of multi-beam
massive MIMO transmission based on the IEEE802.11ac procedures [5,30]. The simulation conditions
and model were the same as those used in Section 4. Equation (2) was used as an index of evaluation
based on the BER calculated on the UTs.

M (1 − BER) (BER ≤ 10−2 )
R= [bit/symbol], (2)
0 (otherwise)

where M denotes the number of bits per symbol and R denotes the number of bits per symbol obtained
independently of the coding rate.
The assessment was divided into parts, i.e., the evaluation of the PHY layer without considering
the overhead arising from the control signal and the evaluation of the MAC layer considering
overhead. Table 3 shows the relationship between the MCS index and the transmission rate under the
IEEE802.11ac 20-[MHz] operation [5]. The values of R are obtained by multiplying the modulation
order by the coding rate, and they are equivalent to R, which is the number of bits per symbol.
Accordingly, it was assumed that an MCS index that satisﬁes R ≥ R could be applied to the R
obtained on the UTs. Based on this, we could evaluate the transmission rate corresponding to the
MCS index.

Table 3. Relationship between MCS index and transmission rate under IEEE802.11ac at 20 MHz.

I MCS Modulation Coding Rate R [bits/symbol] Transmission Rate [Mbps]

0 BPSK 1/2 0.5 7.2
1 QPSK 1/2 1.0 14.4
.. .. .. .. ..
. . . . .
9 256QAM 3/4 6.0 86.7

In the MAC layer, throughput is calculated according to the downlink MU-MIMO procedure
shown in Figure 6 [5]. As seen in the ﬁgure, the BS ﬁrst transmits request signals to all UTs to obtain
channel information. Once the BS has obtained channel feedback from each UT data transmission
signal, data reception processing is performed for all UTs. Throughput is obtained by dividing the
data size of frame aggregation, which was set as 63,000 bytes in this evaluation, by the time required
by the processes. As multi-beam massive MIMO does not require procedures such as CSI estimation
or feedback, we assumed that no time other than that necessary for data transmission was required.

195
Electronics 2019, 8, 225

Propagation channel
estimation request CSI feedback Data transmission Confirm data reception
DIFS
+ Backoff

BS
NDPA NDP BRP BRP DATA BAR BAR

UT#1 BR BA
BR BA
UT#2
⋮ ⋮ ⋮ ⋮
BR BA
UT#NU
NDPA : Null Data Packet Announcement, NDP : Null Data Packet, BR : Beamforming Report,
BRP : BR Poll, BA: Block ACK, BAR: BA Request, DIFS: DCF Inter Frame Space

Figure 6. Downlink multi-user (MU)-MIMO procedure under IEEE802.11ac.

We evaluated the performance of a typical hybrid BF massive MIMO and compared it with
those of the sub-array and full-digital BF conﬁgurations discussed in Section 2. Table 4 shows the
precoding and decoding methods used under each conﬁguration in the downlink. For the hybrid
BF, we assumed that maximum ratio combining (MRC) was performed in the analog component
using a phase shifter [31]. In the digital component, the minimum mean square error (MMSE) weight
calculated in the uplink was used as a precoder [31]. Eight sub-arrays (NK = 8) and eight sub-array
elements (NL = 8) were utilized, resulting in 64 elements. In the full-digital BF, transmission precoding
was performed using a block diagonalization (BD) method with 64 elements [32].

Table 4. Precoding and decoding on downlink.

Massive MIMO Conﬁguration Precoding Decoding

Multi-beam massive MIMO (Figure 2) Analog multi-beam Robust ICA
Hybrid BF massive MIMO (Figure 1) MRC and MMSE MMSE
Full-digital BF massive MIMO (Figure 1) BD method MMSE

Figure 7 shows the cumulative distribution function (CDF) characteristics of transmission rate
and throughput. The adaptive modulation results show the characteristics obtained for each trial
when the highest modulation order is selected in the modulation scheme in which the BER is 10−2
or less. The proposed modulation results are obtained using the characteristics based on Table 2.
At the assumed transmission rate, the full-digital BF configuration achieves 86.7 [Mbps], which is
the maximum transmission rate obtainable under IEEE802.11ac, because it scans a sharp beam width
using 64 elements. The median value of multi-beam massive MIMO is approximately 70 [Mbps], and it
is asymptotic to the ideal characteristic obtained by utilizing the proposed simple adaptive modulation.
The hybrid BF configuration characteristic achieves a low modulation order because beam width
is widened when there are eight elements per sub-array. The full-digital configuration can achieve
a throughput of only approximately 40 [Mbps] because of the overhead of channel estimation and
feedback obtained when there are 64 antenna elements. In hybrid BF, overhead occurs owing to eight
elements, which is the number of sub-arrays. As shown in Figure 6, throughput decreases when the
transmission rate of even one UT is slow because IEEE802.11ac requires the synchronization of all UTs,
as shown in Figure 6. This reduces the throughput characteristic of the hybrid BF. The multi-beam
massive MIMO characteristic is the best among the three configurations because it does not require
synchronization and overhead and can effectively utilize the transmission rate obtained in the
PHY layer.

196
Electronics 2019, 8, 225

100 100
Multi-beam
( adaptive modulation)
80 80
Multi-beam

CDF [%]
CDF [%]

( proposed modulation)
60 60
Hybrid BF
( adaptive modulation)
40 40
Full-digital BF
( adaptive modulation)
20 20

0 0
0 20 40 60 80 100 0 20 40 60 80 100
Transmission rate [Mbps] Throughput [Mbps]

Figure 7. Characteristics of transmission rate and throughput under IEEE802.11ac.

4.3. FDD-LTE
We evaluated the proposed method under FDD-LTE (Release 15) [6].We assumed a bandwidth
of 20 [MHz] per UT and 100 resource blocks (RBs). Table 5 shows the relationship between the MCS
index and the transmission rate in LTE when there is one antenna. The transport block size (TBS) is
based on [6], and the transmission rate is calculated based on the efﬁciency per bandwidth. The TBS
corresponds to an actual data component divided into RBs and transmitted, while user data indicate
the resource size for a given control signal. As in the IEEE802.11ac simulation, we compared the
values of R and R produced by the respective approaches and evaluated the results in terms of the
corresponding transmission rates.

Table 5. Relationship between MCS index and transmission rate under frequency division duplex
(FDD)-LTE (100 RBs, 20 MHz).

I MCS Modulation TBS User Data Coding Rate R [bits/symbol] Transmission Rate [Mbps]
0 QPSK 2792 30000 0.094 0.188 7.22
1 QPSK 4548 30000 0.154 0.308 9.61
2 QPSK 7224 30000 0.242 0.484 21.7
.. .. .. .. .. ..
. . . . . .
27 256QAM 84760 120000 0.707 5.656 127.0

The throughput in LTE, which is denoted here by T, can be calculated as follows: [6,33].

T = TBS × Nstr
Nsubc × Nslot × Nsym × M × Nstr × CR [bps],

where Nsubc is the number of sub-carriers, Nslot is the number of slots per second, Nsym is the number
of symbols per slot, Nstr is the number of streams, and CR is the coding rate.
We apply Equation (3) to consider control signals because Equation (3) cannot express the number
of control signals that can be eliminated under multi-beam massive MIMO.

Userdata
T = TBS × × Nsubf × Nstr [bps], (3)
Userdata

where Nsubf is the number of sub-frames per second. The value of Userdata is obtained by increasing or
decreasing the number of control signals from secured Userdata, which is defined in [6]. For example,
when I MCS = 27 in Table 1, Userdata is obtained as 120,000 [bits]. This corresponds to a Userdata of
128,000 [bits] in the multi-beam massive MIMO, which is obtained by removing the control signal from
Userdata. As there is no complete definition of a control signal in the case of 64 elements, we assumed
that one antenna port would be added to handle the additional control signal required each time the
number of antennas was doubled. Therefore, Userdata was 83,200 [bits] for 64 antennas under the
full-digital configuration.

197
Electronics 2019, 8, 225

Figure 8 shows the relationship between the transmission rate and throughput based on
Equation (3). We used this ﬁgure to evaluate throughput based on overhead for each conﬁguration.

100
Overhead-less

Throughput [Mbps]
80 NT = 1
NT = 8
NT = 64
60

0
0 20 40 60 80 100 120
Transmission rate [Mbps]
Figure 8. Transmission rate versus throughput from Equation (3).

Figure 9 shows the CDF characteristics of the transmission rate and throughput under FDD-LTE.
The modulation scheme for each characteristic is determined in the same manner as that used in
the evaluation under IEEE802.11ac. The transmission rate follows the same trend as that under
IEEE802.11ac. However, unlike IEEE 802.11ac, a high transmission rate can be achieved even with
a low modulation order. This suggests that the performance of the hybrid BF approaches that of
multi-beam massive MIMO. An evaluation of throughput based on Figure 8 reveals that it increases
linearly with transmission rate under all configurations. Given that multi-beam massive MIMO has
the lowest number of control signals of all configurations, it achieves the highest throughput at a given
transmission rate. This verifies the effectiveness of multi-beam massive MIMO, which can effectively
utilize the transmission rate in the PHY layer.

100 100
Multi-beam
( adaptive modulation)
80 80
Multi-beam
CDF [%]
CDF [%]

( proposed modulation)
60 60
Hybrid BF
( adaptive modulation)
40 40
Full-digital BF
( adaptive modulation)
20 20

0 0
0 20 40 60 80 100 120 0 20 40 60 80 100
Transmission rate [Mbps] Throughput [Mbps]

Figure 9. Characteristics of transmission rate and throughput under FDD-LTE.

5. Conclusions
In this paper, we proposed a simple adaptive modulation method for multi-beam massive
MIMO and evaluated its performance when implemented in a MAC layer. The effectiveness of the
proposed transmission method was validated through computer simulation. The performance of the
method asymptotically approached the ideal characteristic because of the implementation of simple
adaptive modulation using amplitude correlation and received power. The proposed method was also
shown to achieve higher throughput characteristics compared to a hybrid approach with sub-arrays
and a full-digital beam-forming conﬁguration under the IEEE802.11ac and FDD-LTE environments.
Future works include the further theoretical analysis and experimental evaluation using the analog
multi-beam by Butler matrix circuit.

Author Contributions: This study was led by F.M. while K.N., R.T., T.H. and T.M. assisted with the
computer simulations.

198
Electronics 2019, 8, 225

Funding: This work was partially supported by the SCOPE #165004002, #185004002 and KAKENHI, Grant-in-Aid
for Scientific Research (B) (17H01738, 17H03262).
Conflicts of Interest: The authors declare no conflict of interest.

References
1. Papadopoulos, H.; Wang, C.; Bursalioglu, O.; Hou, X.; Kishiyama, Y. Massive MIMO technologies and
challenges towards 5G. IEICE Trans. Commun. 2015, E99-B, 602–621. [CrossRef]
2. Obara, T.; Suyama, S.; Shen, J.; Okumura, Y. Joint Processing of Analog Fixed Beamforming and
CSI-Based Precoding for Super High Bit Rate Massive MIMO Transmission Using Higher Frequency Bands.
IEICE Trans. Commun. 2015, E98-B, 1474–1481. [CrossRef]
3. Foschini, G.J.; Gans, M.J. On limits of wireless communications in a fading environment when using multiple
antennas. Wirel. Pers. Commun. 1998, 6, 331–335. [CrossRef]
4. Gesbert, D.; Kountouris, M.; Heath, R.W., Jr.; Chae, C.B. ; Salzer, T. Shifting the MIMO paradigm. IEEE Signal
Process. Mag. 2007, 24, 36–46. [CrossRef]
5. IEEE P802.11ac./D5.0, Part 11: Wireless LAN Medium Access Control (MAC and Physical Layer (PHY)
Specifications; IEEE Standard for Information Technology; IEEE: New York, NY, USA, 2013.
6. 3rd Generation Partnership Project (3GPP). TS 36.213 Physical Layer Procedures; V10.2.0; 3GPP:
Sophia Antipolis, France, 2018.
7. Lu, L.; Li, G.Y.; Swindlehurst, A.L.; Ashikhmin, A.; Zhang, R. An overview of massive MIMO: Benefits and
challenges. IEEE J Sel. Top. Signal Process. 2014, 8, 742–758. [CrossRef]
8. Spencer, Q.H.; Peel, C.B.; Swindlehurst, A.L.; Haardt, M. An introduction to the multi-user MIMO downlink.
IEEE Commun. Mag. 2004, 42, 60–67. [CrossRef]
9. Murakami, T.; Fukuzono, H.; Takatori, Y.; Mizoguchi, M. Multiuser MIMO with implicit channel feedback in
massive antenna systems. IEICE Commun. Express 2013, 2, 336–342. [CrossRef]
10. Hiraguri, T.; Nishimori, K. Survey of transmission methods and efficiency using MIMO technologies for
wireless LAN systems. IEICE Trans. Commun. 2015, E98-B, 1250–1267. [CrossRef]
11. Kameyama, K.; Nishimori, K.; Hiraguri, T.; Yamada, H.; Makino, H. Analog multi-beam massive MIMO
transmission eliminating CSI estimation. IEICE Trans. Commun. 2016, J99-B, 743–752. (In Japanese)
12. Taniguchi, R.; Nishimori, K.; Makino, H. Multi-beam massive MIMO using constant modulus algorithm for
QAM signals employing amplitude and phase offset compensation. IEICE Trans. Commun. 2017, E100-B,
262–268. [CrossRef]
13. Muramatsu, F.; Nishimori, K.; Taniguchi, R.; Hiraguri, T. Multi-beam massive MIMO with beam-selection
using only amplitude information in uplink channel. IEICE Trans. Commun. 2018, E101-B, 1544–1551.
[CrossRef]
14. Treichler, J.R.; Agee, B.G. A new approach to multipath correction of constant modulus signals. IEEE Trans.
Acoust. Speech Signal Process. 1983, 31, 459–472. [CrossRef]
15. Agee, B. The least-Square CMA: A new technique for rapid correction of constant modulus signals.
Proc. IEEE ICASSP 1986, 953–956.
16. Hyvarinen, A. Fast and robust fixed-point algorithms for independent component analysis. IEEE Trans.
Neural Netw. 1999, 10, 626–634. [CrossRef]
17. Sohrabi, F.; Yu, W. Hybrid digital and analog beamforming design for large-scale antenna arrays. IEEE J. Sel.
Top. Signal Process. 2016, 10, 501–513. [CrossRef]
18. Bogale, T.E.; Le, L.B. Beamforming for multiuser massive MIMO systems: Digital versus hybrid
analog-digital. In Proceedings of the 2014 IEEE Global Communications Conference, Austin, TX, USA,
8–12 December 2014; pp. 4066–4071.
19. Molisch, A.F.; Ratnam, V.; Han, S.; Li, Z.; Nguyen, L.H.; Li, L.; Haneda, K. Hybrid beamforming for massive
MIMO: A survey. IEEE Commun. Mag. 2017, 55, 134–141. [CrossRef]
20. Ngo, H.Q.; Larsson, E.G.; Marzetta, T.L. Energy and spectral efficiency of very large multiuser MIMO
systems. IEEE Trans. Commun. 2013, 61, 1436–1449.
21. Rusek, F.; Persson, D.; Lau, B.K.; Larsson, E.G.; Marzetta, T.L.; Edfors, O.; Tufvesson, F. Scaling up MIMO:
Opportunities and challenges with very large arrays. IEEE Signal Process. Mag. 2013, 30, 40–60. [CrossRef]

199
Electronics 2019, 8, 225

22. Hoydis, J.; Brink, S.T.; Debbah, M. Massive MIMO in the UL/DL of cellular networks: How many antennas
do we need? IEEE J. Sel. Areas Commun. 2013, 31, 160–171. [CrossRef]
23. Butler, J.; Lowe, R. Beamforming matrix simpliﬁes design of electronically scanned antennas. Electron. Des.
1961, 9, 170–173.
24. Yamamoto, S.; Hirokawa, J.; Ando, M. A single-layer hollow-waveguide 8-way Butler matrix with modiﬁed
phase shifters. In Proceedings of the ISAP2015, Hobart, Australia, 9–12 November 2015; pp. 937–940.
25. Zarzoso, V.; Comon, P. Robust independent component analysis by iterative maximization of the kurtosis
contrast with algebraic optimal step size. IEEE Trans. Neural Netw. 2010, 21, 248–261. [CrossRef]
26. Gesbert, D.; Bolcskei, H.; Gore, D.A.; Paulraj, A.J. Outdoor MIMO wireless channels: Models and
performance prediction. IEEE Trans. Commun. 2002, 50, 1926–1934. [CrossRef]
27. Oestges, C.; Erceg, V.; Paulraj, A.J. A physical scattering model for MIMO macrocellular broadband wireless
channels. IEEE J. Sel. Areas Commun. 2003, 21, 721–729. [CrossRef]
28. Chung, S.T.; Goldsmith, A.J. Degrees of freedom in adaptive modulation: A unified view. IEEE Trans. Commun.
2001, 49, 1561–1571. [CrossRef]
29. Nishimori, K.; Yomo, H.; Popovski, P. Distributed interference cancellation for cognitive radios using periodic
signals of the primary system. IEEE Trans. Wirel. Commun. 2011, 10, 2791–2981. [CrossRef]
30. Kikuma, N.; Nishimori, K.; Hiraguri, T. Effect of user antenna selection on block beamforming algorithms
for suppressing inter-user interference in multiuser MIMO system. IEICE Trans. Commun. 2018, E101-B,
1523–1535. [CrossRef]
31. Tanbourgi, R.; Dhillon, H.S.; Andrews, J.G.; Jondral F.K. Effect of spatial interference correlation on the
performance of maximum ratio combining. IEEE Trans. Wirel. Commun. 2014, 13, 3307–3316. [CrossRef]
32. Stankovic, V.; Haardt, M. Generalized design of multi-user MIMO precoding matrices. IEEE Trans.
Wirel. Commun. 2008, 7, 953–961. [CrossRef]
33. Rathi, S.; Malik, N.; Chahal, N.; Malik, S. Throughput for TDD and FDD 4G LTE systems. Int. J. Innov.
Technol. Explor. Eng. (IJITEE) 2014, 3, 73–77.

200
electronics
Article
Computationally Efﬁcient Channel Estimation in 5G
Massive Multiple-Input Multiple-output Systems
Imran Khan 1 , Mohammad Haseeb Zafar 1 , Majid Ashraf 1 and Sunghwan Kim 2, *
1 Department of Electrical Engineering, University of Engineering & Technology, Peshawar P.O.B. 814,
Pakistan; [email protected] (I.K.); [email protected] (M.H.Z.);
[email protected] (M.A.)
2 School of Electrical Engineering, University of Ulsan, Ulsan 44610, Korea
* Correspondence: [email protected]; Tel.: +82-052-259-1401

Received: 9 November 2018; Accepted: 28 November 2018; Published: 3 December 2018

Abstract: Traditional channel estimation algorithms such as minimum mean square error (MMSE)
are widely used in massive multiple-input multiple-output (MIMO) systems, but require a matrix
inversion operation and an enormous amount of computations, which result in high computational
complexity and make them impractical to implement. To overcome the matrix inversion problem,
we propose a computationally efﬁcient hybrid steepest descent Gauss–Seidel (SDGS) joint detection,
which directly estimates the user’s transmitted symbol vector, and can quickly converge to obtain
an ideal estimation value with a few simple iterations. Moreover, signal detection performance was
further improved by utilizing the bit log-likelihood ratio (LLR) for soft channel decoding. Simulation
results showed that the proposed algorithm had better channel estimation performance, which
improved the signal detection by 31.68% while the complexity was reduced by 45.72%, compared
with the existing algorithms.

Keywords: 5G; massive MIMO; computational efﬁciency; precoding algorithms; channel estimation

1. Introduction
Multiple-input multiple-output (MIMO) technology is becoming more and more mature,
especially when combined with orthogonal frequency division multiplexing (OFDM) [1–5], which has
been successfully applied in multiple wireless communications fields such as Long-Term Evolution
(LTE) and LTE-Advanced. However, traditional MIMO technology can only achieve a 4 × 4 or
8 × 8 scale system [6], which makes it difficult to meet the explosive growth in mobile data services.
Therefore, in recent years, massive MIMO has been proposed based on traditional MIMO technology [7].
Massive MIMO systems configure up to hundreds of antenna arrays at the base station to serve multiple
single-antenna end-users simultaneously [8], which can improve spectrum utilization and power
utilization in wireless communications systems by two to three orders of magnitude [9–11]. This has
become one of the most promising enabling technologies and one of the hottest research directions in
5G [12]. The maximum likelihood (ML) algorithm is the optimal algorithm in massive MIMO detection
algorithms, but its computational complexity increases exponentially with the number of system
antennas and the modulation order of baseband signals. It is difficult for it to be fast, effective, and
realized in practical applications. Linear detection methods, such as the zero-forcing (ZF) algorithm
and minimum mean square error (MMSE) algorithm, can achieve near-optimal detection performance
in massive MIMO systems. The complexity in this kind of detection algorithm is greatly reduced,
compared with the complexity of the ML algorithm, but introduces a complex high-dimensional
matrix inversion operation, so a low-cost and efficient engineering implementation is still a problem
to be solved. Aimed at this problem, many simplified algorithms based on the MMSE detection
scheme have been proposed in recent years, and can be roughly divided into three types: The series

Electronics 2018, 7, 382; doi:10.3390/electronics7120382 201 www.mdpi.com/journal/electronics

Electronics 2018, 7, 382

expansion class-approximation method [13,14], the iterative class-approximation method [15,16] and a
gradient-based search for an approximate solution [17–20]. The authors in [13] proposed a method of
using Neumann series expansion to approximate the inverse MMSE filter matrix, but when the number
of expansion stages was gradually increased (i > 2), the computational complexity was still high,
even equal to or exceeding the exact MMSE. The complexity of the filter matrix inversion algorithm
also loses a large degree of detection performance. The authors in [14] applied the Newton algorithm
derived from the first-order Taylor series expansion (similar to Neumann series expansion) to massive
MIMO signal detection, and used the iterative method to improve the estimation accuracy of the
MMSE filter matrix inversion. However, from the aspects of detection performance and computational
complexity, the algorithm based on the Newton iteration was not dominant. Different from the two
series expansion-based algorithms above, it is necessary to estimate the signal vector sent by the user
by inverting the approximate matrix. Some iterative algorithms based on solving linear equations, such
as the Richardson iterative (RI) algorithm [15] and the successive over-relaxation (SOR) algorithm [16],
use the special properties of the MMSE filter’s symmetric positive definite matrix. Through the method
of solving linear equations, they directly estimate the transmission vector, thus avoiding the inversion
of high-dimensional matrices.
The RI and SOR algorithms mentioned above have lower computational complexity at a fixed
number of iterations, but RI convergence is slower and requires a higher number of iterations to
achieve certain detection performance requirements. In SOR, although the detection performance
is close to excellent, its internal iterative structure means the algorithm cannot be implemented in
parallel in practical applications. The third type of algorithm is mainly designed and implemented
based on the idea of a matrix gradient, including the conjugate-gradient (CG) method [17] and the
steepest descent (SD) method [18]. This type of algorithm uses the matrix gradient search method and
avoids the high-dimensional matrix inversion problem. However, compared to the method of series
expansion, the CG and SD algorithms bring about great improvement in detection performance, but
calculation of the matrix gradient after each iteration also causes higher complexity.
In this paper, a low-complexity joint detection algorithm was proposed. The SD algorithm had a
good convergence direction at the beginning of the iteration, and the Gauss–Seidel (GS) algorithm with
low complexity mentioned in [19] was combined with the SD method (called SDGS), which provided
an effective search direction for GS iterations, speeding up convergence and improving the detection
performance. Furthermore, applying it to soft output detection gave an approximate calculation
method for the bit log-likelihood ratio (LLR) of the channel decoder input. A good compromise
between detection performance and computational complexity was achieved.
The rest of this paper is organized as follows: Section 2 discusses the system model and analytical
derivations, Section 3 explains signal detection, while Section 4 explains the mixed iterative algorithm
and the proposed algorithm. Section 5 provides the simulation results, while Section 6 concludes
the paper.

2. System Model
The research object considered in this paper was the uplink for a massive MIMO system consisting
of a base station equipped with N antennas and K single-antenna users where N K, as shown
in Figure 1. Let sc = [s1 , s2 , . . . , sK ]T denote the K × 1 dimensional symbol vector sent by all users
simultaneously, where sk ∈ ε was the transmitted symbol from the kth user, and ε was the modulation
symbol set.
Let Hc ∈ CN×K represent the Rayleigh fading channel matrix; then, the N × 1 dimensional signal
vector received by the base station could be recorded as:

yc = Hc sc + nc (1)

202
Electronics 2018, 7, 382

where nc represented an additive white Gaussian noise (AWGN) vector with an N × 1 dimensional
mean of 0 and a covariance matrix of σ 2 IN . Converting the complex model of Equation (1) into an
equivalent real model gave:
y = Hs + n (2)

Among these terms, s ∈ R2K , H ∈ R2N×2K , y ∈ R2N , and n ∈ R2N , which were:

(Hc ) −(Hc ) (yc )
H= , y=
(Hc ) (Hc ) (yc )
(3)
(sc ) (nc )
s= , n=
(sc ) (nc )

Among those, (·) and (·) indicated the real part and imaginary part, respectively.

8(
.
.
.
8(
%6L

8(.

&HOOM

8(
.
.
.
8(
%6L

8(.

&HOOL

8(
.
.
.
%6L

8(.

&HOOO

Figure 1. Massive multiple-input multiple-output system model.

2.1. Minimum Mean Square Error Signal Detection

The main task of signal detection was to accurately determine user transmission vector s at the
base station through received signal vector y. The transmitted signal vector ŝ detected by the MMSE
algorithm could be expressed as:
−1
ŝ = H H H + σ2 I2K H H y = W −1 ŷ (4)

where ŷ = H H y. The ﬁlter matrix W of the MMSE detector could be expressed as:

W = G + σ2 I2K (5)

203
Electronics 2018, 7, 382

where G = H H H was the Gram matrix. In massive MIMO systems, the computational complexity of

W −1 is O K3 , which makes the implementation of the MMSE algorithm very complex.

2.2. Log Likelihood Ratio Calculation

Various channel coding techniques are commonly employed in wireless communication systems
to improve their error performance, since channel reliability can be used to improve system stability.
Conventional MIMO system signal detection generally uses a hard decision method to directly execute
symbol decisions on the estimated value of the user-transmitted signal vector, i.e., ŝ in Equation (4).
In order to output the soft detection information to the back end of the detector, after the MMSE
detector estimates ŝ, the LLR soft information used for channel decoding could be calculated with the
following method. First, we needed to restore the estimated ŝ and the calculated W −1 to the equivalent
complex ﬁeld to get ŝc and Wc−1 . Let U = Wc−1 Gc = Wc−1 HcH Hc denote the equalized channel matrix.
The equalized signals obtained by the MMSE ﬁlter matrix could be obtained from Equations (2) and (4)
as follows:
ŝc = Wc−1 Gc sc + Wc−1 HcH nc
(6)
= Usc + Wc−1 HcH nc
Then, the estimated value of the symbol transmitted by the ith user is ŝc,i = μi sc,i + ei , where
μi = [U ]ii = Uii represented the effective channel gain after equalization, and ei represented the noise
2
plus interference (NPI) term contained in the ŝc,i . The noise variance was v2i = ∑Kj=i Uji + Eii σ2 ,
where Uji and Eii represented the ( j, i )th element of the matrix U and the ith diagonal of the matrix
H
E, respectively, where E = Wc−1 HcH Wc−1 HcH = Wc−1 Gc Wc−1 . Using the max-log approximation
representation given in [11], the LLR Li,b corresponding to the bth bit transmitted by the ith user was
expressed as:

min ŝc,i 2 min ŝc,i 2
a
Li,b = Υi a − (7)
a ∈ Ob0 μi a ∈ Ob1 μi

where Υi = μ2i /v2i represented the signal-to-interference plus noise ratio (SINR), and Ob0 and Ob1
represented the modulation symbol set with the bth bit being 0 and 1, respectively.

3. Low Complexity Signal Detection

3.1. Neumann Series Expansion

In a massive MIMO system, the MMSE signal detection algorithm involves a high-dimensional

matrix inversion, W −1 , with a computational complexity of O K3 . In order to reduce the
computational complexity of W −1 , the authors in [11] proposed using Neumann series expansion to
approximate matrix inversion results. When W approximates the invertible matrix X and satisﬁes

lim ( I − WX )n = 0 (8)
n→∞

then, the Neumann series can be expressed as

n
W −1 = Σ ∞ −1
n =0 X ( X − W ) X (9)

The decomposition matrix is W = D + E, where D is the diagonal matrix of W, and E is the

hollow matrix corresponding to W. Since the number of antennas equipped at the base station was
much larger than the number of single-antenna users ( N K ), matrix W has a diagonal dominant
characteristic [3]; that is, W ≈ D. Substituting D for X in Equation (9) gives:
n
W −1 = Σ ∞ −1
n =0 − D E D −1 (10)

204
Electronics 2018, 7, 382

n
when lim − D −1 E = 0, the progression of Equation (10) converges. If we only expand the ﬁrst i
n→∞
term of the Neumann series, we can get:
n
Wi−1 = Σin−=10 − D −1 E D −1 (11)

when the value of i is small, the Neumann series expansion can approximate W −1 with lower
complexity. For example, when i = 2, W2−1 = D −1 − D −1 ED −1 , which is computationally complex,

and the complexity is O K2 .

3.2. Gauss–Seidel Algorithm

In the Neumann series expansion algorithm, when the number of expansion terms i ≥ 3, the

computational complexity is still O K3 , which is equal to or even exceeds the complexity of the
exact inverse calculation of the MMSE ﬁlter matrix. Unlike the Neumann series expansion, which
approximates W, the GS algorithm [19] can solve N-dimensional linear equations of the form Ax = b
without inverting the matrix, where matrix A is an N × N dimensional symmetric positive deﬁnite
matrix, x is the N × 1 dimensional solution vector, and b is the N × 1 dimensional measurement vector.
Decomposing matrix A into a diagonal element matrix, D A , a strict lower triangular element matrix,
L A , and a strict upper triangular element matrix, L H A , the GS algorithm can estimate x by the following
iterative method:
x̂ (i) = ( D A + L A )−1 b − L HA x̂
( i −1)
(12)

where i = 1, 2, . . . represents the number of iterations of the GS algorithm. In a massive MIMO system,
as the number of base station antennas increases substantially, when it is much larger than the number
of single-antenna users ( N K ), the individual column vectors of the uplink channel matrix H are
progressively orthogonal [20], and W = G + σ2 I2K is a symmetric positive deﬁnite matrix. Similarly,
W can be decomposed into:
W = D + L + LH (13)

Among those terms, D, L, and L H , respectively, is the diagonal element matrix of W, the strict
lower triangular element matrix, and the strict upper triangular matrix. The GS algorithm can be
used to avoid inverting the high-dimensional matrix, which directly estimates the transmitted signal
vector ŝ:
ŝ(i) = ( D + L)−1 ŷ − L H ŝ(i−1) (14)

where ŝ(0) represents the initial solution and is usually set to a zero vector.

4. Proposed Algorithm

4.1. Hybrid Iterative Algorithm Structure

The SD algorithm based on matrix gradient search can have a good convergence direction at the
beginning of the iteration [18], while the GS iterative algorithm has lower complexity. Therefore, using
the above characteristics, this paper proposed a hybrid iteration of the joint SD and GS algorithm.
The joint algorithm (called the SDGS algorithm) speeds up convergence of the iterative effect of the
algorithm without increasing the complexity, and achieves error performance close to the MMSE ideal
matrix inversion detection method. The steps are in SDGS Algorithm.
SDGS Algorithm

• Step 1: For the diagonal approximation’s initial value setting, Equation (4) can be converted to
W ŝ = ŷ; W is a symmetric positive deﬁnite matrix and a diagonally dominant matrix, so W −1 is
also a diagonally dominant matrix.
• Step 2: Determine the initial solution using D −1 instead of W −1 :

205
Electronics 2018, 7, 382

s(0) = D −1 ŷ (15)

Since D is a diagonal matrix, it is obvious that calculating D −1 requires only low complexity, and
the initial value, s(0) , is set to the initial value of the SD algorithm according to Equation (15).
• Step 3: The iterative results of the ﬁrst two GS algorithms are represented by the SD algorithm,
and the second GS iteration result can be expressed as:

s (2) = ( D + L ) −1 ŷ − L H s(1) = ( D + L)−1 (( D + L) − W )s(1) + ŷ
(16)
= s(1) + ( D + L)−1 ŷ − Ws(1) = s(1) + ( D + L)−1 r (1)

H
r (0) r (0)
where r (1) = ŷ − W s(0) + ur (0) = ŷ − Ws(0) − uWr (0) = r (0) − up(0) ; u = H (0)
(17)
p (0) r

• Step 4: Combine single SD and GS iterations into one hybrid iteration by substituting
Equation (17) and

s(1) = s(0) + ur (0) → s(2) = s(0) + ur (0) + ( D + L)−1 r (0) − up(0) (18)

This represents the ﬁrst two GS iterations as Equation (18); update the mixed iteration value
ŝ(1) = s(2) , and then perform the next GS iteration.
• Step 5: Using the (i − 1)th GS iteration using Equation (14), ideal estimated value ŝ(i) of the
transmitted signal vector s can be obtained by setting the appropriate number of iterations, i:

ŝ(i) = ( D + L)−1 ŷ − L H ŝ(i−1) (19)

Then, ŝ(i) is related to the complex domain for the next soft decision, so the hybrid iterative
algorithm can converge very quickly after a small number of iteration.

4.2. Approximate Log-Likelihood Ratio Calculation

The low-complexity MMSE signal detection algorithm described in [13–16] directly estimates the
transmitted signal vector ŝ without calculating W−1 . The exact calculation of the LLR for the channel
decoder input is described in Section 1 (i.e., using the exact W−1 matrix inversion information), which
is not difficult to find with Equation (7). When the LLR of the first bit transmitted by the ith user is Li,b ,
the inverse W−1 of the MMSE detector filter matrix W needs to be used again to calculate the SINR
of the ith user. Consider using the W approximation of the diagonal property to replace W−1 and
6 −1 ≈ D−1 , and then convert it to the complex domain to get W
D−1 , that is, W 6 c−1 , in order to obtain the
approximate channel gain and NPI variance, expressed as:

μ 6 ii
6i = U (20)

K 2

62i =
v ∑ U6 ji 6ii σ 2
+E (21)
j =i

−1 −1 −1 −1
6 ≈W
where U 6 c Gc = Dc−1 Gc , and E
6≈W
6 c Gc W
6c 6W
=U 6 c = UD
6 c−1 , so we can calculate Yi = μ v2i .
62i /6

206
Electronics 2018, 7, 382

4.3. Complexity Analysis

According to the number of real multiplications required in the algorithm, the computational
complexity of the SDGS detection algorithm proposed in this paper was analyzed. Since all linear
MMSE detection algorithms and the proposed algorithm must calculate the ﬁlter matrix, W = G + I2k ,
and the matched ﬁlter signal, ŷ = HH y, then only the other parts were analyzed for complexity, mainly
using the following three parts of the composition.

4.3.1. Initial Value and First Iteration Calculation

Equation (15) requires 2K multiplications. The ﬁrst iteration is mainly to calculate r(0) = ŷ − Ws(0) ,
p(0)
= Wr(0) , and scalar. Obviously, the respective 4K2 , 4K2 , and 4K sub-multiplications are required.

Combining the ﬁrst iteration of Equation (18), a total of 2K2 + 10K multiplications are required.

4.3.2. GS iteration
Equation (19) can be expressed as (D + L)ŝ(i) = ŷ − LH ŝ(i−1) = c. After i iterations, the calculation
of ŝ(i) mainly comes from the following two steps: First, c is a 2K × 2K strictly lower triangular element

matrix; 2K × 2K and the 2K × 1 vector ŝ(i−1) are multiplied, and c must be multiplied 2K2 − K times.
(i)
Second, in Equation (19), the mth element, ŝm , can be expressed as:
⎧
⎨ c1
L11 , m=1
(i )
sm = cm −∑m −1 ( i ) (22)
⎩ k=1 sk Lmk
, m = 2, . . . , 2K
Lmm

where cm represents the mth element of c, and Lmk represents the mth row and kth column element of
(i)
the lower triangular matrix (D + L). When m = 1, it is obvious that ŝ1 requires 2K multiplications,
(i)
and all ŝm (m = 2, . . . , 2K) require 2K2 − K multiplications, so a total of 2K2 multiplications are
required for each iteration.

4.3.3. LLR calculation

The computational complexity of this part mainly came from the calculation of the effective
channel gain and the NPI variance after equalization. It can be known from Equations (20) and (21)
6 and the pair of matrices E
that all the elements of the matrix U 6 need to be calculated. Obviously, the
former requires 2K2 multiplications, while the latter only requires 2K multiplications. Therefore, a

total of 2K2 + 2K multiplications were required for this step.
In summary, the total complexity required for the joint iterations to be applied to the soft decision
was 2K2 (i + 2) + 12Ki, which reduced the computational complexity by an order of magnitude,
compared to the traditional MMSE algorithm. The complexity of the number of iterations was kept at

O K2 . In addition, considering the application scenarios of hard decision detection, Table 1 also gave
a comparison of the computational complexity in the four detection algorithms.

Table 1. Complexity comparison of four kinds of detection algorithms for hard decision calculation.

Algorithm Real Multiplications Performance

NS [11] 8K3 − 8K2 + 2K (i + 2) + 4K2 − 4K (i > 1) General
CG [15] 4K2 (i + 1) + 10Ki Better
GS [17] 4K2 i + 2K Better
SDGS 2K2 (i + 2) + 12Ki Optimal

5. Simulation Results
We deployed Matlab (R2017a, Mathworks, Natick, MA, USA) for performing analysis and
experimentation. In order to verify the soft and hard detection performance of the SDGS algorithm

207
Electronics 2018, 7, 382

proposed in this paper, this section presents Monte Carlo simulation results based on Matlab. The main
simulation parameters conﬁgured are in Table 2.

Table 2. Simulation parameters.

Parameter Value
Number of transmitter antennas ( N ) 64–128
Number of receiver antennas (K ) 16
Baseband modulation mode 16 QAM
Signal-to-noise ratio (SNR) 10–12 dB
Code rate 1/2
Channel characteristics Uncorrelated Rayleigh fading
Number of iterations (i ) 1000

Figure 2 compares the bit error rate (BER) based on Neumann series (NS) expansion, the conjugate
gradient (CG) detection algorithm, the Gauss–Seidel iterative detection algorithm, the MMSE exact
inversion detection algorithm, and the proposed SDGS joint algorithm under different antenna
configurations. The decision mode is a hard decision; that is, estimated signal vector ŝ is directly judged.
The simulation results showed that the detection performance of the various algorithm increased
with the number of iterations or the number of items expanded by the Neumann series. For example,
when the number of iterations i = 2, the BER performance of the SDGS algorithm was much better
than the BER when the number of items expanded by the Neumann series was 2. By comparing the
performance in Figure 2a,b, it can be seen that with the increase in the ratio of the number of base
station antennas to the number of users ( N/K ), the BER performance of the various algorithms was
greatly improved. For example, if the BER was to reach 10−3 , the MMSE algorithm and the proposed
algorithm require an SNR of about 13 dB when the antenna configuration is 64 × 16, and only 8 dB
when the configuration is 128 × 16.

(a)

(b)

Figure 2. Hard decision bit error rate (BER) performance comparison. (a) Analysis at 64 × 16 antenna
conﬁguration; (b) Analysis at 128 × 16 antenna conﬁguration.

208
Electronics 2018, 7, 382

Figure 3 shows the soft decision simulation results for the two antenna configurations. With BER
based on the NS, CG, and GS iterative detection algorithms, the MMSE exact inversion detection
algorithm and the SDGS joint algorithm were compared. We set the system’s convolutional code rate
to 1/2, and the LLR calculation used the approximate calculation method described in this paper.
Simulation results showed that no matter what kind of MMSE receiver was used, the soft decision
was checked. The measured performance was much better than the hard decision. For example, when
the BER reached 10−4 , when the antenna was configured, the MMSE algorithm and the proposed
algorithm required an SNR of 10 dB for hard decisions and only 5 dB for soft decisions. In addition, for
the same number of iterations, the BER performance of the SDGS algorithm proposed in this paper was
better than the other three simplified algorithms, and after a few iterations, the detection performance
could quickly approach the detection performance of the ideal MMSE filter matrix inversion.

(a)

(b)
Figure 3. Soft decision bit error rate (BER) performance comparison. (a) Analysis at 64 × 16 antenna
conﬁguration; (b) Analysis at 128 × 16 antenna conﬁguration.

Figure 4 shows the hard decision BER comparison of the proposed SDGS algorithm with NS, CG,
GS and MMSE under a high fading scenario with 128 × 16 antenna conﬁguration. As can be seen from
Figure 4 the BER of the proposed SDGS algorithm was better and followed the MMSE performance
with increasing SNR and number of iterations. Moreover, due to high fading impact on the SNR, there
was a gap between the proposed SDGS algorithm and MMSE algorithm at a high SNR level.
Figure 5 shows the BER comparison of the proposed SDGS algorithm with NS, CG, GS and MMSE
under a low fading level and 128 × 16 antenna conﬁguration. It can be seen from Figure 5 that all
the algorithms showed lower BER and better performance as compared with the hard decision BER
performance in Figures 2 and 4. Therefore, to keep the system performance in a suitable level, the

209
Electronics 2018, 7, 382

fading level and number of iterations should be considered, which has an obvious impact on the
system’s overall performance. Furthermore, the proposed SDGS algorithm in Figure 5 had a close BER
performance with MMSE which indicated that the SDGS algorithm showed better performance in the
low fading level.

Figure 4. Hard decision bit error rate (BER) performance comparison with high fading level at 128 × 16
antenna conﬁguration.

Figure 5. Soft decision bit error rate (BER) performance comparison with slow fading level at 128 × 16
antenna conﬁguration.

6. Conclusions
Signal detection methods based on MMSE ﬁltering in massive MIMO systems are widely used,
but matrix inversion with higher complexity makes it more difﬁcult to implement them in practical
applications. Some methods of approximate inversion, such as Neumann series expansion, has reduced
the detection complexity, but due to a large degree of detection performance loss; others avoid the
complex matrix inversion and directly estimate the signal vector. Although, computational complexity
is reduced by orders of magnitude, detection performance needs to be improved. Based on the MMSE
criterion, this paper proposes a low-complexity, hybrid, iterative SDGS joint detection algorithm,
which directly estimates the user’s transmitted symbol vector and can quickly converge to obtain an
ideal estimation value with a few simple iterations. The matrix inversion operation is avoided, and

210
Electronics 2018, 7, 382

algorithm complexity is kept at O K2 . In addition, in order to make full use of soft information, the
algorithm is applied to the soft decision, and an approximate calculation method of the LLR for channel
decoding is given, which further improves the signal detection performance. Theoretical derivation
and simulation results show that the SDGS algorithm can be used as one of the most effective solutions
for signal detection in massive MIMO systems.

Author Contributions: I.K. conceived and designed the presented idea and developed the theory, performed
the simulations, and wrote the paper. M.H.Z. analyzed the research and performed experimentations, and M.A.
provided extensive technical support throughout the research. S.K. provided extensive support in the theoretical
analysis and provided funding support.
Funding: This research was supported by the Basic Science Research Program through the National Research
Foundation of Korea (NRF-2016R1D1A1B03934653).
Conﬂicts of Interest: The authors declare no conﬂicts of interest.

References
1. Chang, Y.K.; Ueng, F.B.; Jhang, Y.W. Turbo MIMO-OFDM Receiver in Time-Varying Channels. KSII Trans.
Internet Inf. Syst. 2018, 12, 3704–3724.
2. Tseng, S.M.; Chen, Y.F. Average PSNR Optimized Cross Layer User Grouping and Resource Allocation for
Uplink MU-MIMO OFDMA Video Communications. IEEE Access 2018, 6, 50559–50571. [CrossRef]
3. Ghosh, A.; Ratasuk, R.; Mondal, B.; Mangalvedhe, N.; Thomas, T. LTE-advanced: Next-generation wireless
broadband technology. IEEE Wirel. Commun. 2010, 17, 10–22. [CrossRef]
4. Khan, I.; Zafar, M.H.; Jan, M.T.; Lloret, J.; Basheri, M.; Singh, D. Spectral and Energy Efficient Low-Overhead
Uplink and Downlink Channel Estimation for 5G Massive MIMO Systems. Entropy 2018, 20, 92. [CrossRef]
5. Rusek, F.; Persson, D.; Lau, B.K.; Larsson, E.G.; Marzetta, T.L.; Edfors, O.; Tufvesson, F. Scaling up MIMO:
Opportunities and challenges with very large arrays. IEEE Signal Process. Mag. 2012, 30, 40–60. [CrossRef]
6. Larsson, E.G.; Edfors, O.; Tufvesson, F.; Marzetta, T.L. Massive MIMO for next-generation wireless systems.
IEEE Commun. Mag. 2014, 52, 186–195. [CrossRef]
7. Marzetta, T.L. Noncooperative cellular wireless with unlimited numbers of base station antennas. IEEE Trans.
Wirel. Commun. 2010, 9, 3590–3600. [CrossRef]
8. Khan, I.; Singh, D. Efficient compressive sensing based sparse channel estimation for 5G massive MIMO
systems. AEU-Int. J. Electron. Commun. 2018, 89, 181–190. [CrossRef]
9. Ngo, H.Q.; Larsson, E.G.; Marzetta, T.L. Energy and spectral efficiency of very large multiuser MIMO
systems. IEEE Trans. Commun. 2011, 61, 1436–1449.
10. Dai, L.; Wang, Z.; Yang, Z. Spectrally efficient time-frequency training OFDM for mobile large-scale MIMO
systems. IEEE J. Sel. Areas Commun. 2013, 31, 251–263. [CrossRef]
11. Benmimoune, M.; Driouch, E.; Ajib, W. Joint antenna selection in grouping in Massive MIMO Systems.
In Proceedings of the 10th International Symposium on Communication Systems, Networks and Digital
Signal Processing (CSNDSP), Prague, Czech Republic, 20–22 July 2016; pp. 1–6.
12. Khan, I. A Robust Signal Detection Schemefor 5G Massive Multiuser MIMO Systems. IEEE TVT. 2018, 67,
9567–9604.
13. Wu, M.; Yin, B.; Wang, G.; Dick, C.; Cavallaro, J.R.; Studer, C. Large-scale MIMO detection for 3GPP LTE:
Algorithms and FPGA implementations. IEEE J. Sel. Top. Signal Process. 2014, 85, 916–929. [CrossRef]
14. Tang, C.; Liu, C.; Yuan, L.; Xing, Z. High precision low complexity matrix inversion based on Newton
iteration for data detection in the massive MIMO. IEEE Commun. Lett. 2016, 20, 490–493. [CrossRef]
15. Gao, X.; Dai, L.; Yuen, C.; Zhang, Y. Low-complexity MMSE signal detection based on Richardson method
for large-scale MIMO systems. In Proceedings of the Vehicular Technology Conference, Vancouver, BC,
Canada, 14–17 September 2014; pp. 1–5.
16. Gao, X.; Dai, L.; Hu, Y.; Wang, Z.; Wang, Z. Matrix inversion-less signal detection using SOR method for
uplink large-scale MIMO systems. In Proceedings of the IEEE Global Communications Conference, Qingdao,
China, 8–12 December 2014; pp. 3291–3295.

211
Electronics 2018, 7, 382

17. Hu, Y.; Wang, Z.; Gaol, X.; Ning, J. Low-complexity signal detection using the CG method for uplink
large-scale MIMO systems. In Proceedings of the IEEE International Conference on Communications, Macau,
China, 19–21 November 2014; pp. 477–481.
18. Hageman, L.A. The Iterative Solution for Large Linear Systems; Academic Press: Manhattan, NY, USA, 1971;
pp. 9–22.
19. Dai, L.; Gao, X.; Su, X.; Han, S.; Chih-Lin, I.; Wang, Z. Low-complexity soft-output signal detection based on
a Gauss-Seidel method for uplink multiuser large-scale MIMO systems. IEEE Trans. Veh. Technol. 2014, 64,
4839–4845. [CrossRef]
20. Wu, M.; Yin, B.; Vosoughi, A.; Studer, C.; Cavallaro, J.R.; Dick, C. Approximate matrix inversion for
high-throughput data detection in the large-scale MIMO uplink. In Proceedings of the International
Symposium on Circuits & Systems, Beijing, China, 19–23 May 2013; pp. 12155–12158.

© 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access
article distributed under the terms and conditions of the Creative Commons Attribution
(CC BY) license (http://creativecommons.org/licenses/by/4.0/).

212
electronics
Article
A Novel Iterative Discrete Estimation Algorithm for
Low-Complexity Signal Detection in Uplink Massive
MIMO Systems
Hui Feng 1 , Xiaoqing Zhao 2 , Zhengquan Li 2, * and Song Xing 3
1 Department of Electronic Science and Technology, Tongji University, Shanghai 201804, China
2 Jiangsu Provincial Engineering Laboratory of Pattern Recognition and Computational Intelligence,
Jiangnan University, Wuxi 214122, China
3 Department of Information Systems, California State University, Los Angeles, CA 90032, USA
* Correspondence: [email protected]; Tel.: +86-0510-8591-0633

Received: 1 August 2019; Accepted: 29 August 2019; Published: 2 September 2019

Abstract: In this paper, a novel iterative discrete estimation (IDE) algorithm, which is called the
modiﬁed IDE (MIDE), is proposed to reduce the computational complexity in MIMO detection in
uplink massive MIMO systems. MIDE is a revision of the alternating direction method of multipliers
(ADMM)-based algorithm, in which a self-updating method is designed with the damping factor
estimated and updated at each iteration based on the Euclidean distance between the iterative
solutions of the IDE-based algorithm in order to accelerate the algorithm’s convergence. Compared to
the existing ADMM-based detection algorithm, the overall computational complexity of the proposed

MIDE algorithm is reduced from O Nt3 + O Nr Nt2 to O Nt2 + O ( Nr Nt ) in terms of the number
of complex-valued multiplications, where Ntand Nr are the number of users and the number of
receiving antennas at the base station (BS), respectively. Simulation results show that the proposed
MIDE algorithm performs better in terms of the bit error rate (BER) than some recently-proposed
approximation algorithms in MIMO detection of uplink massive MIMO systems.

Keywords: massive MIMO systems; MIDE algorithm; low computational complexity; BER

1. Introduction
With the development of the mobile Internet and the Internet of Things, much high data rate
communication is required in the new generation of cellular networks like 5G [1]. By equipping
hundreds of antennas at the base station (BS) serving tens of users, the massive multiple-input
multiple-output (MIMO) is deemed one key technology for meeting the 5G requirements due to its
improvements in data throughput, link reliability, higher spectral efﬁciency, and better communication
quality compared with the traditional MIMO usage [2–4].
However, when applying the massive MIMO, a major computational challenge is the data
detection in uplink MIMO systems due to the large increase in the system dimensions [5].
The maximum likelihood (ML) is the optimal detection approach on data detection, but its
computational complexity grows exponentially with the number of user equipment (UE) and
the modulation order [6,7]. Some suboptimal detection alternatives are proposed to reduce the
computational complexity while obtaining a good bit error rate (BER) performance. For example,
the linear minimum mean squared error (LMMSE) algorithm is one of the widely-used suboptimal
detection algorithms with near-optimal BER performance and reduced computational complexity [8].
However, the LMMSE algorithm still involves the computation of the Gram matrix, as well as matrix

inversion, where their corresponding computational complexity is O Nr Nt2 and O Nt3 , respectively,
with Nt denoting the number of single-antenna UE and Nr denoting the number of antennas at the BS.

Electronics 2018, 8, 980; doi:10.3390/electronics8090980 213 www.mdpi.com/journal/electronics

Electronics 2018, 8, 980

It is worth noting that some approaches for approximating the matrix inversion have been
proposed to reduce the computational complexity [9–19], among which, for example, the Neumann
series (NS) approximation is used to approximate the matrix inversion by a series of truncated
NS expansions [9,10]. However, only marginal reduction in the computational complexity can
be reached with the increased terms of the NS expansion. Hence, various classical iterative
algorithms have been provided to approximate the inverse matrix in LMMSE detection to achieve
low computational complexity, which include the Richardson (RI) algorithm [11], the Jacobi
algorithm [12–15], the successive over relaxation (SOR) algorithm [16], the symmetric successive
over relaxation (SSOR) algorithm [17], and the Gauss–Seidel (GS) algorithm [18,19]. The computational
complexity of the matrix inverse is reduced by these approximation-based detection algorithms from

O Nt3 to O Nt2 . It is noted that these algorithms involving the inverse-matrix approximation in
LMMSE detection (also known as the approximated LMMSE) achieve a near-LMMSE performance,
but with lower computational complexity.
It is well known that the approximated LMMSE detection algorithms achieve a substantial
performance loss when Nr > Nt in multiuser massive MIMO systems. Various algorithms have been
proposed to obtain a better BER performance than that of LMMSE detection in multiuser massive
MIMO systems, which include the non-convex and the convex optimization algorithms [20–22].
For example, the alternating minimization (AltMin) algorithm is one of the non-convex
optimization algorithms, which is applied to the data detection in a multiuser massive MIMO
system [20]. Specifically, the AltMin algorithm converts the ML detection problem into a sum of convex
functions by decomposing the received vector into multiple sub-vectors. Hence, the non-convex
problem is transformed into the convex problem in the AltMin algorithm. The AltMin algorithm
shows better BER performance than that of the LMMSE detector in overloaded network scenarios
with relatively low computational complexity. However, it shows near-LMMSE performance with
even higher computational complexity when the ratio of the number of BS antennas to the number of
single-antenna users is larger.
Similarly, some convex optimization algorithms are used to solve the non-convex optimization
problems, which include, for example, the alternating direction method of multipliers (ADMM)
detection algorithm [21,22]. In the multiuser massive MIMO system, ADMM demonstrates better BER
performance than the LMMSE detection algorithm with the relatively low computational complexity
of the iterative procedure. However, the computational complexity of the preprocessing in the ADMM
algorithm includes the calculation complexity of the Gram matrix and LDLdecomposition [23,24],
which results in very high computational complexity for massive MIMO systems.
To make a tradeoff between the performance and the computational complexity with different
antenna configurations, the iterative discrete estimation (IDE) algorithm is integrated into the ADMM
algorithm [25], which presents low computational complexity due to the avoidance of the calculation
of the Gram matrix and LDL decomposition. Motivated by the aforementioned algorithms, we propose
a modified IDE (MIDE) algorithm to present a better BER performance and lower computational
complexity than the ADMM algorithms. To summarize, the main contributions of this work are listed
as follows.
• A novel iterative data detection algorithm for uplink multiuser massive MIMO systems is
designed by exploiting the IDE-based algorithmic framework. The proposed MIDE algorithm refactors
the detection algorithm as a series of simpler subproblems with closed-form solutions.
• A heuristic damping factor is defined based on the Euclidean distance instead of a fixed factor.
Compared with the fixed damping factor, this self-updated damping factor contributes to a faster
convergence in the proposed MIDE algorithm.
• The computational complexity analysis indicates that the proposed algorithm has a lower
computational complexity than the traditional approximated detection approximation algorithms
(LMMSE, AltMin, and ADMM), under the same BER performance. Specifically, the complexity of the

novel MIDE detection algorithm is only O Nt2 + O ( Nr Nt ).

214
Electronics 2018, 8, 980

• Simulation results reveal that for the typical independent and identically distributed (i.i.d.)
frequency flat Rayleigh fading channel in massive MIMO systems, the proposed MIDE detection
algorithm performs better than the ADMM and AltMin-based detection algorithms and the LMMSE
detection algorithm in terms of BER performance with various system configurations.
The rest of the paper is organized as follows. In Section 2, we briefly introduce the system
model. Section 3 specifies the proposed low-complexity signal detection based on the IDE algorithm
and performs the computational complexity analysis of the algorithms. In Section 4, the numerical
simulation results of the BER performance are presented. Finally, Section 5 concludes the paper.
Notation: Throughout the paper, the lowercase and uppercase boldface type is used for vectors
(e.g., a) and matrices (e.g., A). The superscripts (·)−1 and (·) H denote the matrix inversion and the
conjugate transpose, respectively. The L2 norms of the vectors are represented by ·2 . and denote
the real part and the imaginary part of the complex-valued signal, respectively.
The typical uplink massive MIMO system is considered in this work, as shown in Figure 1,
where there are Nt single-transmitting antenna UE devices and Nr receiving antennas at the BS [26].
In general, Nr is larger than Nt for an uplink massive MIMO communication system [27].

0RGXODWRU
V [ \

,QIRUPDWLRQ 'HWHFWHG
ELWV ELWV
...
...

6LJQDO
63
...

'HWHFWRU
1W 1U

0RGXODWRU
V 1W [ 1W \ 1U

Figure 1. MIMO system signal detection structure diagram.

2. System Model
At the transmitter side, the source information s = [s1 , · · · , si , · · · s Nt ] T where each symbol s
is mapped to constellation symbols by taking symbols from a set of the constellation alphabet Ω.
The transmitted signal x = [x1 , · · · , xi , · · · x Nt ] T is constructed by the modulated symbol s, where xi
denotes a signal transmitted by the ith UE device. The vector y = [y1 , · · · , yi , · · · y Nr ] T represents the
receiving signal at the BS, where yi denotes a signal received by the ith receive antenna, and:

y = Hx + w (1)

where w is the Nr -by-one additive white Gaussian noise (AWGN) vector following CN 0, σ2 with σ2
representing the average power of the noise. In Equation (1), the matrix H denotes the Nr -by-Nt ﬂat
fading channel gain, and H can be expressed as:
⎡ ⎤
h11 h12 · · · h1Nt
⎢ ⎥
⎢ h21 h22 · · · h2Nt ⎥
H=⎢ ⎢ .. .. .. .. ⎥
⎥ (2)
⎣ . . . . ⎦
h Nr 1 h Nr 2 · · · h Nr Nt
where the element hi,j , i = {1, 2, · · · , Nr },j = {1, 2, · · · , Nt } denotes the channel impulse response
between the receiving antenna j and the user i. It is noted that hij follows an i.i.d. Gaussian distribution
with zero mean and unit variance. In addition, the channel matrix H is assumed to be known perfectly
at the BS [28,29].

215
Electronics 2018, 8, 980

3. Proposed MIDE Algorithm

We ﬁrst propose a low-complexity signal detection algorithm, which converts the ML problem
into the constrained non-convex problem and utilizes the IDE-based algorithm to solve the problem
iteratively. Then, a self-updating method is proposed to update the damping factor based on the
Euclidean distance, which accelerates the convergence in the proposed MIDE algorithm. Finally,
the computational complexity analysis of the proposed algorithm is provided with the comparison to
the conventional algorithms.

3.1. ML Problem Formulation and IDE-Based Algorithm

Detecting the transmitted symbol vector x at the BS can be done by minimizing the squared
Euclidean distance between the received signal vector y and the hypothesized received signal Hx with
the vector x constrained to the modulation constellation Ω Nt , which can be represented as:

x̂ = arg min y − Hx22 (3)

x ∈Ω Nt

It is noted that in Equation (3), the ﬁnite-alphabet constraint x ∈ Ω Nt can be converted into the
indicator function IΩ (x̂), which is given by:

0, ifx̂ ∈ Ω Nt
IΩ (x̂) = (4)
∞, otherwise

By combining Equation (3) and (4), the signal detection problem can be converted into the
constrained optimization problem, which is given by:

minimize y − Hz22 + IΩ (x̂)

z,x̂
(5)
s.t. z − x̂ = 0

where z is the least-squares solution of the least-squares and x̂ is the estimated transmitted symbol.
Since the optimization problem is deﬁned over complex-valued variables, the Lagrangian objective
function for the optimization problem in Equation (5) can be remodeled as:

Lγ (z, x̂, u) = y − Hz22 + IΩ (x̂) + γ z − x̂ + u H (z − x̂) (6)

where γ > 0 is the penalty parameter and u is the dual vector. In order to solve this problem efficiently,
we decompose it into three sub-problems. First, we solve z while holding x̂ and u fixed; then, we solve
x̂ while holding z and u fixed; finally, we solve u while holding x̂ and z fixed. Specifically, the following
procedure is repeated with iterations.

zk = arg min Lγ z, x̂k−1 , uk−1 (7)
z

x̂k = arg min Lγ zk , x̂k , uk−1 (8)
x̂

uk = uk−1 + γ zk − x̂k (9)

Note that the z-minimization procedure is convex, but the x̂-update is projected onto a convex set
Ω Nt . To make the iterative procedure converge, the IDE-based algorithm is applied to remove the dual
vector u at each iterative and turn the z-update and x̂-update to reach a consensus. After manipulation,
the x̂-update involves solving a linearly-constrained minimum Euclidean-norm problem, and the
z-update in IDE is given by:

216
Electronics 2018, 8, 980

−1
zk = x̂k−1 + diag H H H H H y − Hx̂k−1 (10)

Hence, the x̂-update step can be represented as:

x̂k+1 =ΠΩ zk+1 (11)

where ΠΩ (·) is the projection onto Ω Nt , i.e., the elements of zk+1 can be implemented through simple
rounding of each component to the closest element in Ω.
Additionally, to make the iteration converge, a ﬁxed damping factor α is employed in the
IDE-based algorithm to update the iterative solution x̂, i.e.,

x̂dk+1 = 1 − αk x̂kd + αk x̂k+1 (12)

where x̂dk+1 denotes the solution after updating. By applying x̂d to the z-update, the IDE-based
algorithm for detection can be expressed as:
−1
zk = x̂kd−1 + diag H H H H H y − Hx̂dk−1 (13)

x̂k = ΠΩ zk (14)

x̂kd = 1 − αk x̂dk−1 + αk x̂k (15)

3.2. Modiﬁed IDE-Based Detection Algorithm with Self-Update Damping

The performance of the IDE-based detection algorithm is influenced by the choice of the damping
factor. In early studies, this parameter was fixed, e.g., 0.05 in [25]. However, a fixed damping factor is
not applicable for all cases, and an optimal damping factor is not easy to obtain. We propose an MIDE
herein to decide a proper damping factor α, by which the x̂d -update step is analyzed as follows.

x̂1d = 1 − α0 x̂0d + α0 x̂1 ,

x̂2d = 1 − α1 x̂1d + α1 x̂2

= 1 − α1 1 − α0 x̂0d + α0 x̂1 + α1 x̂2

= 1 − α1 1 − α0 x̂0d + 1 − α1 α0 x̂1 + α1 x̂2 (16)
..
.

x̂kd+1 = 1 − αk x̂kd + αk x̂k+1

= 1 − αk−1 · · · 1 − α0 x̂0d + 1 − αk−1 · · · 1 − α1 α0 x̂1 + · · · + αk−1 x̂k+1

Since there is no prior information of the final result x̂d , the initial value of x̂0d can be set as a
zero vector. Hence, the expression for x̂d in Equation (16) is composed of the solution of x̂ with
different values of the damping factor. Based on the expression in Equation (16), the convergence of the
iterations can be measured by the difference between x̂kd and x̂k+1 . Specifically, the Euclidean distance,
which is one of the widely-used approaches for measuring the distance between two vectors [30],
is defined as:
8
9N
9 t
= : ∑ x̂kd (i, 1) − x̂k+1 (i, 1)
2
dk x̂kd , x̂k+1 (17)
i =1

217
Electronics 2018, 8, 980

We can notice from Equation (17) that the smaller the distance d, the closer the two vectors,
implying the convergence of the IDE iterations. Based on this discussion, the following heuristic
damping factor at the kth iteration is deﬁned as:

dk
αk = (18)
dk +q
where q is a positive constant. Obviously, a higher value of d leads to a higher α, and vice versa.
In other words, when dk → 0, αk → 0, and when dk → 1, αk → 1. We utilize the ﬁrst iteration result d1
to obtain the value of p. According to Equation (16), d1 can be calculated as:
8
9N
9 t
d = : ∑ x̂0d (i, 1) − x̂1 (i, 1)
1 2

i =1
8
9N
9 t
= : ∑ (x̂1 (i, 1))
2
(19)
i =1
8
9N
9 t
= : ∑ ( (x̂1 (i, 1))) + ( (x̂1 (i, 1)))
2 2

i =1

It is noticed that the value of d1 is varying and decided based on several factors such as the
modulation method, the noise in the system, etc. For the ease of calculation, the expectation of d1
is computed instead of the direct calculation of d1 . Since the real part and the imaginary part of the
vector x̂1 have the same uniform distribution, the expectation for d1 is obtained as:
⎛8
9N ⎞
9 t
E d1 = 2E ⎝: ∑ ( (x̂1 (i, 1))) ⎠
2
(20)
i =1

The expectation of d1 is based on the constellation points of the modulation scheme. With 16-QAM,

for example, x̂1 (i, 1) can be {−3, −1, +1, +3}, and the probability p of each possible value of the
point is the same without the prior information, i.e., p = 0.25 in this case. Hence, Equation (19) can be
rewritten as:
⎛8
9N ⎞ 7
9 t ∑ cand2
E d1 = 2E ⎝: ∑ ( (x̂1 (i, 1))) ⎠ = 2Nt
2
√ (21)
i =1 M

where cand represents the candidate value set of x̂1 (i, 1) and M represents the modulation
cardinality. For the ease of description, we still take the 16-QAM as an example. Then, cand is denoted
as the candidate values {−3, −1, +1, +3} and ∑ cand2 = (−3)2 + (−1)2 + 12 + 32 . M represents the
modulation cardinality. Based on experience, the value of α1 = 0.8 is set as 0.8. Then, substituting
Equation (22) into Equation (18) yields the estimation of q as:
7
E d1 Nt ∑ cand2
q= = √ (22)
4 2 M
The simulation results in Section 4 show that the proposed MIDE algorithm can improve the BER
performance signiﬁcantly compared to the IDE-based algorithm, which employs the ﬁxed damping
factor. The procedure of the proposed MIDE detection algorithm is illustrated in Algorithm 1.

218
Electronics 2018, 8, 980

Algorithm 1: The pseudocode of the MIDE detection algorithm.

1 Input: 1) H, the channel matrix
2 2) y, the received signal matrix
3 3) K, the number of iterations
4 Output: the detected signal x̃ ← x̂K
5 Preprocessing
−1
1: D ← diag H H H ;

2: W ← DH H ;
.
cand2
3: q ← N2t ∑√ ;
M
6 Initialization
4: x̂0d = 0;
Iterative process (repeat K times)
5: for k = 1–K (iteration
times) do
6: zk ← x̂kd−1 + W y − Hx̂dk−1

7: x̂k ←ΠΩ zk
8: rk ← x̂kd−1 − x̂k
7
Nt
2 2
9: d ←
k
∑ rk (i, 1) + rk (i, 1)
i =1
k
10: αk ← d
;
d +q
k

11: x̂d ← 1 − αk x̂dk−1 + αk x̂k ;

12: end for

3.3. Analysis of the Complexity of the Algorithm

In this subsection, we analyze the computational complexity of the proposed MIDE algorithm,
which is dominated by the multiplications operations. Hence, we compute the number of
complex-valued multiplications as the measurement of the computational complexity of the
algorithm [31].
It can be found in Algorithm 1 that the computational complexity is composed of three parts
including (1) preprocess, (2) the x-update procedure, and (3) the xd -update procedure.

(1) Preprocess: The ﬁrst part comes from the related computation before the iterative process.
The main factors affecting the computational complexity of the preprocess are the computation
of D and the multiplication of the Nt × Nt diagonal matrix and the Nt × Nr matrix. Let hi be
the ith column of the complex-valued channel matrix H. Then, the diagonal calculation can be
presented as:
1
hiH h j
, i=j
Dij = (23)
0, i=j

Therefore, the complexity of the preprocessing is counted as Nt2 + Nr Nt .

(2) x-update procedure: The second part comes from the x-update procedure, which involves the
computation of two multiplications of the Nt × Nt matrix and the Nt × 1 vector. Thus, the
complexity is counted as 2Nr Nt .
(3) xd -update procedure: The third part originates from updating the value of xd . As can be seen in
the expression of xkd , the computation of this part includes the update of the Euclidean distance dk

219
Electronics 2018, 8, 980

and two scalar multiplications with Nt × 1 vectors. Then, the complexity in this part is counted
as 3Nt .

Therefore, the overall required number of complex multiplications by the MIDE algorithm is
Nt Nr + Nt2 + K (2Nr Nt + 3Nt ). For comparison, the similar calculations of the complexity of the typical
detection algorithms (e.g., LMMSE with full matrix inversion, AltMin, ADMM) are given, as well as
the one of the proposed MIDE in Table 1.

Table 1. Complexity comparison in terms of complex multiplication operations. LMMSE, linear

minimum mean squared error; AltMin, alternating minimization; ADMM, alternating direction method
of multipliers; MIDE, modiﬁed iterative discrete estimation.

Algorithm Complexity
LMMSE [8] 5 3
6 Nt + + 43 Nt + Nr Nt2 + Nr Nt
3 2
4 Nt
AltMin [18] (4K + 2) Nr Nt
2 Nt + Nt + Nr Nt + Nr Nt + K Nt + Nr Nt
1 3 2 2 2
ADMM [20]
MIDE Nt Nr + Nt2 + K (2Nr Nt + 3Nt )

Note that all these algorithms utilize the approximation approaches to solve the ML problem.
It is obvious that the proposed MIDE algorithm and AltMin algorithm have lower computational
complexity at each iteration among the compared iterative approaches. The numerical analysis will
be provided for further analysis of the computational complexity, which depends on the number of
iterations K and is presented in Section 4.

4. Simulation Results

4.1. BER Performance Evaluation

The BER performance of the proposed algorithm was evaluated and compared with the ones of
other detection algorithms by numerical simulations. The simulation parameters are listed in Table 2.
Several typical detection algorithms were selected for comparison, which were introduced in Section 1
and listed as LMMSE, AltMin, and ADMM.
We first considered the number of iterations for the antenna configurations of Nt × Nr = 16 × 128,
32 × 128 and 64 × 128. Figure 2 illustrates the BER performance of the proposed MIDE detection
algorithm against the number of iterations, where the SNR was set as 3 dB. It is observed that with
the increase of Nr /Nt , the convergence number of the iterations required by the proposed MIDE was
almost the same and a very small one, e.g., 10 in all simulations. This demonstrated a reliable BER
performance and a fast detection convergence in our proposed MIDE algorithm. Moreover, with the
increase of the transmitting antennas, the diversity gain of the MIMO system decreased, leading to a
degradation of the BER performance of the proposed algorithm and the compared MIMO detectors.

Table 2. Simulation parameters.

Channel Model Uncorrelated Rayleigh Flat Fading

Modulation scheme 16-QAM
Number of transmitting antennas (Nt ) 16, 32, 64
Number of receiving antennas (Nr ) 128

220
Electronics 2018, 8, 980

%(5

0,'([
0,'([
0,'([

7LWHUDWLRQ

Figure 2. BER performance versus proposed iterations with SNR = 3 dB.

Figure 3 depicts the BER performance comparison of the IDE-based detection algorithm with the
ﬁxed damping factor α = 0.05 and the proposed MIDE detection with the self-updated damping factor.
It is clear that the proposed MIDE-based detection with the self-updated damping factor showed better
BER performance than the conventional IDE-based detection algorithm with all antenna conﬁgurations.

%(5

,'([
0,'([
,'([
0,'([
,'([
0,'([

615 G%

Figure 3. BER performance of the conventional damping factor and the proposed damping factor.

Moreover, Figure 4 compares the BER performances of the proposed MIDE algorithm and the
conventional ADMM algorithm. From the ﬁgure, it is clear that the BER performance of both algorithms
degraded when the number of users increased. However, it is observed that the proposed MIDE-based
detection performed better than the conventional ADMM detection in terms of BER performance with
all antenna conﬁgurations. Furthermore, we can observe from Figure 4 that when the target of BER
was set as 10−3 , the SNR required by the proposed algorithm was at least 0.5 dB less than the one of
the conventional ADMM algorithm.

221
Electronics 2018, 8, 980

%(5

$'00>@[
0,'([
$'00>@[
0,'([
$'00>@[
0,'([

615 G%

Figure 4. BER performance comparison of the conventional ADMM and the proposed algorithm.

Finally, we show the BER performance comparison of the conventional AltMin algorithm,
the LMMSE algorithm, and the proposed MIDE algorithm in Figure 5.

%(5

0,'([
$OW0LQ>@[
/006(>@[
0,'([
$OW0LQ>@[
/006(>@[
0,'([
$OW0LQ>@[
/006(>@[

615 G%

Figure 5. BER performance comparison of the AltMin algorithm, the LMMSE algorithm, and the
proposed algorithm.

It is clear that with any ratio of Nr /Nt , the proposed MIDE algorithm performed better than all
the compared algorithms in terms of BER performance.

4.2. Computational Complexity Comparison

The computational complexity of ADMM, AltMin, and the proposed MIDE algorithms depends
on the number of iterations K. The compared algorithms had a different number of iterations to reach
convergence with different antenna conﬁgurations. We ﬁxed the number of receiving antennas to
Nr = 128, and the number of transmitting antennas Nt was increased from 16 to 84. Further, we set
K = 5, 14, and 10 for the conventional ADMM algorithm, the AltMin algorithm [20], and the proposed

222
Electronics 2018, 8, 980

MIDE algorithm, respectively, based on the convergence simulations. Figure 6 illustrates the total
number of multiplications vs. the number of transmitting antennas, which was based on the analysis
of the computational complexity of the algorithms in Section 3.3.

1XPEHURIFRPSOH[YDOXHGPXOWLSOLFDWLRQRSUDWLRQV /006(>@
$OW0LQ>@
$'00>@
0,'(

1XPEHURIXVHUV

Figure 6. Computational complexity comparison against the number of users.

From Figure 6, we can see that the computational complexity increased with the number of
users in all compared algorithms. However, the proposed MIDE algorithm achieved the lowest
computational complexity among all compared algorithms under various antenna conﬁgurations.
Speciﬁcally, the MIDE algorithm showed a relatively lower computational complexity than the AltMin
algorithm, which was proven to have a low complexity detection for uplink massive systems in [20].
In addition, from Figure 6, the proposed MIDE algorithm achieved much lower computational
complexity than the LMMSE and ADMM detection algorithms when the dimension of the MIMO
system became larger. As a consequence, the proposed MIDE detection is much more applicable for
massive MIMO systems with its low computational complexity.

5. Conclusions
In this paper, we proposed a low-complexity, IDE-based detection algorithm in uplink massive
MIMO systems. The proposed MIDE algorithm avoided the calculation of the Gram matrix,
the matrix inversion, and LDL decomposition to reach a low computational complexity. In addition,
a self-updating damping method was provided with the damping factor estimated and updated at
each iteration based on the Euclidean distance between the latest two detection solutions, which
accelerated the convergence of the IDE-based detection algorithms. Simulation results showed that
the proposed MIDE algorithm performed better than the conventional LMMSE, AltMin, and ADMM
detection algorithms in terms of the BER performance and the computational complexity.

Author Contributions: H.F. and X.Z. contributed the idea generation; S.X. and X.Z. conducted the study design
and paper writing; Z.L. helped the analysis of the simulation results.
Funding: This work is supported in part by the National Natural Science Foundation of China (No. 61571108,
No. 61701197, and No. 61801193), the open research fund of National Mobile Communications Research Laboratory
of millimeter wave, Southeast University (No. 2018D15), the open research fund of the National Key Laboratory
of millimeter wave, Southeast University (No.K201918), the Open Foundation of Key Laboratory of Wireless
Communication, Nanjing University of Posts and Telecommunication (No. 2017WICOM01), and the Wuxi Science
and Technology Development Fund (No. H20191001).
Conﬂicts of Interest: The authors declare no conﬂict of interest.

223
Electronics 2018, 8, 980

References
1. Xiang, W.; Zheng, K.; Shen, X.S. 5G Mobile Communications; Springer International Publishing: Basel,
Switzerland, 2017.
2. Lu, L.; Li, G.Y.A.; Swindlehurst, L.; Ashikhmin, A.; Zhang, R. An overview of massive MIMO: Benefits and
challenges. IEEE J. Sel. Topics Signal Process. 2014, 8, 742–758. [CrossRef]
3. Wu, M.; Yin, B.; Wang, G.; Dick, C.; Cavallaro, J.R.; Studer, C. Large-scale MIMO detection for 3GPP LTE:
algorithms and FPGA implementations. IEEE J. Sel. Topics Signal Process. 2014, 8, 916–929. [CrossRef]
4. Ngo, H.; Larsson, E.; Marzetta, T. Energy and spectral efficiency of very large multiuser MIMO systems.
IEEE Trans. Commun. 2012, 61, 1436–1449.
5. Rusek, F.; Persson, D.; Lau, B.K.; Larsson, E.G.; Marzetta, T. L.; Edfors, O.; Tufvesson, F. Scaling up MIMO:
Opportunities and challenges with very large arrays. IEEE Signal Process. Mag. 2013, 30, 40–60. [CrossRef]
6. Chockalingam, A.; Rajan, B.S. Large MIMO Systems; Cambridge University Press: Cambridge, UK, 2014.
7. Kuchi, K.; Ayyar, A.B. Performance analysis of ML detection in MIMO systems with co-channel interference.
IEEE Commun. Lett. 2011, 15, 786–788. [CrossRef]
8. Mandloi, M.; Bhatia, V. Low-complexity near-optimal iterative sequential detection for uplink massive
MIMO systems. IEEE Commun. Lett. 2017, 21, 568–571. [CrossRef]
9. Gustafsson, O.; Bertilsson, E.; Klasson, J.; Ingemarsson, C. Approximate Neumann Series or exact matrix
inversion for massive MIMO? In Proceedings of the 2017 IEEE 24th Symposium on Computer Arithmetic
(ARITH), London, UK, 24–26 July 2017.
10. Fang, L.; Huang, D. Neumann Series Expansion Based LMMSE Channel Estimation for OFDM Systems.
IEEE Commun. Lett. 2016, 20, 748–751. [CrossRef]
11. Gao, X.; Dai, L.; Ma, Y.; Wang, Z. Low-complexity near-optimalsignal detection for uplink large-scale MIMO
systems. Electron. Lett. 2014, 50, 1326–1328. [CrossRef]
12. Qin, X.; Yan, Z.; He, G. A near-optimal detection scheme based on joint steepest descent and Jacobi method
for uplink massive MIMO systems. IEEE Commun. Lett. 2016, 20, 276–279. [CrossRef]
13. Lee, Y. Decision-aided Jacobi iteration for signal detection in massive MIMO systems. Electron. Lett. 2017, 53,
1552–1554. [CrossRef]
14. Kong, B.Y.; Park, I. Low-complexity symbol detection for massive MIMO uplink based on Jacobi method.
In Proceedings of the IEEE 27th Annual International Symposium on Personal, Indoor, and Mobile Radio
Communications (PIMRC), Valencia, Spain, 1–5 September 2016.
15. Minango, J.; de Almeida, C.; Daniel Altamirano, C. Low-complexity MMSE detector for massive MIMO systems
based on Damped Jacobi method. In Proceedings of the 2017 IEEE 28th Annual International Symposium on
Personal, Indoor, and Mobile Radio Communications (PIMRC), Montreal, QC, Canada, 1–5 October 2017.
16. Deng, Q.; Guo, L.; Dong, C.; Lin, J.; Meng, D.; Chen, X. High-throughput signal detection based on fast
matrix inversion updates for uplink massive multiuser multiple-input multi-output systems. IET Commun.
2017, 11, 2228–2235. [CrossRef]
17. Sun, Y.; Li, Z.; Zhang, C.; Zhang, R.; Yan, F.; Shen, L. Low complexity signal detector based on SSOR
iteration for large-scale MIMO systems. In Proceedings of the 9th International Conference on Wireless
Communications and Signal Processing (WCSP), Nanjing, China, 1–6 October 2017.
18. Dai, L.; Gao, X.; Su, X.; Han, S.; Chih-Lin, I.; Wang, Z. Low-complexity soft-output signal detection based on
Gauss–Seidel method for uplink multiuser large-scale MIMO systems. IEEE Trans. Veh. Technol. 2015, 64,
4839–4845. [CrossRef]
19. Zeng, J.; Lin, J.; Wang, Z. An improved Gauss–Seidel algorithm and its efficient architecture for massive
MIMO systems. IEEE Trans. Circuits Syst. II-Express Briefs. 2018, 65, 1194–1198. [CrossRef]
20. Elgabli, A.; Elghariani, A.; Aggarwal, V.; Bell, M.R. A low-complexity detection algorithm for uplink massive
MIMO systems based on Alternating Minimization. IEEE Wirel. Commun. Lett. 2019, 8, 917–920. [CrossRef]
21. Souto, N.; Dinis, R. MIMO detection and equalization for single-carrier systems using the Alternating
Direction Method of Multipliers. IEEE Signal Process. Lett. 2016, 23 , 1751–1755. [CrossRef]
22. Shahabuddin, S.; Juntti, M.; Studer, C. Admm-based infinity normdetection for large MU-MIMO: Algorithm
and vlsi architecture. In Proceedings of the IEEE International Symposium on Circuits and Systems (ISCAS),
Baltimore, MD, USA,1–4 May 2017.

224
Electronics 2018, 8, 980

23. Zhang, K.; Zhang, Y. Recursive algorithm for calculating eigenvalues of real symmetric matrix based on
LDLT decomposition. Computer Engineering and Applications. 2008, 44, 78–80.
24. Chou, C.; Wu, J. Low-complexity MIMO precoder design with LDL H channel decomposition. IEEE Trans.
Veh. Technol. 2011, 60, 2368–2372. [CrossRef]
25. Wang, C.; Wen, C.; Jin, S.; Tsai, S. Finite-Alphabet precoding for massive MU-MIMO with low-resolution
DACs. IEEE Trans. Wirel. Commun. 2018, 17, 4706–4720. [CrossRef]
26. Chen, J. A low complexity data detection algorithm for uplink multiuser massive MIMO systems. IEEE J. Sel.
Areas Commun. 2017, 35, 1701–1714. [CrossRef]
27. Jin, F.; Liu, Q.; Liu, H.; Wu, P. A low complexity signal detection scheme based on improved Newton iteration
for massive MIMO systems. IEEE Commun. Lett. 2019, 23, 748–751. [CrossRef]
28. Wu, Z.; Ge, L.; You, X.; Zhang, C. Efficient near-MMSE detector for large-scale MIMO systems. In Proceedings
of the 2017 IEEE International Workshop on Signal Processing Systems (SiPS), Lorient, France, 1–6 October 2017.
29. Wu, C.; Huang, W.; Chung, W. Robust update algorithms for Zero-Forcing detection in uplink large-scale
MIMO systems. IEEE Commun. Lett. 2018, 22, 424–427. [CrossRef]
30. Monteiro, F.A.; Wassell, I.J. Euclidean distances in quantized spaces with pre-stored components for MIMO
detection. In Proceedings of the European Conference on Wireless Technologies, Munich, Germany, 8–10
October 2007.
31. Hunger, R. Floating Point Operations in Matrix-Vector Calculus; Technical Report; Munich University of
Technology: Munich, Germany, 2007.

225
electronics
Article
Energy Efﬁciency Optimization for Massive MIMO
Non-Orthogonal Unicast and Multicast Transmission
with Statistical CSI
Wenjin Wang *, Yufei Huang, Li You, Jiayuan Xiong, Jiamin Li and Xiqi Gao
National Mobile Communications Research Laboratory, Southeast University, Nanjing 210096, China
* Correspondence: [email protected]; Tel.: +86-025-8379-0506

Received: 16 July 2019; Accepted: 30 July 2019; Published: 1 August 2019

Abstract: We study the energy efficiency (EE) optimization problem in non-orthogonal unicast and
multicast transmission for massive multiple-input multiple-output (MIMO) systems with statistical
channel state information of all receivers available at the transmitter. Firstly, we formulate the
EE maximization problem. We reduce the number of variables to be solved and simplify this
large-dimensional-matrix-valued problem into a real-vector-valued problem. Next, we lower the
computational complexity significantly by replacing the objective with its deterministic equivalent
to avoid the high-complex expectation operation. With guaranteed convergence, we propose an
iterative algorithm on beam domain power allocation using the minorize maximize algorithm and
Dinkelbach’s transform and derive the locally optimal power allocation strategy to achieve the optimal
EE. Finally, we illustrate the significant EE performance gain of our EE maximization algorithm
compared with the conventional approach through conducting numerical simulations.

Keywords: energy efﬁciency; non-orthogonal unicast and multicast transmission; statistical channel
state information; massive MIMO; beam domain

1. Introduction
As mobile data expands rapidly, it is expected that global wireless data traffic will surpass
100 exabytes per month by 2023 [1]. A considerable proportion of the data traffic, such as massive
software updating and sports broadcasting, is of common interest, which stimulates the demand for
services that can deliver the same data to a group of user terminals (UTs) efficiently. Since physical
layer multicasting can provide efficient point-to-multipoint wireless transmission, it has great potential
for future mobile communication systems [2–4].
Recently, non-orthogonal unicast and multicast (NOUM) transmission has been gaining increasing
interest [5–7]. At the transmitter, the unicast and multicast signals are precoded and then sent out to the
receivers simultaneously, sharing the same time-frequency resources. Compared with the conventional
orthogonal unicast and multicast (OUM) transmission, NOUM transmission is more spectrum-efficient,
and more suitable for scenarios where both multicast and unicast signals are needed by a UT. Massive
multiple-input multiple-output (MIMO) has become one of the core technologies of the fifth generation
wireless system for its significant performance in energy efficiency (EE) and spectral efficiency [8,9].
Therefore, there has been considerable research on the combination of multicast transmission and
massive MIMO systems [6,10,11]. Please note that mutual coupling is a major concern in massive
MIMO because it can weaken the system performance [12–16]. We assume perfect isolation between
the antennas without loss of generality.
EE has become a significant design criterion for wireless communication systems [17–19].
The broad-scale antenna arrays equipped at the base station (BS) cause the power consumption
to increase in massive MIMO systems, and the energy consumed by wireless communications is

Electronics 2019, 8, 857; doi:10.3390/electronics8080857 227 www.mdpi.com/journal/electronics

Electronics 2019, 8, 857

responsible for greenhouse gas emissions [20], which motivates the need to design energy-efficient
systems [8,21,22]. EE of a massive MIMO system was considered in [8]. However, it ignores the power
consumed by the BS circuit, while in [21], research on maximizing the EE and power transfer efficiency
for wireless-powered systems was analyzed, taking the circuit power consumption into account.
In [22], how the system parameters (number of antennas, transmitted power and number of UTs) affect
the EE of a multi-user MIMO system was investigated.
There are also previous works that studied energy-efficient NOUM transmission in massive
MIMO systems [23–25]. In [23], energy-efficient NOUM beamforming in multi-cell multi-user MIMO
scenario was studied. An optimization beamforming algorithm was proposed in [24] to optimize the EE
in the multi-cell multicast system. The extension of the problem was investigated in [25], which takes
antenna selection into consideration.
Please note that most of the previous works made the assumption that the UTs’ instantaneous
channel state information (CSI) is available at the BS. However, in realistic systems, obtaining good
estimates of instantaneous CSI is a challenging job [26–28]. Compared with obtaining instantaneous
CSI, the acquisition of statistical CSI is easier and more precise. In [11], rate maximization problem
for NOUM massive MIMO transmission was considered, and the EE maximization problem for
physical-layer multicast transmission was investigated in [29], both assumed that the BS only has
access to the UTs’ statistical CSI.
To our knowledge, the research on EE optimization of NOUM transmission for massive MIMO
systems with statistical CSI at the transmitter has not been studied yet. We investigate this problem in
our work, and the major contributions we provide in this paper are listed as follows:

• With statistical CSI, we formulate the EE maximization problem for NOUM transmission in the
massive MIMO scenario.
• We determine the optimal transmit directions of the multicast and unicast transmission in
closed-form, respectively, and then simplify the large-scale complex-matrix-valued precoding
design problem into a real-vector-valued power allocation problem in the beam domain.
• We reduce the computational complexity of the EE optimization problem signiﬁcantly by replacing
the objective function with its deterministic equivalent (DE).
• With guaranteed convergence, we propose an algorithm on beam domain power allocation
using the minorize maximize (MM) algorithm and Dinkelbach’s transform. We deal with the EE
optimization problem by iteratively solving a series of related convex optimization problems.

The remainder of the paper is constructed as follows. The channel model is introduced in Section 2.
The EE maximization problem is formulated and investigated in Section 3. Numerical simulations are
conducted in Section 4. Section 5 summarizes the paper.
Column vectors and matrices are represented by lower and upper case boldface letters,
respectively, whereas italic letters stand for scalars, and the following are other notations used in
this paper.

• We adopt R M× N to represent M × N real-valued vector space and C M× N to denote M × N

complex-valued vector space.
• I M represents the identity matrix of size M × M.
• X 0 indicates that matrix X is positive semideﬁnite.
• E {.} represents the expectation operation.
• denotes the Hadamard product.
• Denote tr {.} as the trace operation, (.) T as the transpose operation, (.)∗ as the conjugate operation,
(.) H as the conjugate-transpose operation, and det {.} as the determinant operation.
• ∼ stands for “be distributed as”, and stands for “be deﬁned as”.

228
Electronics 2019, 8, 857

2. System Model
Consider a single cell massive MIMO system with an M-antenna BS, jointly serves K UTs.
Denote by K {1, 2, . . . , K } the UT set, where the kth UT is equipped with Nk antennas. The multicast
and unicast services are carried out with the same time-frequency resources. The BS sends a multicast
signal that is of common interest to all the UTs in the cell while delivering unique messages to UTs
according to each UT’s demand during the downlink transmission, as shown in Figure 1.

Figure 1. System model of NOUM.

Assume the downlink signal sent by the BS is denoted by

x = xm + ∑ xuk ∈ C M×1 , (1)

k ∈K

where xm ∈ C M×1 represents the multicast signal and xuk ∈ C M×1 denotes the unicast signal sent to the
kth UT. Assume that xm and xuk are mutually uncorrelated, zero-mean, and their covariance matrices
+ ,
are Qm and Quk , respectively. Deﬁne tr {Qm } as the multicast transmission power and tr Quk as the
unicast transmission power. At the kth UT, the received signal is denoted by

yk = Hk x + nk ∈ C Nk ×1 , (2)

where Hk is the downlink channel matrix of size Nk × M, and nk ∼ CN 0, σ2 I Nk represents the
additive circularly symmetric complex-valued Gaussian noise with the variance being σ2 .
We adopt Weichselberger’s channel model [30,31] in our work because the correlation properties
between the transmit and receive ends of Weichselberger’s channel model are jointly considered rather
than separately characterized in the Kronecker model. Then, we can write the downlink channel
matrix in (2) as
Hk = Uk Gk VkH ∈ C Nk × M , (3)

where Uk ∈ C Nk × Nk and Vk ∈ C M× M are deterministic unitary matrices. Gk ∈ C Nk × M represents the

downlink channel matrix in the beam domain [26,27,32], and the elements of Gk are independently
distributed random variables with zero-mean. Denote Ωk as the beam domain channel power matrix

Ωk = E {Gk G∗k } ∈ R Nk × M , (4)

where the average power of [Gk ]i,j is represented by [Ωk ]i,j . As Ωk has the property of remaining
approximately constant while the frequency changes widely, the statistical CSI can be obtained
accurately and efﬁciently [32].
The vast number of antenna arrays employed at the BS brings about new channel properties for
massive MIMO systems. For example, as the BS antenna number M tends to inﬁnity, the eigenvector
matrices of the transmit correlation matrices between the BS and all UTs tend to be the same and are

229
Electronics 2019, 8, 857

only affected by the BS array topology [32,33]. Denote the corresponding deterministic unitary matrix
as V, and then in the massive MIMO scenario, the downlink channel matrix becomes

M→∞
Hk = Uk Gk V H . (5)

Please note that many of the previous works on massive MIMO adopted the channel model
mentioned in (5) such as [26,29,34], and it can achieve quite accurate performance [34].

3. NOUM Transmission in Massive MIMO

3.1. Problem Formulation

Consider a NOUM massive MIMO system. We assume that there is only one multicast group
without loss of generality. Consider the case when the kth (∀k ) UT knows its own instantaneous CSI
with proper pilot design [33], while the BS only has access to the statistic CSI of all UTs.
Rewrite the received signal at the kth UT by inserting (1) into (2) as follows

yk = Hk xm + Hk xuk + ∑ Hk xuk + nk . (6)

k =k

Each UT will decode the common multicast signal and its desired unicast signal in order by
applying successive interference cancellation (SIC) method.
During the process of multicast decoding, the kth UT regards the term Hk xm in (6) as the desired
message while treating the others as interference. For the covariance matrix of the interference and
noise, we have
! "
k = σ I Nk + ∑ E Hk Qk Hk
Km 2 u H
∈ C Nk × Nk . (7)
2 34 5 k ∈K
noise 2 34 5
interference

Since UT k has the knowledge of its own instantaneous CSI and the covariance matrix Km
k ,
during the multicast transmission, we denote by Rm
k the ergodic rate of the kth UT
! ! ""
k = E log det Kk + Hk Q Hk
Rm − log det {Km
m m H
k }. (8)

Denote multicast ergodic rate as min Rm

k . By inserting the massive system model in (5) and
k
det {I + MN} = det {I + NM}, the Sylvester’s determinant identity, into (8), the multicast rate Rm
k
in (8) becomes ! ! "" ! "
m m
Rmk = E log det Kk + Gk V Q VGk − log det Kk ,
H m H
(9)
m
where Kk is deﬁned as
m
Kk UkH Km
k Uk
! "
(10)
= σ2 I Nk + ∑ E Gk V H Quk VGkH ∈ C Nk × Nk .
k ∈K
+ ,
Deﬁne a matrix-valued function Ak (X) by Ak (X) E Gk XGkH . Since all the elements of Gk are
zero-mean and independently distributed, the off-diagonal elements of Ak (X) are zero, so Ak (X) is
a diagonal matrix-valued function of size Nk × Nk , and its ith diagonal element is
T
[Ak ]i,i = tr diag [Ωk ]i,: X . (11)

+ ,
Then the terms E Gk V H Quk VGkH in (10) can be rewritten as Ak V H Quk V .

230
Electronics 2019, 8, 857

For the unicast signal decoding, with SCI, the multicast signal is removed, so the interference only
contains the unicast signal meant for other UTs. For the covariance matrix of the interference and noise
at the kth UT, we have ! "
Kuk = σ2 I Nk + ∑ E Hk Quk HkH ∈ C Nk × Nk . (12)
2 34 5 k =k
noise 2 34 5
interference

Then we denote by Ruk the ergodic rate of the kth UT during the unicast transmission
! ! ""
Ruk = E log det Kuk + Hk Quk HkH − log det {Kuk } . (13)

By inserting the massive system model in (5) and the Sylvester’s determinant identity into (13),
the unicast rate Ruk at the kth UT becomes
! ! "" ! "
u u
Ruk = E log det Kk + Gk V H Quk VGkH − log det Kk , (14)

u
where Kk is deﬁned as
Ak V H Quk V ∈ C Nk × Nk ,
u
Kk σ2 I Nk + ∑ (15)
k =k

and the deﬁnition of Ak (X) is given in (11).

Next, we consider the system power consumption. Apply the power consumption model the
same as the one used in [29,35] as follows

P = μ tr {Qm } + ∑ tr {Quk } + MPc + Ps , (16)

k ∈K

where the constant-coefficient μ ≥ 1 accounts for the reciprocal of the transmit amplifier drain efficiency.
+ ,
tr {Qm } means the multicast transmit power, and ∑k∈K tr Quk denotes the total unicast transmit
power. Pc stands for the constant circuit power consumption per antenna and is unaffected by the
actual transmit power. Ps represents the BS static power consumption and is irrelevant to the number
of antennas.
In the following, we formulate the EE optimization problem for NOUM massive MIMO system.
We aim at identifying the optimal transmit covariance matrices Qm and Quk for multicast and
unicast transmission that can maximize the system EE, respectively. We define a weight matrix
u = [u0 , u1 , . . . , uK ] with u0 being the weight of multicast rate and uk being the weight of kth unicast
rate. Then we can denote by R the weighted sum rate as follows:

R u0 K (min Rm
k
k )+ ∑ uk Ruk , (17)
k ∈K

and the EE of the considered system with bandwidth W is given by

W u0 K (min Rmk ) + ∑ uk Ruk
WR k k ∈K
EE = = . (18)
P + ,
μ tr {Qm } + ∑ tr Quk + MPc + Ps
k ∈K

Therefore, the EE maximization problem is stated as

231
Electronics 2019, 8, 857

W u0 K (min Rmk ) + ∑ uk Ruk
k k ∈K
max ,
Qm ,Quk ,∀k ∈K + u,
μ tr {Qm } + ∑ tr Qk + MPc + Ps
k ∈K (19)
s.t. tr {Qm } + ∑ tr {Quk } ≤ Pmax ,
k ∈K
Qm 0, Quk 0 (∀k ∈ K) ,

where Pmax is the power budget at the BS.

3.2. Optimal Transmit Directions

The problem in (19) aims at designing large-dimensional complex matrices Qm and Quk (∀k ),
and the computational complexity can be very high. To simplify this problem, ﬁrst, we decompose the
H
transmit covariance matrices as Qm = Φm Λm (Φm ) H and Quk = Φuk Λuk Φuk , respectively. Φm and
Φk are constituted by the eigenvectors of Q and Qk , respectively, which represent the directions of
u m u

the transmitted signals. Meanwhile, Λm and Λuk are diagonal matrices with their diagonal elements
constituted by the eigenvalues of Qm and Quk , respectively, which denote the allocated power over the
corresponding directions.
The following theorem determines the values of the eigenvectors of Qm and Quk .

Theorem 1. The optimal multicast and unicast transmit covariance matrices of problem (19) is
u,opt
Qm,opt = VΛm V H , Qk = VΛuk V H , ∀k, (20)

where Λm and Λuk (∀k) are both diagonal matrices, and the matrix V equals to the eigenvector matrices of the
correlation matrices between the BS and all UTs and only depends on the BS array topology. The eigenvectors of
Qm and Quk are given by the columns of the matrix V,

Proof. Please refer to the Appendix A.

Theorem 1 above indicates that when solving problem (19), since the eigenvectors
are deterministic, we only have to determine the power allocation matrix denoted by
+ m u u ,
Λ Λ , Λ1 , Λ2 , . . . , ΛuK , which reduces the number of variables to be optimized and the
computational complexity signiﬁcantly. Therefore, the large-dimensional complex-matrix-valued
EE maximization problem can be transformed into a real-vector-valued power allocation problem in
the beam domain.
m u
Rewrite Kk and Kk as follows
m
Kk (Λ) σ2 I Nk + ∑ Ak Λuk , (21)
k ∈K

u
Kk (Λ) σ2 I Nk + ∑ Ak Λuk , (22)
k =k

and without loss of optimality, we can simplify the problem in (19) into the problem below

W u0 K min Rmk ( Λ ) + ∑ u R
k k
u (Λ)
k k ∈K
max ,
Λ + u,
μ tr {Λm } + ∑ tr Λk + MPc + Ps
k ∈K (23)
s.t. tr {Λ } +
m
∑ tr {Λuk } ≤ Pmax ,
k ∈K
Λm 0, Λ m
diagonal, Λuk 0, Λuk diagonal (∀k ∈ K) ,

232
Electronics 2019, 8, 857

with ! ! "" ! "

m m
k ( Λ ) = E log det Kk ( Λ ) + Gk Λ Gk
Rm − log det Kk (Λ) ,
m H
(24)
2 34 5 2 34 5
s +
k (Λ) s −
k (Λ)
! ! ! " ""
u u
Ruk (Λ) = E log det Kk (Λ) + Gk Λuk GkH − log det Kk (Λ) . (25)
2 34 5 2 34 5
t +
k (Λ) t −
k (Λ)

Denote the lower bound of Rm

k (Λ) (∀k) as an auxiliary variable η, the problem in (23) can be
equivalently expressed as

W u0 Kη + ∑ uk Ruk (Λ)
k ∈K
max ,
Λ + ,
μ tr {Λm } + ∑ tr Λuk + MPc + Ps
k ∈K
(26)
s.t. k ( Λ ) ≥ η (∀ k ∈ K) ,
Rm
tr {Λm } + ∑ tr {Λuk } ≤ Pmax ,
k ∈K
Λm 0, Λ m
diagonal, Λuk 0, Λuk diagonal (∀k ∈ K) .

3.3. Energy-Efﬁcient Power Allocation for NOUM Transmission

By observing problem (26), we can conclude that the numerator of the objective function is
a difference of concave functions. We adopt the MM algorithm to deal with the problem. It is an iteration
optimization process, where during each iteration, we replace the objective function with its lower
bound function.
In this problem, we substitute s− −
k ( Λ ) in (24) and tk ( Λ ) in (25) with their ﬁrst-order Taylor
expansions, respectively, to transfer the numerator of the objective function into a concave function,
which leads to a concave-linear fractional program. We can solve problem! (26) by solving "
a series of
substitution problems iteratively. Then at the pth iteration, Λ( p) = Λm
( p)
, Λu1,( p) , . . . , ΛuK,( p) , and the
sub-problem is
Λ ( p +1) =
T #

k ( Λ( p) )
∂t−
W u0 Kη + ∑ uk t+
k ( Λ ) − t −
k Λ ( p ) − ∑ tr ∂Λ a
u Λ u − Λu
a a,( p)
k ∈K a =k
arg max ,
Λ
+ u,
μ tr {Λm } + ∑ tr Λk + MPc + Ps
k∈K
⎧⎛ ⎞T ⎫ (27)
⎪
⎨ ∂s− ⎪
⎬
+ − k Λ( p)
s.t. sk (Λ) − sk Λ( p) − ∑ tr ⎝ ⎠ Λa − Λ
u
a,( p) ⎪ − η ≥ 0 (∀ k ∈ K) ,
u
⎪
a∈K ⎩
∂Λ u
a ⎭

tr {Λm } + ∑ tr {Λuk } ≤ Pmax ,

k∈K
Λm 0, Λ m
diagonal, Λuk 0, Λuk diagonal (∀k ∈ K) ,

( p) ( p)
where the gradients of s− −
k Λ( p) and tk Λ( p) with respect to Λ a are deﬁned by Δsk and Δtk ,
u

respectively, with their diagonal elements being

⎡ ⎤

( p)
∂s−
k Λ( p)
Nk [Ωk ]n,i
Δsk =⎣ ⎦ = ∑ , (28)
i,i ∂Λua n =1
M
i,i σ2 + ∑ ∑ [Ωk ]n,m Λuk ,( p)
k ∈K m=1 m,m

233
Electronics 2019, 8, 857

⎤ ⎧
⎡ ⎪ Nk [Ωk ]n,i
∂t− ⎪
⎨ a = k,
k Λ( p) ∑ ,
( p) M
Δtk =⎣ ⎦ = n =1 σ 2 + ∑ ∑ [Ωk ]n,m Λuk ,( p) (29)
i,i ∂Λua ⎪
⎪ k =k m =1 m,m
⎩
i,i 0, a = k,
respectively.
Since t−
k Λ( p) and Λ a,( p) in (27) are constant in each iteration, we can ignore them and obtain
u

an equivalent optimization problem as

Λ ( p +1) =
T #
k ( Λ( p) )
∂t−
W u0 Kη + ∑ uk (Λ) − ∑ trt+
k ∂Λua Λua
k ∈K a =k
arg max ,
Λ
+ ,
μ tr {Λm } + ∑ tr Λuk + MPc + Ps
k∈K
⎧⎛ ⎞T ⎫ (30)
⎪
⎨ ∂s− Λ ⎪
⎬
k ( p )
s.t. s+ (Λ) − s− Λ( p) − ∑ tr ⎝ ⎠ Λua − Λu
a,( p) ⎪ − η ≥ 0 (∀ k ∈ K) ,
k k ⎪
a∈K ⎩
∂Λ u
a ⎭

tr {Λm } + ∑ tr {Λuk } ≤ Pmax ,

k ∈K
Λm 0, Λ m
diagonal, Λuk 0, Λuk diagonal (∀k ∈ K) .

Although the numerator of the objective function and constraint of the transformed
sub-problem (30) are concave, the computational complexity can still be quite high if the expectation
operation is manipulated using Monte-Carlo methods. Via applying the large-dimensional random
matrix theory in [36,37], we further reduce the optimization complexity by substituting the minuends
k ( Λ ) and Rk ( Λ ) with their DEs, respectively.
of Rm u

First, we deﬁne a diagonal matrix-valued function Yk (X) of size M × M, and its ith diagonal
element is ! ! " "
[Yk (X)]i,i = tr diag [Ωk ]:,i X . (31)

Then, we can write the DE of s+

k ( Λ ) as

Sk+ (Λ) = log det {I M + Γm m

k Λ }
! " −1 (32)
+ log det Γ6m + Km k ( Λ ) − tr I Nk − Φ6m ,
k k

where Γm 6m 6m
k , Γk and Φk are given by
−1
6 m Km ∈ C M× M ,
k = Yk
Γm Φ k k (Λ)

6 m = Ak −1
Γ k Λm (I M + Λm Γmk ) ∈ C Nk × Nk , (33)
−1
6 m = IN
Φ 6 m Km
+Γ k (Λ) ∈ C Nk × Nk ,
k k k

and the deﬁnition of Ak (X) is given in (11).

Likewise, we have the DE of t+k ( Λ ) as

Tk+ (Λ) = log det {I M + Γuk Λuk }

! " −1 (34)
+ log det Γ6u + Kuk (Λ) − tr I N − Φ
6u ,
k k k

234
Electronics 2019, 8, 857

6u and Φ
where Γuk , Γ 6 u are given by
k k
−1
Γuk = Yk 6 u Kuk (Λ)
Φ ∈ C M× M ,
k

6u = Ak Λu (I M + Λu Γu )−1 ∈ C Nk × Nk ,
Γ (35)
k k k k
−1
6 u = IN + Γ
Φ 6u Kuk (Λ) ∈ C Nk × Nk .
k k k

With the DEs of s+ +

k ( Λ ) and tk ( Λ ) deﬁned above, the optimization problem in (30) becomes

Λ ( p +1) =
T #
k ( Λ( p) )
∂t−
W u0 Kη + ∑ uk Tk+ (Λ) − ∑ tr ∂Λua Λ u
a
k ∈K a =k
arg max ,
Λ
+ ,
μ tr {Λm } + ∑ tr Λuk + MPc + Ps
k ∈K
⎧⎛ ⎞T ⎫ (36)
⎪
⎨ ∂s− ⎪
⎬
k Λ( p)
s.t. Sk+ (Λ) − s−
k Λ( p) − ∑ tr ⎝ ⎠ Λua − Λu
( ) − η ≥ 0 (∀k ∈ K) ,
⎪
a∈K ⎩
∂Λ a
u a, p ⎪
⎭

tr {Λm } + ∑ tr {Λuk } ≤ Pmax ,

k ∈K
Λm 0, Λ m
diagonal, Λuk 0, Λuk diagonal (∀k ∈ K) .

We can observe from the optimization problem in (36) that the denominator and numerator of
the objective function are linear and concave functions of Λ, respectively. We invoke Dinkelbach’s
transform [38] to deal with this concave-linear program. We can obtain the solution to (36) via solving
a series of problems below
! "
( q +1)
Λ( p) , η ( q +1) =
⎛ ⎛ ⎧⎛ ⎞T ⎫⎞⎞
⎪
⎨ ∂t− ⎪
⎬
⎜ ⎜ k Λ( p) ⎟⎟ (q)
arg max W ⎝u0 Kη + ∑ uk ⎝ Tk+ (Λ) −
∑ tr ⎪⎝ ∂Λua ⎠ Λa ⎪⎠⎠ − χ( p) P (Λ) ,
u
Λ k ∈K a =k ⎩ ⎭
⎧⎛ ⎞T ⎫
⎪
⎨ ∂s− Λ( p) ⎪
⎬ (37)
k
s.t. Sk+ (Λ) − s−
k Λ( p) − ∑ tr ⎪
⎝ ⎠ Λua − Λu
a,( p) ⎪ − η ≥ 0 (∀ k ∈ K) ,
a∈K ⎩ ∂Λ u
a ⎭

tr {Λm } + ∑ tr {Λuk } ≤ Pmax ,

k ∈K
Λm 0, Λ m
diagonal, Λuk 0, Λuk diagonal (∀k ∈ K) ,

+ , (q)
where P (Λ) = μ tr {Λm } + ∑ tr Λuk + MPc + Ps , q is the iteration index, and χ( p) is the auxiliary
k ∈K
(q)
variable. During each iteration, we update χ( p) using the following equation
T #

(q) k ( Λ( p) )
∂t− u,(q)
W u0 Kη (q) + ∑ uk Tk+ Λ( p) − ∑ tr ∂Λua Λ a,( p)
(q) k∈K a =k
χ( p) = . (38)
(q)
P Λ( p)

From the analysis above, we can observe that the proposed EE optimization algorithm involves
two-layer iterations. During the outer iteration, via invoking the MM algorithm, we replace the
numerator of the objective function in (26) with its lower bound function, thus making the numerator

235
Electronics 2019, 8, 857

concave. The MM-based algorithm is guaranteed to converge to the locally optimal solution [39–41];
in the inner iteration, we transform the fractional problem in (36) into solvable convex optimization
problems in (37) via Dinkelbach’s transform, which can derive the global optimum solution to (36)
with guaranteed convergence [42]. After several iterations, we can obtain the optimal beam domain
power allocation matrix Λ. Please note that Λ is locally optimal due to the local optimality of MM
algorithm. We present our algorithm in Algorithm 1.

Algorithm 1 Energy-Efﬁcient Power Allocation Algorithm in the Beam Domain for Massive MIMO
NOUM Transmission
Input: Beam domain channel statistics Ωk , initial power allocation matrix Λ(0) , outer iteration

threshold 1 and inner iteration threshold 2

Output: Power allocation matrix Λ in the beam domain
1: Initialization: EE(−1) = 0, p = 0
2: Calculate

! "
W u0 K min Sk+ Λ( p) − s−
k Λ ( p) + ∑ u k Tk
+
Λ ( p) − t −
k Λ ( p)
k
EE( p) = k∈K (39)
P Λ( p)

3: while EE( p) − EE( p−1) ≥ 1 do
(q) (q)
4: Initialization: q = 0, let Λ( p) = Λ( p) , calculate χ( p) with (38)

(q) ( q −1)
5: while χ( p) − χ( p) ≥ 2 do
6: Let q = q + 1
(q) ( q −1)
7: Calculate Λ( p) via solving problem (37) with χ( p)
(q)
8: Calculate χ( p) using (38)
9: end while
10: Let p = p + 1
(q)
11: Let Λ( p) = Λ( p−1)
12: Calculate EE( p) with (39)
13: end while
14: return Λ = Λ( p)

4. Numerical Results
We provide numerical simulation results to demonstrate the performance of the EE optimization
algorithm proposed above for NOUM transmission massive MIMO scenario with statistical CSI.
Table 1 illustrates how the numerical simulation parameters are set.
First of all, in Figure 2, we illustrate the convergence performance by showing the iteration
process of our EE optimization algorithm under different transmit power budgets Pmax . The horizontal
ordinate is the outer iteration index. As we can see, the EE converges after only a few iterations. Also,
we can observe that in the lower power budget regime, the EE performance convergences faster than
that in the higher power budget regime.

236
Electronics 2019, 8, 857

Table 1. Simulation parameters.

Parameter Value
Scenario Suburban macro
Channel model 3GPP SCM
Pathloss −120 dB (∀k)
Array topology ULA with antenna spacing half wavelength
Noise variance σ2 = −131 dBm
Number of UTs K=8
Number of BS antennas M = 128
Number of UT antennas Nk = 4 (∀k)
Transmission bandwidth W= 10 MHz
Ampliﬁer drain efﬁciency μ=5
Circuit power consumption per antenna Pc = 30 dBm
Static power consumption Ps = 40 dBm
Weights u0 = 0.7, uk = 0.3 (∀k )

G%P
G%P
G%P
(QHUJ\HIILFLHQF\ ELWV-RXOH

G%P

1XPEHURILWHUDWLRQV

Figure 2. The convergence performance of the proposed EE optimization algorithm for different power
budgets Pmax .

Then, we evaluate the EE of the NOUM transmission versus the power budget Pmax under
different numbers the antennas M at the BS in Figure 3. As we can see, the EE performance decreases
when the BS antenna number M increases for the reason that in the power consumption model we
adopted in (16), the total circuit power consumption grows linearly with M, the BS antenna number.
Next, the comparison of the EE performance of the power allocation algorithm proposed above
with the rate maximization approach [11] is shown in Figure 4. We notice that the EE performance of
the two approaches are similar at low transmit power budget regime. However, when the transmit
power budget gets high, the EE performance of the rate maximization approach decreases, while that
of our EE maximization approach remains high. This indicates that the rate maximization approach can
achieve almost EE optimal when Pmax is low. However, our EE maximization approach outperforms
the rate maximization one at high transmit power budget regime.

237
Electronics 2019, 8, 857

(QHUJ\HIILFLHQF\ ELWV-RXOH 0
0
0

7UDQVPLWSRZHUEXGJHW3PD[ G%P

Figure 3. The EE performance of the NOUM transmission versus the power budget Pmax for different
numbers of BS antennas M.

((PD[
5DWHPD[
(QHUJ\HIILFLHQF\ ELWV-RXOH

7UDQVPLWSRZHUEXGJHW3PD[ G%P

Figure 4. The EE performance of the proposed beam domain power allocation algorithm compared
with the rate maximization approach.

Finally, in Figure 5, the EE performance of our power allocation approach and that of full CSI
approach, which assumes instantaneous CSI is known at the BS, is compared. Since full CSI is an ideal
case, it can achieve better EE performance than other imperfect CSI situation. However, the full CSI
case suffers from pilot overhead. As Figure 5 illustrates, our proposed algorithm surpasses the full CSI
approach with 3/7 pilot overhead [43] in the EE performance.

238
Electronics 2019, 8, 857

6WDWLVWLFDO&6,

(QHUJ\HIILFLHQF\ ELWV-RXOH )XOO&6,
)XOO&6,SLORWRYHUKHDG

7UDQVPLWSRZHUEXGJHW3PD[ G%P

Figure 5. The comparison on the EE performance of proposed algorithm, full CSI case and full CSI
with 3/7 overhead.

5. Conclusions
To conclude, we considered the EE optimization problem in NOUM transmission systems
with statistical CSI available at the BS. We ﬁrst formulated the EE maximization problem, and then
determined the closed-form optimal eigenvectors of the multicast and unicast transmit covariance
matrices for optimal EE, respectively. Next, with guaranteed convergence, we proposed a beam domain
power allocation algorithm adopting the MM algorithm, DE and Dinkelbach’s transform and derived
the locally optimal power allocation strategy to achieve the EE optimization. Finally, with numerical
results, we presented the performance gain of our EE maximization algorithm compared with the
conventional approach.

Author Contributions: W.W. and Y.H. perceived the idea and wrote the manuscript. L.Y., J.X., J.L. and X.G. gave
valuable suggestions on the structuring of the paper and assisted in the revising and proofreading.
Funding: This work was supported by the National Key R&D Program of China under Grant 2018YFB1801103,
the National Natural Science Foundation of China under Grants 61631018, 61801114, 61761136016, 61771264,
61871465, and 61501113, the Natural Science Foundation of Jiangsu Province under Grant BK20170688,
the Fundamental Research Funds for the Central Universities, and the Huawei Cooperation Project.
Acknowledgments: The authors would like to thank the Editor and the anonymous reviewers for their valuable
comments and suggestions.
Conﬂicts of Interest: The authors declare no conﬂict of interest.

Appendix A. Proof of Theorem 1

Firstly, we can tell from (11) that the value of Ak (X) is only affected by the diagonal elements
m u
of X, so Kk in (10) and Kk in (15) are irrelevant to the off-diagonal elements of V H Quk V. Then,
the power consumption in (16) has no relationship with the off-diagonal elements of VΛm V H or
VΛuk V H . Moreover, applying the proof method similar to [44], we can conclude that VΛm V H and
VΛuk V H should be diagonal to maximize Rm u
k and Rk , respectively. Therefore, to maximize the objective
m u
function in (19), Q and Qk should be both diagonal matrices. This concludes the proof.

239
Electronics 2019, 8, 857

References
1. Lecompte, D.; Gabin, F. Evolved multimedia broadcast/multicast service (eMBMS) in LTE-advanced:
Overview and Rel-11 enhancements. IEEE Commun. Mag. 2012, 50, 68–74. [CrossRef]
2. Wang, W.; Liu, A.; Zhang, Q.; You, L.; Gao, X.; Zheng, G. Robust multigroup multicast transmission for
frame-based multi-beam satellite systems. IEEE Access 2018, 6, 46074–46083. [CrossRef]
3. You, L.; Liu, A.; Wang, W.; Gao, X. Outage constrained robust multigroup multicast beamforming for
multi-beam satellite communication systems. IEEE Wirel. Commun. Lett. 2019, 8, 352–355. [CrossRef]
4. You, L.; Wang, J.; Wang, W.; Gao, X. Secure multicast transmission for massive MIMO with statistical channel
state information. IEEE Signal Process. Lett. 2019, 26, 803–807. [CrossRef]
5. Chen, E.; Tao, M.; Liu, Y. Joint base station clustering and beamforming for non-orthogonal multicast and
unicast transmission with backhaul constraints. IEEE Trans. Wirel. Commun. 2018, 17, 6265–6279. [CrossRef]
6. Sadeghi, M.; Björnson, E.; Larsson, E.G.; Yuen, C.; Marzetta, T. Joint unicast and multi-group multicast
transmission in massive MIMO systems. IEEE Trans. Wirel. Commun. 2018, 17, 6375–6388. [CrossRef]
7. Wang, J.; Xu, H.; Zhu, B.; Fan, L.; Zhou, A. Hybrid beamforming design for mmWave joint unicast and
multicast transmission. IEEE Commun. Lett. 2018, 22, 2012–2015. [CrossRef]
8. Ngo, H.Q.; Larsson, E.G.; Marzetta, T.L. Energy and spectral efficiency of very large multiuser MIMO
systems. IEEE Trans. Commun. 2013, 61, 1436–1449.
9. Hoydis, J.; ten Brink, S.; Debbah, M. Massive MIMO in the UL/DL of cellular networks: How many antennas
do we need? IEEE J. Sel. Areas Commun. 2013, 31, 160–171. [CrossRef]
10. Sadeghi, M.; Björnson, E.; Larsson, E.G.; Yuen, C.; Marzetta, T.L. Max–min fair transmit precoding for
multi-group multicasting in massive MIMO. IEEE Trans. Wirel. Commun. 2018, 17, 1358–1373. [CrossRef]
11. You, L.; Xiong, J.; Li, K.; Wang, W.; Gao, X. Non-orthogonal unicast and multicast transmission for massive
MIMO with statistical channel state information. IEEE Access 2018, 6, 66841–66849. [CrossRef]
12. Alibakhshikenari, M.; Virdee, B.S.; See, C.H.; Abd-Alhameed, R.A.; Falcone, F.; Limiti, E. High-isolation
leaky-wave array antenna based on CRLH-metamaterial implemented on SIW with 30o frequency
beam-scanning capability at millimetre-waves. Electronics 2019, 8, 642. [CrossRef]
13. Song, Y.J.; Sarabandi, K. Equivalent circuit model for metamaterial-based electromagnetic band-gap isolator.
IEEE Antennas Wirel. Propag. Lett. 2012, 11, 1366–1369. [CrossRef]
14. Alibakhshikenari, M.; Khalily, M.; Virdee, B.S.; See, C.H.; Abd-Alhameed, R.A.; Limiti, E. Mutual-coupling
isolation using embedded metamaterial EM bandgap decoupling slab for densely packed array antennas.
IEEE Access 2019, 7, 51827–51840. [CrossRef]
15. Farsi, S.; Aliakbarian, H.; Schreurs, D.; Nauwelaers, B.; Vandenbosch, G.A.E. Mutual coupling reduction
between planar antennas by using a simple microstrip U-section. IEEE Antennas Wirel. Propag. Lett. 2012,
11, 1501–1503. [CrossRef]
16. Alibakhshikenari, M.; Khalily, M.; Virdee, B.S.; See, C.H.; Abd-Alhameed, R.A.; Limiti, E. Mutual coupling
suppression between two closely placed microstrip patches using EM-bandgap metamaterial fractal loading.
IEEE Access 2019, 7, 23606–23614. [CrossRef]
17. Singh, K.; Gupta, A.; Ratnarajah, T. QoS-driven energy-efficient resource allocation in multiuser
amplify-and-forward relay networks. IEEE Trans. Signal Inf. Process. Over Netw. 2017, 3, 771–786. [CrossRef]
18. Singh, K.; Gupta, A.; Ratnarajah, T. QoS-driven resource allocation and EE-balancing for multiuser two-way
amplify-and-forward relay networks. IEEE Trans. Wirel. Commun. 2017, 16, 3189–3204. [CrossRef]
19. Singh, K.; Gupta, A.; Ratnarajah, T. A utility-based joint subcarrier and power allocation for green
communications in multi-user two-way regenerative relay networks. IEEE Trans. Commun. 2017, 65, 3705–3722.
[CrossRef]
20. Li, G.Y.; Xu, Z.; Xiong, C.; Yang, C.; Zhang, S.; Chen, Y.; Xu, S. Energy-efficient wireless communications:
tutorial, survey, and open issues. IEEE Wirel. Commun. 2011, 18, 28–35. [CrossRef]
21. Khan, T.A.; Yazdan, A.; Heath, R.W. Optimization of power transfer efficiency and energy efficiency for
wireless-powered systems with massive MIMO. IEEE Trans. Wirel. Commun. 2018, 17, 7159–7172. [CrossRef]
22. Björnson, E.; Sanguinetti, L.; Hoydis, J.; Debbah, M. Optimal design of energy-efficient multi-user MIMO
systems: Is massive MIMO the answer? IEEE Trans. Wirel. Commun. 2015, 14, 3059–3075. [CrossRef]

240
Electronics 2019, 8, 857

23. Tervo, O.; Tran, L.; Chatzinotas, S.; Juntti, M.; Ottersten, B. Energy-efficient joint unicast and multicast
beamforming with multi-antenna user terminals. In Proceedings of the 2017 IEEE 18th International
Workshop on Signal Processing Advances in Wireless Communications (SPAWC), Sapporo, Japan,
3–6 July 2017; pp. 1–5.
24. He, S.; Huang, Y.; Jin, S.; Yang, L. Energy efficient coordinated beamforming design in multi-cell multicast
networks. IEEE Commun. Lett. 2015, 19, 985–988. [CrossRef]
25. Tervo, O.; Tran, L.; Pennanen, H.; Chatzinotas, S.; Ottersten, B.; Juntti, M. Energy-efficient multicell
multigroup multicasting with joint beamforming and antenna selection. IEEE Trans. Signal Process.
2018, 66, 4904–4919. [CrossRef]
26. You, L.; Gao, X.; Xia, X.; Ma, N.; Peng, Y. Pilot reuse for massive MIMO transmission over spatially correlated
rayleigh fading channels. IEEE Trans. Wirel. Commun. 2015, 14, 3352–3366. [CrossRef]
27. You, L.; Gao, X.; Swindlehurst, A.L.; Zhong, W. Channel acquisition for massive MIMO-OFDM with
adjustable phase shift pilots. IEEE Trans. Signal Process. 2016, 64, 1461–1476. [CrossRef]
28. You, L.; Chen, X.; Wang, W.; Gao, X. Coordinated multicast precoding for multi-cell massive MIMO
transmission exploiting statistical channel state information. Electronics 2018, 7, 338. [CrossRef]
29. You, L.; Wang, W.; Gao, X. Energy-efficient multicast precoding for massive MIMO transmission with
statistical CSI. Energies 2018, 11, 3175. [CrossRef]
30. Weichselberger, W.; Herdin, M.; Ozcelik, H.; Bonek, E. A stochastic MIMO channel model with joint
correlation of both link ends. IEEE Trans. Wirel. Commun. 2006, 5, 90–100. [CrossRef]
31. Gao, X.; Jiang, B.; Li, X.; Gershman, A.B.; McKay, M.R. Statistical eigenmode transmission over jointly
correlated MIMO channels. IEEE Trans. Inf. Theory 2009, 55, 3735–3750. [CrossRef]
32. You, L.; Gao, X.; Li, G.Y.; Xia, X.; Ma, N. BDMA for millimeter-wave/terahertz massive MIMO transmission
with per-beam synchronization. IEEE J. Sel. Areas Commun. 2017, 35, 1550–1563. [CrossRef]
33. Sun, C.; Gao, X.; Jin, S.; Matthaiou, M.; Ding, Z.; Xiao, C. Beam division multiple access transmission for
massive MIMO communications. IEEE Trans. Commun. 2015, 63, 2170–2184. [CrossRef]
34. Wen, C.; Jin, S.; Wong, K.; Chen, J.; Ting, P. Channel estimation for massive MIMO using Gaussian-mixture
Bayesian learning. IEEE Trans. Wirel. Commun. 2015, 14, 1356–1368. [CrossRef]
35. Ng, D.W.K.; Lo, E.S.; Schober, R. Energy-efficient resource allocation in OFDMA systems with large numbers
of base station antennas. IEEE Trans. Wirel. Commun. 2012, 11, 3292–3304. [CrossRef]
36. Lu, A.; Gao, X.; Xiao, C. Free deterministic equivalents for the analysis of MIMO multiple access channel.
IEEE Trans. Inf. Theory 2016, 62, 4604–4629. [CrossRef]
37. Couillet, R.; Debbah, M. Random Matrix Methods for Wireless Communications; Cambridge University Press:
Cambridge, UK, 2011. [CrossRef]
38. Dinkelbach, W. On nonlinear fractional programming. Manag. Sci. 1967, 13, 492–498. [CrossRef]
39. Lipp, T.; Boyd, S. Variations and extension of the convex-concave procedure. Optim. Eng. 2016, 17, 263–287.
[CrossRef]
40. Yuille, A.L.; Rangarajan, A. The concave-convex procedure. Neural Comput. 2003, 15, 915–936. [CrossRef]
41. Hunter, D.R.; Lange, K. A tutorial on MM algorithms. Am. Stat. 2004, 58, 30–37. [CrossRef]
42. Zappone, A.; Jorswieck, E. Energy efficiency in wireless networks via fractional programming theory.
Found. Trends R
Commun. Inf. Theory 2015, 11, 185–396. [CrossRef]
43. Marzetta, T.L. Noncooperative Cellular Wireless with Unlimited Numbers of Base Station Antennas.
IEEE Trans. Wirel. Commun. 2010, 9, 3590–3600. [CrossRef]
44. Tulino, A.M.; Lozano, A.; Verdú, S. Capacity-achieving input covariance for single-user multi-antenna
channels. IEEE Trans. Wirel. Commun. 2006, 5, 662–671. [CrossRef]

241
electronics
Article
Artiﬁcial Intelligence-Based Discontinuous Reception
for Energy Saving in 5G Networks
Mudasar Latif Memon 1 , Mukesh Kumar Maheshwari 2 , Navrati Saxena 3, *, Abhishek Roy 4 and
Dong Ryeol Shin 3, *
1 College of Information and Communication Engineering, Sungkyunkwan University, Suwon 16419, Korea
2 Department of Electrical Engineering, Bahria University, Karachi 75260, Pakistan
3 College of Software, Sungkyunkwan University, Suwon 16419, Korea
4 Advanced Communication Technology, Wireless System Design, MediaTek USA Inc.,
San Jose, CA 95134, USA
* Correspondence: [email protected] (N.S.); [email protected] (D.R.S.); Tel.: +82-31-299-4676 (N.S.)

Received: 17 June 2019; Accepted: 9 July 2019; Published: 11 July 2019

Abstract: 5G is expected to deal with high data rates for different types of wireless traffic. To enable
high data rates, 5G employs beam searching operation to align the best beam pairs. Beam searching
operation along with high order modulation techniques in 5G, exhausts the battery power of user
equipment (UE). LTE network uses discontinuous reception (DRX) with fixed sleep cycles to save UE
energy. LTE-DRX in current form cannot work in 5G network, as it does not consider multiple beam
communication and the length of sleep cycle is fixed. On the other hand, artificial intelligence (AI) has
a tendency to learn and predict the packet arrival-time values from real wireless traffic traces. In this
paper, we present AI based DRX (AI-DRX) mechanism for energy efficiency in 5G enabled devices.
We propose AI-DRX algorithm for multiple beam communications, to enable dynamic short and long
sleep cycles in DRX. AI-DRX saves the energy of UE while considering delay requirements of different
services. We train a recurrent neural network (RNN) on two real wireless traces with minimum root
mean square error (RMSE) of 5 ms for trace 1 and 6 ms for trace 2. Then, we utilize the trained RNN
model in AI-DRX algorithm to make dynamic short or long sleep cycles. As compared to LTE-DRX,
AI-DRX achieves 69% higher energy efficiency on trace 1 and 55% more energy efficiency on trace 2,
respectively. The AI-DRX attains 70% improvement in energy efficiency for trace 2 compared with
Poisson packet arrival model for λ = 1/20.

Keywords: discontinuous deception; multiple beam communications; artiﬁcial intelligence; energy

efﬁciency; 5G; wireless communications

1. Introduction
The use of cellular gadgets, like smartphones, notebooks, and tablets has comforted our life.
The Ericsson mobility report predicts the rise of cellular traffic to 8.8 billion by 2024 [1]. These extensive
growing cellular users require improved data rates with heterogeneous services in next generation
networks. 5G expects to deal with various types of traffics including periodic and delay tolerant traffic
for IoT devices or burst type of traffic for delay intolerant services [2,3]. 3rd Generation Partnership
Project (3GPP) planned the standardization of GHz spectrum (mm-wave) to address the users’ demand
of high bandwidth. However, communication over high-frequency bands of the mm-wave requires
directional air interface and narrower beams to reduce the path loss. In directional air interface, UE has
to search for best beam pairs and make adequate beam alignment with the next generation nodeB (gNB)
[4]. In addition to beam searching process in 5G networks, massive MIMO, higher order modulation
schemes and advanced coding techniques also increase UE energy expenses.

Electronics 2019, 8, 778; doi:10.3390/electronics8070778 243 www.mdpi.com/journal/electronics

Electronics 2019, 8, 778

Long term evolution (LTE) networks utilize DRX mechanism to reduce the energy consumption
of the UE [2,5]. DRX enables a UE to save energy by switching off the radio circuitry part, in case of no
incoming data. LTE-DRX turns off radio part during long and short sleep cycles in order to reduce the
energy consumption of UE. The sleep cycles in LTE-DRX are of the fixed time period. If any new packet
arrives during the fixed time sleep cycle, the packets will be buffered at evolved NodeB (eNB). The eNB
serves the stored packets after the completion of each sleep cycle in LTE-DRX. Hence, the LTE-DRX
saves the UE energy at the cost of delay [6,7]. LTE-DRX in its current form is not suitable for energy
savings in 5G enabled cellular devices [8,9], due to two main reasons.
• The communications in LTE do not consider beamforming, whereas in 5G networks UE has to
align the best beam pair before the start of communications.
• The LTE-DRX mechanism has a fixed length of sleep duration, which increases the energy
consumption and delay. Hence, LTE-DRX is not suitable for low latency communications in
5G networks.
Authors in [3] propose DRX for 5G network, which requires UE to search for best beam pairs
after completion of each sleep cycle in order to serve the packets. The additional beam searching
operation after each sleep cycle enhances the energy consumption of UE. The work in [10] suggests
beam aware DRX approach for 5G enabled machine to machine communications, in which UE has
prior information about best beam pairs. Similarly, beam aware DRX mechanism for energy saving
in the 5G network is proposed in [11]. The beam aware mechanism may not perform well in case
of UE mobility and beam misalignment [12]. Kwon et al. [12] show that the probability of beam
misalignment increases with an increase in UE velocity. The probability of misalignment is 0.1 and
0.38 if a UE moves with the velocity of 30 Km/h and 60 Km/h, respectively.
The DRX with built-in state of beamforming is proposed by Liu et al. [13]. Authors present the
concept of DRX for multiple beam communications and utilize the semi-Markov model to design
eleven-states of DRX. Their approach considers the beam training process only in case of beam
misalignment and after completion of long sleep duration. Hence, authors save the energy of UE while
minimizing the delay for 5G services. However, the fixed duration of sleep cycles in their approach
may cause more energy consumption.
On the other hand, Recurrent Neural Network (RNN) in AI has shown incredible results to predict
the upcoming value of a given sequence [14]. Long Short-Term Memory (LSTM) is a popular type
of RNN that is specially designed to learn long-term dependencies of a sequence for predicting the
upcoming value of a sequence [15]. The term long-term dependency refers to the sequence, whose
desired/current output values (prediction results) depend on long-sequence of previous input values
rather than the only single previous input value.
Motivated by the success of RNN to learn and predict long-term dependent sequence values in
various applications [11,14,16], we use RNN to extract the pattern of packet arrival time from real
wireless traffic traces and to predict the values of the next packet arrival time. Based on the prediction
results, we propose an AI-DRX algorithm that works on a ten-state DRX model to enable energy saving.
AI-DRX for multiple beam communications in 5G network saves the UE energy by enabling dynamic
short and long sleep cycles, respectively. To be more specific, the following are our key contributions
to save UE energy in multiple beam communications scenario of 5G networks.

• We perform the training of the LSTM neural network on wireless traffic. During training, LSTM
network extracts the packet arrival time pattern from the wireless traffic trace. The prediction
results show that the trained model predicts with minimum RMSE of 5 ms on random test set
from trace 1 and 6 ms on random test set from trace 2, respectively.
• We devise DRX as a ten-state model.
• We propose an artificial intelligence based DRX mechanism for multiple beam communications in
5G networks. We suggest AI-DRX algorithm using ten state model to enable dynamic short or
long sleep cycles, depending on the prediction results.

244
Electronics 2019, 8, 778

• We evaluate the performance of AI-DRX in terms of energy efﬁciency and mean delay. AI-DRX
achieves the energy efﬁciency of 59% on trace 1 and 95% on trace 2, respectively; while considering
the mean delay requirements of different services.

The remaining paper is organized as follows. In Section 2 we present an overview of existing DRX
mechanism and introduce the RNN with focus on LSTM neural network. Section 3 proposes AI-DRX
algorithm with a ten-state DRX model, to enable dynamic short or long sleep cycles in multiple beams
communications of the 5G network. Section 4 presents the performance analysis of AI-DRX in terms of
energy efﬁciency and mean delay. Finally, we conclude our work in Section 5.

2. Related Work

2.1. DRX in LTE

LTE networks use DRX mechanism to minimize the power consumption of UE. The energy
expenditure of UE can be reduced by switching off radio components during the unavailability of
incoming data packets [17]. LTE-DRX is conﬁgured and controlled by evolved node B (eNB). The eNB
informs the UE to turn off its radio components in case of no data in the buffer. The LTE-DRX can be
regulated by radio resource control (RRC) layer at eNB. RRC sends the packets’ information to UE
via a physical downlink control channel (PDCCH). RRC operates in two modes: (1) RRC_connected
mode; (2) RRC_idle mode, after a UE is turned on [18]. The RRC_idle mode is only responsible
for paging operations, UE neither receives nor transmits the data, but only monitors the paging
signals during the paging occasion. Whereas, all the data exchange between UE and eNB takes place
during RRC_connected mode. Since all transmissions take place during the RRC_connected mode
and this mode is responsible for more energy consumption of UE, our work focuses on the DRX in
RRC_connected mode. Figure 1 shows the LTE-DRX in RRC_connected mode that works on two types
of states. These states are:

• Active State
• Sleep State

RRC Inactivity
Timer

RRC_
RRC_ idle
connected
DRX
Inactivity
Timer Packet Activity
Active

RRC_connected Mode
Short
Long RRC_idle Mode
Sleep
Sleep
DRX Sleep State

DRX Active State

Figure 1. LTE-DRX in RRC_connected mode.

Based on the above two states, different DRX parameters are conﬁgured in RRC_connected mode
while considering various services’ delay requirements.

245
Electronics 2019, 8, 778

2.1.1. Active State

A UE sends or receives the data packets during DRX active state. This state does not allow UE to
save the energy because UE has to receive the data packets during this state. Moreover, a UE has to
turn on the radio circuitry throughout the active state. The active state of DRX can be controlled by
two parameters.

• Inactivity timer
• Active time

All the data packets are transmitted/received during the active time. The inactivity timer is a
countdown timer in DRX active state. This timer re-starts every time a new data packet arrives at eNB
and then eNB serves the received packets to UE. In case of no new packet arrival, the inactivity timer
gets expired and the UE switches to sleep state for a certain time period.

2.1.2. Sleep State

A UE turns off the radio components during sleep state in order to save power. UE cannot receive
or send the data packets during DRX sleep state but can only monitor the PDCCH for any incoming
data. DRX in a sleep state is controlled by the following parameters:

• ON time
• Short sleep cycle
• Short sleep timer
• Long sleep cycle
• Long sleep timer

During ON time, a UE monitors the PDCCH. ON time always starts after completion of each
sleep cycle. A short sleep cycle is a small duration of time that saves the UE energy by switching off
the transceiver part. A short sleep cycle is repeated up to Nsc number of short sleep cycles. After
the expiry of Nsc , a UE transits to the long sleep cycle. The long sleep cycle and long sleep timer are
similar to short sleep cycle and short sleep timer but have a longer time period than the counterpart,
respectively.
Figure 2 delineates the timing diagram of LTE-DRX. As shown in Figure 2, all the data transmission
and reception take place during the active state. After reception of each new packet, inactivity timer
restarts. In the case that no new packet reaches and inactivity timer ﬁnishes the countdown timer, then
UE switches to short sleep cycle. After every short sleep cycle, a UE monitors the PDCCH for any
incoming packet during the ON time. If no new packet arrives before the completion of short sleep
timer, the UE switches to the long sleep cycle and remains there until intimation of the new packet
is received. UE transits from a long sleep to idle state if no new packets arrive and the long sleep
timer expires.

246
Electronics 2019, 8, 778

Incoming Data Packets Sleep State

Time

ON Time

ON Time
Long Sleep Cycle
Short Sleep Cycle
ON Time
Short Sleep Cycle
Inactivity
Timer

Active State
ON Time Sleep Cycle

Incoming Data Packets Inactivity Timer Active Time

Figure 2. LTE-DRX timing diagram.

2.2. Non-Compatibility of LTE-DRX for 5G Networks

The LTE-DRX mechanism in its existing form is not suitable for 5G networks. The main reason
for non-suitability is the directional communications in 5G networks, which require beamforming
operation prior to data transmission [4]. Whereas, LTE-DRX mechanism does not include beam
searching operation [19,20]. Figure 3 shows the concept of directional communication with multiple
beams in 5G network. The gNB transmits K number of beams and a UE has L number of beams.
A UE needs to search and align the best beam pair form K × L beam pairs, prior to the start of
communications [4]. The process of beam searching and beam alignment in 5G is not included in
LTE-DRX. Moreover, the process of beam searching in 5G network causes a UE to remain in the
active state more than that of the LTE-DRX, which also increases the power consumption of UE [21].
Furthermore, the beamforming process in 5G requires additional time for beam searching and aligning
best beam pairs [22]. The additional beam searching and beam alignment time cause more delay
than the delay in LTE-DRX. Hence, directional communications in 5G is one of the reasons, which
makes LTE-DRX mechanism in its existing form, not a suitable solution for power saving in 5G
networks [21,22].
The second main reason for non-compatibility of existing LTE-DRX mechanism with 5G networks
is the fixed length of sleep cycles in LTE-DRX [11]. LTE-DRX uses fixed length short and long sleep
cycles to economize the power consumption of UE. Whereas, 5G network is expected to deal with
different types of services, simultaneously [4]. These services may have different size of packet lengths,
variable packet arrival time and variable transmission time interval (TTI) [23]. The fixed length of sleep
cycles in LTE-DRX mechanism may not be suitable for 5G services as these cycles may under-utilize or
over-utilize TTI. Moreover, fixed length sleep cycles in 5G may be responsible for least power savings
in 5G enabled devices [23]. However, DRX design can still be used for 5G [9,24,25].

247
Electronics 2019, 8, 778

Tx BeamsTx Beam 1

Rx Beams
… …
Beam
Alligned

UE
gNB
Figure 3. Directional communication with multiple beams in 5G network.

2.3. Recurrent Neural Network for Predicting Sequential Data

Recurrent Neural Network (RNN) is a part of AI that is widely used for predictions over sequential
data [26]. Long short-term memory (LSTM) network is one of the popular kinds of RNN that performs
well while predicting the long-term dependent time sequences [15]. LSTM is different from other
kinds of RNN due to its gated structure and internal cell state in every single unit, as shown in
Figure 4. LSTM neural network utilizes various previous inputs to decide the values of internal gates
(forget gate, input gate, input modulation gate, output modulation gate), which contributes to cell
state. Cell state helps the LSTM to remember or forget the impact of past inputs while deciding the
prediction results. This is the main reason LSTM has better performance for predicting the long-term
dependent sequences [14,15]. Table 1 shows the notations used in our work.
LSTM unit includes forget gate FT that can be mathematically shown by Equation (1). The output
of forget gate varies from 0 to 1, due to sigmoid Sig = 1+1e−T function. The letters wx and wh in
the Equation (1) represent the weights associated with the current input x T and previous output
h T −1 , respectively. The term bF is the bias for the forget gate. Similarly, the input gate (IT ) and
input modulation gate (GT ) can be calculated by Equations (2) and (3), respectively. The hyperbolic
tangent function in input and output modulation gates can be mathematically written as tanh( T ) =
2Sig(2T ) − 1 and ranges from −1 to +1. Moreover, b I and bC are biases for input gate and cell state,
respectively.
FT = Sig(wx x T + wh h T −1 + bF ) (1)

IT = Sig(wx x T + wh h T −1 + b I ) (2)

GT = tanh (wx x T + wh h T −1 + bC ) (3)

248
Electronics 2019, 8, 778

Current
Packet Time
6 LJ Input Gate Output Gate 6 LJ
[7
,7 27
*7 &7 Hyp

Hyp
Tan Cell Tan K7
Input Output Predicted
K7 Modulation Gate Modulation Gate Time of
Previous
& 7 Next Packet
Packet Time )7
Value 6 LJ Forget Gate

6 LJ Sigmoid Hyp
Tan
Hyberbolic Tangent Pointwise Multiplicative Pointwise Additive
Operation
Layer Layer Operation

Figure 4. Gated structure of single long short-term memor (LSTM) unit.

Table 1. List of abbreviations.

Abbreviation Notation Abbreviation Notation

Forget gate FT DRX active time TAc
Input modulation gate GT DRX ON time TON
Input gate IT DRX Inactivity timer TI N
Output layer OT Packet inter-arrival time λκ
Cell state CT DRX Dynamic sleep time TDY
Output modulation gate hT Weight corresponding to each gate wi

Cell state CT and the output layer OT can be calculated by Equations (4) and (5), respectively.
The ﬁnal prediction results can be observed by output modulation gate (h T ) using Equation (6).

CT = FT ⊗ CT −1 + IT ⊗ GT (4)

OT = Sig(wx x T + wh h T −1 + bO ) (5)

h T = OT ⊗ tanh(CT ) (6)

During the training process, LSTM learns to extract the relationship between the input and desired
output (upcoming value of sequence) by adjusting various weight values in Equations (1)–(3) and (5).
Once the model is trained with the least error, the learned weight values can be used in Equations (1)–(6)
to calculate the output values of a sequence (prediction). The error between prediction result and
observed (actual) value can be computed by the root mean square error (RMSE) and is given as:
7
∑ TN=1 ( Predicted T − Observed T )2
RMSE = (7)
N

where the notation T represents the number of samples and N shows the maximum number of samples.

249
Electronics 2019, 8, 778

3. AI-DRX for Multiple Beam Communications

3.1. System Model

In 5G directional air interface, beam information is a very crucial operation as a UE wakes
up to receive data or control packet [3]. Hence, our work considers DRX for multiple beam
communications [13], having dynamic long and short sleep cycles instead of ﬁxed length sleep cycles.
Dynamic sleep cycles have different values of sleep time in each cycle rather than one static time value
in each sleep cycle. The random values of dynamic sleep time can be predicted based on previous
values of packet arrival time.
We model DRX as a ten-state model. These states range from S0 to S9 and are shown in Figure 5.
Let us elaborate on the states below.

• S0 is the active time. A UE sends and receives the data during S0 state. UE predicts the dynamic
time TDY during S0 state.
• S1 state is threshold comparison state. S1 compares TDY with both threshold values (Th Min and
Th Max ) to decide whether to remain in the active time or switch to the dynamic sleep cycle.
• S2 state is dynamic short sleep cycle. It saves the UE power for a short period of time up to
TDY . The predicted value of TDY in the dynamic sleep cycle is always lower than that of value in
S3 state.
• S3 state highlights the dynamic long sleep cycle. It is similar to S2 state but has a larger sleep
period than S2 . The predicted value of TDY always has larger value of dynamic long sleep cycle
than that of the dynamic short sleep cycle.
• S4 state shows beam training during active time. This state allows UE to train and search for best
available beam pairs between gNB and UE.
• S5 state represents feedback after beam training operation in active time.
• S6 state shows active time after best beam pairs are aligned. S6 is different from S0 as S6 always
occurs after beam training and feedback process. Moreover, S6 enables UE to search beam pairs in
active time until best beam pairs are found.
• S7 state delineates beam training process after the execution of dynamic long or short sleep cycle.
• S8 state is feedback operation that occurs after beam training and after the execution of dynamic
long or short sleep cycle, respectively.
• S9 state is the ON period, a UE only monitors PDCCH for an incoming packet. UE could not send
or receive data packets during ON time.

3.2. Proposed AI-DRX Algorithm

We propose artiﬁcial intelligence based DRX (AI-DRX) algorithm for multiple beams
communications in 5G network. Our proposed algorithm enables DRX to achieve dynamic long
or short sleep cycles and to reduce the power consumption of UE. AI-DRX algorithm works on
ten-state model of DRX as shown in Figure 5. Algorithm 1 demonstrates AI-DRX mechanism and is
elaborated below:

• AI-DRX algorithm takes ON timer TON , minimum threshold value Th Min and maximum threshold
value Th Max as input (Line 1).
• AI-DRX examines the buffer for any incoming data packets (Line 2).
• If any packet is received in the buffer, the packets will be served and the value of dynamic sleep
time TDY will be predicted, simultaneously (Line 4).
• During the active time, if no beam misalignment occurs between UE and gNB, AI-DRX compares
the predicted value of dynamic sleep time TDY with threshold values Th Min and Th Max .
• If the predicted value of dynamic sleep time TDY is less than the minimum threshold value Th Min
(Line 8), UE continues to remain in the active time (Line 9).

250
Electronics 2019, 8, 778

• If the predicted value of dynamic sleep time TDY is greater than or equal to the minimum threshold
value Th Min and less than maximum threshold value Th Max (Line 11). UE observes dynamic short
sleep cycle (Line 12).
• In case of any beam misalignment after UE wakes up from dynamic short sleep cycle (Line 13), UE
starts the beam training process (Line 14) and Feedback (Line 15) followed by ON time (Line 16).
• During ON time, if beam pairs are still misaligned (Line 17), UE searches beam pairs again
(Line 18).
• During ON time, if a new packet arrives (Line 21), UE switches to active mode and start receiving
data packets (Line 22).
• After completion of ON time, if no new packet arrives, UE continues to sleep for previous
predicted time value TDY of the long sleep cycle or short sleep cycle. respectively (Line 24).
• If TDY is greater than Th Max (Line 28) the UE will go to dynamic long sleep cycle (Line 29).
• UE will perform beam training and feedback after completion of each dynamic long sleep cycle
(Line 30).
• In the case of beam misalignment during an active time (Line 35), UE performs beam training
(Line 36) and feedback (Line 37) and then re-enter active time (Line 38).
• During the active time (Line 38), if UE still ﬁnd beam misalignment (Line 39), AI-DRX executes
beam training process again (Line 40).
• After the alignment of beam pairs, the UE transits to active time (Line 42).

Predict 7'< Check 7'<

If No Beam
Active
Misalignemnt
6 6
7K

Packet
0

Arrives or
LQ
Misaligned

7'< 7K0LQ 7'< ! 7K0D[

Beam

7'
<

Beam

Train
7K

Dynamic
6
0

6 Long Sleep
D[

7'<
Beam Dynamic
No New Packet, Continue Sleep for

Train Short Sleep

7'<
Previous Predicted time

6
Beam Misaligned
6
Beam Misaligned

Feed
back
6
Beam Misaligned

Feed
back
7'<

6
Active 6
ON

Figure 5. State diagram for AI-DRX.

251
Electronics 2019, 8, 778

Algorithm 1 AI based DRX for Multiple Beam Communications.

1: Input TON , Th Min , Th Max
2: Examine buffer for incoming packets
3: if Packets in buffer > 0 then
4: Serve the packets and Predict TDY (S1 )
5: else
6: if No Beam Misalignment then
7: Check TDY (S1 )
8: if TDY < Th Min then
9: Go to Step 4 (S0 )
10: else
11: if Th Min ≤ TDY < Th Max then
12: Go to Dynamic Short Sleep up to TDY (S2 )
13: if Beam Misalignment then
14: Execute Beam Training (S7 )
15: Feedback (S8 )
16: ON (S9 )
17: if No Beam Aligned then
18: Go to Step 14 (S7 )
19: end if
20: else
21: if New packet arrives before completion of ON Time TDY then
22: Go to Step 4 (S9 to S0 )
23: else
24: Go to Dynamic sleep for previous predicted time TDY (S2 /S3 )
25: end if
26: end if
27: else
28: if TDY > Th Max then
29: Go to Dynamic Long sleep (S3 )
30: Go to Step 14 (S7 )
31: end if
32: end if
33: end if
34: else
35: if Beam Misalignment then
36: Execute Beam Training (S4 )
37: Feedback (S5 )
38: Active after Beam Training (S6 )
39: if No Beam Aligned then
40: Go to Step 36 (S4 )
41: else
42: Go to Step 4 (S0 )
43: end if
44: end if
45: end if
46: end if

3.3. AI-DRX for Enabling Dynamic Long and Short Sleep Cycles
Algorithm 1 (AI-DRX) demonstrates the use of artificial intelligence in the implementation of
DRX for 5G networks. AI-DRX makes dynamic short and long sleep cycles in DRX. AI-DRX utilizes
trained LSTM model to predict the upcoming value of packet arrival time and subsequently to enable
dynamic sleep cycles in DRX. The training process is conducted offline on two traces of real wireless
traffic acquired from the University of Massachusetts (UMass) trace repository [27] and Crawdad
data set repository [28]. Training process learns the packet arrival time pattern of increasing sequence
values from both traces. Once the LSTM network is trained offline with least prediction error, the
trained model predicts the upcoming packet time of real wireless traffic. AI-DRX algorithm calculates
the dynamic sleep cycles by using prediction results of upcoming packet arrival time value.

252
Electronics 2019, 8, 778

AI-DRX enables a UE to reduce power consumption in multiple beam communications scenario

of 5G networks. AI-DRX saves energy by enabling dynamic short and long sleep cycles, respectively.
AI-DRX enables dynamic short sleep cycles or dynamic long sleep cycles based on two threshold values
Th Min and Th Max . The values of Th Min and Th Max can be varied according to delay requirements of
various services. We discuss three cases of AI-DRX: (1) dynamic short sleep cycles; (2) dynamic long
sleep cycles; (3) dynamic inactivity timer.
During an active time of AI-DRX, the packets are served to UE. The trained LSTM model predicts
the value of upcoming packet arrival time (TDY ) while serving the packets. UE checks for new packets
in the buffer via PDCCH during ON state of AI-DRX. If no new packet is observed in the buffer, UE
continues to sleep up to TDY . However, if a new packet is observed in the ON state, UE transits to
active time and receives the packet(s). Meanwhile, inactivity timer in the active mode is restarted on
the reception of every new packet. Furthermore, AI-DRX also considers the case of the active state
having empty buffer, the UE counts down for dynamic inactivity timer (third case below) to complete
and then transits to sleep cycle for predicted sleep time (TDY ).
The ﬁrst case enables dynamic short sleep cycle if the value of TDY is greater than or equal to
Th Min and less than Th Max . The condition of dynamic short sleep cycle gets satisﬁed and UE sleeps
for the predicted time value of TDY . There are very small chances of beam misalignment after a short
sleep period [12,13]. Hence, AI-DRX considers the beam training process after dynamic short sleep
cycles only if beams are misaligned. Figure 6 shows the timing diagram of dynamic short sleep cycle
using AI-DRX algorithm.

Dynamic Inactivity Timer 7'<

Predicting 7'< Predicting 7'<
W '< 7K0LQ 7K0LQ d W '< 7K0D[

Active Active Active

Dynamic Short Sleep

Beam Training
Feedback
ON Time

ON Time

Active State

Dynamic Sleeping State

Beam Training

Feedback ON Time

Figure 6. Timing diagram of AI-DRX based dynamic short sleep cycle.

The second case considers the dynamic long sleep cycle up to the predicted value of TDY , if TDY is
greater than Th Max . The concept of the dynamic long sleep cycle is depicted in Figure 7. The dynamic
long sleep cycle saves more energy than that of short sleep cycle. Moreover, there are more chances of
beam misalignment after the dynamic long sleep cycles. Hence, AI-DRX performs beam training and
feedback after completion of each dynamic long sleep cycle.
The third case deals with dynamic inactivity timer. If the predicted value of TDY is less than
Th Min , the UE will remain active until the period TDY or any new packet arrives. The concept of
dynamic inactivity timer can be seen in Figures 6 and 7. AI-DRX also addresses the problem of beam
misalignment during active time. AI-DRX performs beam training and feedback in case of any beam
misalignment during active time.
It may be noted that AI-DRX utilizes dynamic short and long sleep cycles instead of static ﬁxed
time sleep cycles. Moreover, the inactivity timer value is also dynamic according to the prediction
results (using trained LSTM model). Our proposed algorithm keeps updating the predicted time value
based on most recent received packets and their packet arrival time.

253
Electronics 2019, 8, 778

Dynamic Inactivity Timer 7'<

Predicting 7'< Predicting 7'< Predicting 7'<
7'< 7K0LQ 7'< ! 7K0D[ 7'< ! 7K0D[

Active Active Active Dynamic Long

Active
Dynamic Long

Beam Training

Beam Training
Beam Training

Feedback

Feedback
Feedback

ON Time
ON Time
ON Time
Sleep Sleep

Active State Dynamic Sleeping State Beam Training Feedback ON Time

Figure 7. Timing diagram of AI-DRX based dynamic long sleep cycle.

DRX saves the power of UE at the cost of delay. Hence, we considered two performance
parameters: energy efﬁciency (EE) and mean delay. The performance parameter; EE of the UE
is the ratio of dynamic short sleep time and dynamic long sleep time to the sum of active time (TAC ),
ON time (TON ), beam training time, feedback time, dynamic short sleep (TDY ) time and dynamic long
sleep time (TDY ). EE can be calculated by Equation (8). The beam training and feedback processes are
considered during active time (TAC ) and dynamic short and long sleep cycles take place during (TDY ).

TDY
EE = (8)
TAC + TON + TDY

Similarly, for packet inter-arrival time λκ , the mean holding period of active state can be calculated
as [5]:
1 − e−λκ TI N
TAC = −λ T (9)
e κ I N (1 − e − λκ )
Furthermore, the packets arrived during the sleep state and ON duration are stored in the buffer
until next ON period. The packet arrival events are the random observer to the sleep period and ON
state. Therefore, the mean delay is deﬁned as the sum of mean sleep time and ON duration and is
given as:
T T
Mean Delay = DY + ON (10)
2 2

4. Performance Analysis
We use MATLAB 2019a for training and testing of RNN on two different traces (data sets).
These traces of burst traffic type are taken from the Crawdad dataset repository (trace 1) [28] and
UMass trace repository (trace 2) [27]. The trace 2 shows the traffic pattern of HTTP and video streaming
applications. The video streaming trace is used as it is expected by 2024 over three-quarters of mobile
data traffic will be video traffic [1]. The trace 1 and trace 2 include seven parameters as shown in [11].
These parameters are: (1) serial number; (2) packet arrival time in seconds; (3) source address of the
packets; (4) destination address of the packet; (5) protocol used; (6) length of packets in bytes; and
(7) additional information. We have utilized the time parameter from both traces to train the LSTM
network. Time parameter contains the information of packet arrival time. Our purpose is to train the
LSTM network until it learns the time interval pattern of packet arrival time from a given sequence.
Data pre-processing makes the training process simple and avoid training from the divergence [11,29].
Hence, we standardize the data with zero value of mean and unit value of the variance of the training
set during the training process. Moreover, we also standardize the test set during the prediction time.
Hyper-parameters in LSTM network are selected manually to make the training process more
efficient. These parameters include the number of hidden units, learning rate drop factor, the maximum
number of epochs, initial learning rate, and the optimizer used. The best selection of hyper-parameters

254
Electronics 2019, 8, 778

may result in the learning with least prediction error. The training process performance can be
measured in terms of RMSE. The less the value of RMSE, the better is the prediction results of the
trained model on unseen data (test set). We have selected 200 and 125 hidden units in LSTM network
during training process over trace 1 and trace 2, respectively. The value of the learning rate drop factor
remains 0.2 for both traces. Maximum number of epochs during the training process of trace 1 are
considered to be 600 while for trace 2 are 1000 epochs. Initial learning rate during the training process
of both traces remains 0.004. Furthermore, the optimizer used during the training process for both
traces is Adam optimizer. The above mentioned hyper-parameters reduce the RMSE during training
and testing processes.
We have considered the total length up to 130,820 sample values (323.886 s) of trace 1, while
trace 2 has 77,470 values of time samples (417.642 s) [11]. We have divided both traces into 10% of
samples as the training set and a number of different test sets randomly selected from the remaining
90% of both traces. Each test set has an equal number of samples. Figure 8a shows the RMSE for the
initial test set of trace 1 that is as small as 12 ms. Whereas, Figure 8b shows the RMSE value for trace 2
is 10 ms for the ﬁrst random test set.

3UHGLFWLRQ5HVXOWIRU)LUVW7HVW6HW7UDFH 3UHGLFWLRQ5HVXOWRQ)LUVW7HVW6HW7UDFH

7LPH PV

2EVHUYHG
3UHGLFWHG
2EVHUYHG

3UHGLFWHG

6DPSOHV
6DPSOHV
506( 506(

(UURU

6DPSOHV 6DPSOHV
(a) Trace 1. (b) Trace 2.
Figure 8. Prediction result and root mean square error (RMSE) for ﬁrst test set.

The RMSE values for random test sets from the remaining 90% samples of trace 1 and trace 2 are
shown in Figure 9. It can be seen from Figure 9a,c that the minimum value of RMSE is least at 6 ms
and 5 ms on random test sets from samples 70,500 to 71,953 and 90,700 to 92,153 of trace 1, respectively.
Whereas, Figure 9b,d show the RMSE value of 6 ms and 8 ms from 10,000 to 10,899 and 65,000 to
65,899 samples of trace 2, respectively.
AI-DRX can be implemented at the gNB of 5G network. During the execution of AI-DRX,
we consider packet generation event, active event, dynamic long sleep event, dynamic short sleep
event, ON duration event, beam searching event and feedback event. Moreover, our approach
enables dynamic sleep cycles in multiple beam communications scenario for 5G networks. The packet
generation event in our simulation scans the time column of both traces and produces the data packets
of the identical length on the same instant in the respective trace. The generated data packets are
collected in the buffer and served to the UE during the active event. UE checks the buffer during ON
period, in case of any packet in buffer, the UE switches to the active event, or else continues to sleep.
During an active event, the buffered packets are served to UE after getting the beam pairs alignment
between UE and gNB. At the same time, packet arrival time is inputted to the trained LSTM model to
predict the upcoming packet arrival time. We can obtain the dynamic sleep duration by subtracting
the previous packet arrival time value from the predicted dynamic time value TDY (upcoming packet
arrival time).

255
Electronics 2019, 8, 778

3UHGLFWLRQ5HVXOWRQ5DQGRP7HVW6HW7UDFH 3UHGLFWLRQ5HVXOWRQ5DQGRP7HVW6HW7UDFH

7LPH PV
7LPH PV

2EVHUYHG
2EVHUYHG )RUHFDVW

)RUHFDVW

6DPSOHV 6DPSOHV
506( 506(

(UURU
(UURU

6DPSOHV 6DPSOHV

(a) Test set from trace 1 (samples: 70,500 to 71,953). (b) Test set from trace 2 (samples: 10,000 to 10,899).

3UHGLFWLRQ5HVXOWRQ5DQGRP7HVW6HW7UDFH 3UHGLFWLRQ5HVXOWRQ5DQGRP7HVW6HW7UDFH

7LPH PV

2EVHUYHG
2EVHUYHG
)RUHFDVW )RUHFDVW

6DPSOHV 6DPSOHV
506( 506(

(UURU

(UURU

6DPSOHV 6DPSOHV
(c) Test set from trace 1 (samples: 90,700 to 92,153). (d) Test set from trace 2 (samples: 65,000 to 65,899).

Figure 9. Prediction results and RMSE for random test sets.

Figure 10a,b, demonstrate the energy efficiency and mean delay with varying ON Period (TON )
for AI-DRX and LTE-DRX over trace 1 & trace 2, respectively. In Figure 10, TON is varied from 1 ms
to 160 ms while the values for Th Min = 20 & Th Max = 100 are considered. Figure 10a highlights the
decrease in energy efficiency with increase in TON . The reason for the drop in energy efficiency is,
UE waits for longer period in ON state prior switching to active state (to serve packets). It can be
seen from Figure 10b, the mean delay for trace 1 ranges from 150 ms to 194 ms with an increase in
ON period. The reason for rise in mean delay lies in the feet that with an increase in ON duration,
the time spend by UE in ON state will be higher, which results in higher delay. Moreover, by selecting
the optimum value of TON , delay observed by UE can be minimized. From Figure 10a, it is noticed
that the energy efficiency of trace 2 is higher than that of trace 1 due to higher arrival rate of trace 1 as
compared to trace 2.
We have compared the performance of AI-DRX with LTE-DRX. To implement the LTE-DRX, we
select the value of short sleep cycle to Th Min , the value of long sleep cycle to Th Max and fed trained
model with trace 1 and trace 2. It is observed in Figure 10a AI-DRX energy efficiency for trace 1 is 69%
higher than that of LTE-DRX, at the cost of higher delay. The reason arises from the fact that AI-DRX

256
Electronics 2019, 8, 778

calculates sleep time form real wireless traffic trace based on arrival rate, while LTE-DRX uses Th Min
and Th Max to set the short and long sleep time. For AI-DRX, the small value of Th Min and Th Max
achieves higher energy efficiency as UE easily transits to long sleep time, which results in a larger
delay. Whereas, for small values of Th Min and Th Max in LTE-DRX, a UE sleeps for a short period that
results in less energy efficiency and smaller mean delay. AI-DRX achieves 55% higher energy efficiency
as compared to that of LTE-DRX using trace 2.

(QHUJ\(IILFLHQF\

0HDQ'HOD\ PV
$,'5;7UDFH
$,'5;7UDFH

$,'5;7UDFH /7('5;7UDFH
$,'5;7UDFH /7('5;7UDFH
/7('5;7UDFH
/7('5;7UDFH

213HULRG7 21 PV 213HULRG7 21 PV

(a) Energy Efﬁciency. (b) Mean Delay.

Figure 10. AI-DRX energy efﬁciency and mean delay with varying TON (Th Min = 20, Th Max = 100).

Figures 11 and 12 present the energy efficiency and mean delay with varying ON period TON for
Th Min = 200 & Th Max = 1000 and Th Min = 300 & Th Max = 1600, respectively. It is observed from
Figures 11 and 12 that the energy efficiency and mean delay of LTE-DRX increase with an increase
in Th Min & Th Max , as UE sleep longer. The energy efficiency of AI-DRX decreases with an increase
in Th Min & Th Max , as UE will not able to transit to sleep state if TDY < Th Min . With a small value of
sleep time TDY or higher value of Th Min & Th Max , UE remains in the active state, which results in less
energy efficiency.

(QHUJ\(IILFLHQF\

0HDQ'HOD\ PV

$,'5;7UDFH $,'5;7UDFH
$,'5;7UDFH $,'5;7UDFH
/7('5;7UDFH /7('5;7UDFH
/7('5;7UDFH /7('5;7UDFH

213HULRG7 21 PV 213HULRG7 21 PV

(a) Energy Efﬁciency. (b) Mean Delay.

Figure 11. AI-DRX energy efﬁciency and mean delay with varying TON (Th Min = 200, Th Max = 1000).

257
Electronics 2019, 8, 778

$,'5;7UDFH
$,'5;7UDFH

/7('5;7UDFH
/7('5;7UDFH
(QHUJ\(IILFLHQF\

0HDQ'HOD\ PV

$,'5;7UDFH
$,'5;7UDFH

/7('5;7UDFH
/7('5;7UDFH

213HULRG7 21 PV 213HULRG7 21 PV

(a) Energy Efﬁciency. (b) Mean Delay.

Figure 12. AI-DRX energy efﬁciency and mean delay with varying TON (Th Min = 300, Th Max = 1600).

To validate our proposal, we have compared our work to traditional Poisson arrival model.
We generated three data sets considering Poisson arrival rate (λ) with mean value of λ = 1/20,
λ = 1/10, and λ = 1. We have trained model using Poisson arrival rate. The generated traces are
fed to AI-DRX algorithm to analyze the energy efficiency and mean delay. Figure 13a,b shows the
energy efficiency and mean delay for AI-DRX trace 2 and Poisson arrival with varying TON . The energy
efficiency of AI-DRX is 70% higher than Poisson arrival rate (λ = 1/20). The corresponding mean delay
of AI-DRX is 100 ms (on an average) higher as compared to Poisson arrival for λ = 1/20. The gain in
energy efficiency is achieved as AI-DRX considers real traffic arrival rate for selection of sleep cycles
and inactivity timer, while Poisson arrival considers mean arrival rate. The energy efficiency and mean
delay both are zero for higher Poisson arrival rate λ = 1. The reason lies in the fact that for higher
arrival rate, UE could not transit to sleep state to save the power.

(QHUJ\(IILFLHQF\

0HDQ'HOD\ PV

$,'5;7UDFH
3RLVVRQ$UULYDO
3RLVVRQ$UULYDO $,'5;7UDFH
3RLVVRQ$UULYDO 3RLVVRQ$UULYDO
3RLVVRQ$UULYDO

3RLVVRQ$UULYDO

213HULRG7 21 PV 213HULRG7 21 PV

(a) Energy Efﬁciency. (b) Mean Delay.

Figure 13. AI-DRX energy efﬁciency and mean delay comparison with Poisson Arrival with varying
TON (Th Min = 300, Th Max = 1600).

Various services in a wireless network can tolerate different delay levels while not compromising
quality of service (QoS). QoS class identifier (QCI) is a metric that is used to identify the characteristics
of traffic. QCI measures the QoS with two parameters; (1) packet loss rate (PLR) and (2) packet
delay budget (PDB). PDB can be defined as maximum tolerable waiting time by a packet during
its delivery from eNB to UE. Standardized QCI characteristics are shown in Table 2 [6,30,31]. In

258
Electronics 2019, 8, 778

various kinds of non-real-time services like email, web browsing, after a certain time period, a
UE does not require to monitor PDCCH continuously [6,32]. Hence, these types of services can
tolerate higher delays up to 300 ms [6,30,31]. These types of services require DRX with higher values
of sleep time (TDY > Th Min & TDY > Th Max ) and smaller values of ON timer for better energy
efficiency. Whereas, the real-time services like voice and live video streaming cannot tolerate delay
[33]. Therefore, the delay should be a higher priority than energy saving. If we limit the mean
delay to 125 ms, the energy efficiency will be 95.02%. For this energy efficiency we have to select
Th Min = 200 , Th Max = 1000, TON = 6 ms. If we increase the TON the energy efficiency of UE
decreases as UE remains in active state. The mean delay also increases with increase in TON as UE
does not receive the data during ON period but only monitors the Physical Downlink Control Channel
(PDCCH). The mean delay observed by UE increases to 150 ms when TON = 80, with energy efficiency
of 80.2%. The network can maximize the energy efficiency of UE by selecting optimum value of
Th Min , Th Max , TON depending on QCI value of different services.

Table 2. Standardized QoS class identiﬁer (QCI) characteristics in LTE/LTE-A [6,30,31].

Type Qos Class Identiﬁer Packet Loss Rate Packet Delay Budget Examples
GBR 1 10−2 100 ms Voice services
GBR 2 10−3 150 ms Live streaming services
GBR 3 10−3 50 ms Real time gaming services
GBR 4 10−6 300 ms Buffered streaming services
Non-GBR 5 10−6 100 ms IMS Signaling services
Non-GBR 6 10−6 300 ms TCP based application services
Non-GBR 7 10−3 100 ms Interactive gaming services
Non-GBR 8 10−3 300 ms TCP based video services
Non-GBR 9 10−6 300 ms TCP based video services

5. Conclusions
In this work, we have suggested an AI-based DRX mechanism for energy saving in multiple
beams communications scenario. We have modeled DRX as a ten-state model and suggested AI-DRX
algorithm depending on these 10 states. AI-DRX algorithm enables dynamic short and long sleep
cycles for energy efficiency of UE in the 5G network. We have trained LSTM network, a popular type
of RNN, to extract the packet arrival time pattern from real wireless traffic traces. Later, we have
utilized the learned model in AI-DRX algorithm for energy saving in 5G enabled devices. AI-DRX
economizes power consumption of a UE by enabling dynamic short and long sleep cycles. Extensive
training with selected hyper-parameters achieves the least RMSE of 5 ms on a random test set from
trace 1 and 6 ms on a random test set from trace 2, respectively. The energy efficiency obtained with
AI-DRX is approximately 60% and 95% for trace 1 and trace 2, respectively. AI-DRX achieves 69%
higher energy efficiency on trace 1 and 55% more energy efficiency on trace 2 as compared to LTE-DRX,
respectively. We also validated the performance of AI-DRX with traditional Poisson packet arrival
model. AI-DRX attains 70% more energy efficiency on trace 2 as compared to Poisson packet arrival
rate for λ = 1/20.

Author Contributions: Conceptualization, M.L.M. and M.K.M.; methodology, A.R.; investigation, M.L.M. and
M.K.M.; resources, M.L.M.; data curation, M.L.M.; writing—original draft preparation, M.L.M. and M.K.M.;
writing—review and editing, N.S. and A.R.; supervision, N.S., A.R. and D.R.S.; project administration, D.R.S.;
funding acquisition, D.R.S.
Funding: This research received no external funding.
Acknowledgments: This research was supported by the Basic Science Research Program through the National
Research Foundation of Korea (NRF) funded by the Ministry of Education (NRF-2016R1D1A1B03935633).

259
Electronics 2019, 8, 778

Conﬂicts of Interest: The authors declare no conﬂict of interest.

References
1. Ericsson Mobility Report June 2019. Available online: https://www.ericsson.com/assets/local/mobility-
report/documents/2019/ericsson-mobility-report-june-2019.pdf (accessed on 12 June 2019).
2. Fowler, S.; Shahidullah, A.O.; Osman, M.; Karlsson, J.M.; Yuan, D. Analytical evaluation of extended DRX
with additional active cycles for light traffic. Comput. Netw. 2015, 77, 90–102. [CrossRef]
3. Maheshwari, M.K.; Agiwal, M.; Saxena, N.; Roy, A. Directional Discontinuous Reception (DDRX) for
mmWave enabled 5G Communications. IEEE Trans. Mobile Comput. 2018. [CrossRef]
4. Agiwal, M.; Roy, A.; Saxena, N. Next generation 5G wireless networks: A comprehensive surve.
IEEE Commun. Surv. Tut. 2016, 18, 1617–1655. [CrossRef]
5. Zhou, K.; Nikaein, N.; Spyropoulos, T. LTE/LTE-A discontinuous reception modeling for machine type
communications. IEEE Wireless Commun. Lett. 2013, 2, 102–105. [CrossRef]
6. Maheshwari, M.K.; Roy, A.; Saxena, N. DRX over LAA-LTE—A New Design and Analysis based on
Semi-Markov Model. IEEE Trans. Mob. Comput. 2018, 18, 276–289. [CrossRef]
7. Ferng, H.-W.; Tseng, Y.-T.; Wang, T.-H. Parameter-Adaptive Dynamic and Adjustable DRX Schemes for
LTE/LTE-A. IEEE Trans. Green Commun. Netw. 2019. [CrossRef]
8. Gódor, I. Revitalizing DRX for enhanced mobile sleep modes. Comput. Commun. 2019, 138, 98–105. [CrossRef]
9. 3GPP TSG-RAN WG2 #94 Tdoc R2-164023 Std., 2016. Available online: https://www.3gpp.org/ftp/tsg_
ran/WG2_RL2/TSGR2_94/Docs/R2-164023.zip (accessed on 4 April 2019).
10. Ho, C.H.; Huang, A.; Hsieh, H.-J.; Wei, H.-Y. Energy-Efficient Millimeter-Wave M2M 5G Systems with
Beam-Aware DRX Mechanism. In Proceedings of the 2017 IEEE 86th Vehicular Technology Conference
(VTC-Fall), Toronto, ON, Canada, 24–27 September 2017.
11. Mudasar, L.M.; Maheshwari, M.K.; Shin, D.R.; Roy, A. Deep-DRX: A framework for deep learning–based
discontinuous reception in 5G wireless networks. Trans. Emerg. Telecommun. Technol. 2019, 30, e3579.
12. Kwon, S.-W.; Hwang, J.; Agiwal, A.; Kang, H. Performance analysis of DRX mechanism considering analogue
beamforming in millimeter-wave mobile broadband system. In Proceedings of the 2014 IEEE Globecom
Workshops (GC Wkshps), Austin, TX, USA, 8–12 December 2014.
13. Liu, D.; Wang, C.; Rasmussen, L.K. Discontinuous Reception for Multiple-Beam Communication. IEEE Access
2019, 7, 46931–46946. [CrossRef]
14. Wang, J.; Tang, J.; Xu, Z.; Wang, Y.; Xue, G.; Zhang, X.; Yang, D. Spatiotemporal modeling and prediction
in cellular networks: A big data enabled deep learning approach. In Proceedings of the IEEE INFOCOM
2017—IEEE Conference on Computer Communications, Atlanta, GA, USA, 1–4 May 2017.
15. Gers, A.F.; Schmidhuber, J.; Cummins, F.A. Learning to forget: Continual prediction with LSTM.
Neural Comput. 1999, 12, 850–855.
16. Google. Google AI Blog. Available online: https://ai.googleblog.com/2018/05/duplex-ai-system-for-
natural-conversation.html (accessed on 1 June 2019).
17. Bontu, C.S.; Illidge, E. DRX mechanism for power saving in LTE. IEEE Commun. Mag. 2009, 47, 48–55.
[CrossRef]
18. Evolved universal terrestrial radio access (E-UTRA); Medium access control (MAC) protocol specification.
Available online: https://portal.3gpp.org/desktopmodules/Specifications/SpecificationDetails.aspx?
specificationId=2437 (accessed on 2 March 2018)
19. Chang, H.-L.; Tsai, M.-H. Optimistic DRX for Machine-Type Communications in LTE-A Network. IEEE Access
2018, 6, 9887–9897. [CrossRef]
20. Chang, H.-L.; Lu, S.-L.; Chuang, T.-H.; Lin, C.-Y.; Tsai, M.-H.; Sou, S.-I.Optimistic DRX for machine-type
communications. In Proceedings of the 2016 IEEE International Conference on Communications (ICC),
Kuala Lumpur, Malaysia , 22–27 May 2016 .
21. Agiwal, M.; Maheshwari, M.K.; Saxena, N.; Roy, A. Directional-DRX for 5G wireless communications.
Electron. Lett. 2016, 52, 1816–1818. [CrossRef]
22. Maheshwari, M.K.; Agiwal M.; Saxena, N.; Roy, A. Hybrid directional discontinuous reception (HD-DRX)
for 5G communication. IEEE Commun Lett. 2017, 21, 1421–1424. [CrossRef]

260
Electronics 2019, 8, 778

23. Maheshwari, M.K.; Roy, A.; Saxena, N. Analytical modeling of DRX with flexible TTI for 5G communications.
Trans. Emerg. Telecommun. Technol. 2018, 29, e3275. [CrossRef]
24. Study on new radio access technology Physical layer aspects. Available online: https://portal.3gpp.org/
desktopmodules/Specifications/SpecificationDetails.aspx?specificationId=3066 (accessed on 2 March 2018)
25. Study on new radio access technology Radio interface protocol aspects. Available online: https://
portal.3gpp.org/desktopmodules/Specifications/SpecificationDetails.aspx?specificationId=3070 (accessed
on 2 March 2018).
26. Rumelhart, D.E.; Hinton, G.E.; Williams, R.J. Learning Internal Representations by Error Propagation; Technical
Report; Institute for Cognitive Science, University of California: San Diego, CA, USA, 1985.
27. UMass Trace Repository. ACM MMSys conference dataset archive. Available online: http://traces.cs.umass.
edu/index.php/Mmsys/Mmsys (accessed on2 March 2018).
28. CRAWDAD Data Set Cambridge/Inmotion. Available online: https://crawdad.org/cambridge/inmotion/
20051001 (accessed on 2 March 2018).
29. Baughman, M.; Haas, C.; Wolski, R.; Foster, I.; Chard, K. Predicting Amazon spot prices with LSTM networks.
In Proceedings of the 9th Workshop on Scientific Cloud Computing, Tempe, AZ, USA, 11 June 2018.
30. Policy and charging control architecture. Available online: https://portal.3gpp.org/desktopmodules/
Specifications/SpecificationDetails.aspx?specificationId=810 (accessed on 2 March 2018).
31. Liang, J.M.; Chen, J.-J.; Cheng, H.-H.; Tseng, Y.-C. An energy-efficient sleep scheduling with QoS
consideration in 3GPP LTE-advanced networks for internet of things. IEEE J. Emerg. Sel. Top. Circ. Syst. 2013,
3, 13–22. [CrossRef]
32. Tseng, C.-C.; Wang, H.-C.; Kuo, F.-C.; Ting, K.-C.; Chen, H.-H.; Chen, G.-Y. Delay and power consumption in
LTE/LTE-A DRX mechanism with mixed short and long cycles. IEEE Trans. Veh. Technol. 2015, 65, 1721–1734.
[CrossRef]
33. Huang, B.; Tian, H.; Chen, L.; Zhu, J. DRX-aware scheduling method for delay-sensitive traffic.
IEEE Commun. Lett. 2010, 14, 1113–1115.

261
electronics
Article
Planar Array Diagnostic Tool for Millimeter-Wave
Wireless Communication Systems
Oluwole J. Famoriji 1, *, Zhongxiang Zhang 2 , Akinwale Fadamiro 1 , Rabiu Zakariyya 1 and
Fujiang Lin 1
1 Micro-/Nano Electronic System Integration R & D Centre (MESIC), University of Science and Technology of
China (USTC), Hefei 230026, China; [email protected] (A.F.); [email protected] (R.Z.);
[email protected] (F.L.)
2 Applied Electromagnetic Field Group, Microwave and Radio Frequency Laboratory, Hefei Normal
University, Hefei 230601, China; [email protected]
* Correspondence: [email protected]

Received: 8 October 2018; Accepted: 23 November 2018; Published: 3 December 2018

Abstract: In this paper, a diagnostic tool or procedure based on Bayesian compressive sensing
(BCS) is proposed for identification of failed element(s) which manifest in millimeter-wave planar
antenna arrays. With adequate a priori knowledge of the reference antenna array radiation pattern,
a diagnostic problem of faulty elements was formulated. Sparse recovery algorithms, including total
variation (TV), mixed 1 /2 norm, and minimization of the 1 , are readily available in the literature,
and were used to diagnose the array under test (AUT) from measurement points, consequently
providing faster and better diagnostic schemes than the traditional mechanisms, such as the back
propagation algorithm, matrix method algorithm, etc. However, these approaches exhibit some
drawbacks in terms of effectiveness and reliability in noisy data, and a large number of measurement
data points. To overcome these problems, a methodology based on BCS was adapted in this paper.
From far-field radiation pattern samples, planar array diagnosis was formulated as a sparse signal
recovery problem where BCS was applied to recover the locations of the faults using relevance vector
machine (RVM). The resulted BCS approach was validated through simulations and experiments
to provide suitable guidelines for users, as well as insight into the features and potential of the
proposed procedure. A Ka-band (28.9 GHz) 10 × 10 rectangular microstrip patch antenna array
that emulates failure with zero excitation was designed for far-field measurements in an anechoic
chamber. Both simulated and measured far-field samples were used to test the proposed approach.
The proposed technique is demonstrated to detect diagnostic problems with fewer measurements
provided the prior knowledge of the array radiation pattern is known, and the number of faults is
relatively smaller than the array size. The effectiveness and reliability of the technique is verified
experimentally and via simulation. In addition to a faster diagnosis and better reconstruction
accuracy, the BCS-based technique shows more robustness to additive noisy data compared to
other compressive sensing methods. The proposed procedure can be applied to next-generation
transceivers, aerospace systems, radar systems, and other communication systems.

Keywords: far-ﬁeld; antenna array; diagnosis procedure; noisy data; BCS; millimeter-wave

1. Introduction
Antenna array is a key technology component in various communication systems such as radar,
radio-astronomy, remote sensing, satellite communications, and next-generation (ﬁfth generation,
5G) wireless communications [1], where a very large number (in the hundreds) of radiating
elements are particularly used to meet the increasing demands of high radiation performance and
reconﬁgurability [2]. Conversely, the higher the number of radiating elements in the beam-forming

Electronics 2018, 7, 383; doi:10.3390/electronics7120383 263 www.mdpi.com/journal/electronics

Electronics 2018, 7, 383

configuration, the higher the probability of failed element(s) will be. This causes abrupt field variations
across the aperture of the array, and distortion in the radiation features (e.g., beamwidth, peak sidelobe,
and boresight). Therefore, the availability of reliable and effective diagnosis tools for large arrays
remains an asset, because manual dismantling and replacement operations consume excessive time
and cost, and are even unfeasible in satellite-borne installations. Currently, failure identification in
antenna arrays is a theoretical and practical important research domain. Detection of faulty elements in
antenna arrays is of great interest in both military and civilian markets. Upcoming technologies adopt
active or passive antenna arrays with a large number of elements [1–5]. For instance, millimeter-wave
transceivers implement multiple-input multiple-output (MIMO) features and beamforming for future
5G applications, as shown in Figure 1. The block diagram shows the location of the AWMF-0108 in a
5G MIMO system [6]. The integrated circuit (IC) contents in the circle are the gain and phase control
blocks with amplification and RX/TX switching. The first industrial and commercial millimeter-wave
quad-core IC transceiver for 5G applications is the AWMF-0108 [6]. Many compactable antennas
were designed for that purpose. Thus, some communication systems will evolve for 5G technology,
even before full deployment, which is not expected until 2020. The large number of elements in the
planar antenna required by the transceiver must function optimally. Failure in the element(s) causes
far-field degradation of antenna systems. The detection of failed elements from field measurements
taken from a suitable observation point is very important to re-calibrate the feeding network and
to reinstall the needed radiation characteristics by reconfiguring the excitations of the healthy
elements [1,3]. Testing of antennas is then a necessity when a certain number of elements exhibit fault.
Therefore, the fast diagnosis of complex antenna structures is always a fundamental need.
Far-field measurements are a very powerful approach for antenna array testing. The measurement
data are sequentially presented for probable failure identification in the array under test (AUT).
The matrix method algorithm (MMA) and back propagation algorithm (BPA) are the most commonly
used mechanisms to detect the number and corresponding positions of defective elements using
a reference antenna (healthy) and the AUT (defective). BPA [7] was established using the Fourier
relationship between radiated far-field and the field situated on the array aperture, and it is applicable
to planar antenna arrays. A generalized form of BPA is MMA [8,9]. MMA uses linear algebra standard
tools to stabilize the inversion matrix, which relates the array aperture field to radiated far-field.
However, MMA and BPA demand a large number of measurements, thus causing long post-processing
of large arrays. One approach to mitigate the problem is the use of a priori knowledge of the array
without failure; consequently, only the defective array elements are identified. The modeled diagnostic
problem is solved by employing available customs whose computation time is a little longer than
standard methodologies. At this point, it is evident that the total time taken to get the antenna array
diagnosed greatly depends on measurement time, with post-processing times having a higher order
of magnitude. This is why sparse recovery-based methods require fewer measurement numbers and
provide faster antenna array diagnosis.
Recently, compressive sensing emerged rapidly as a potential technique for solving sparse
recovery problems [10–20]. Within this context, the appropriateness of compressive sensing in
addressing the array diagnosis problem was examined in References [3,12,14,16,18,21]. Evidently,
faulty element distribution in array configurations in practice were found to be highly sparse because it
accounts for small non-null entries in the excitation vector of the transmit/receive modules. Beginning
from that hypothesis, 1 -norm minimization mechanism was applied successfully to detect failures
in planar arrays using a small number of near-field [9] or far-field [14] measurements. Conversely,
deterministic compressive sparse techniques require a “measurement matrix” to comply with the
restricted isometric property (RIP) condition, for which the estimation of large matrices remains an
open challenge [3,14,21]. An alternative is the probabilistic compressive sensing approach reported
in Reference [21] to diagnose linear arrays from far-field measurements. However, most of these
techniques were not tested experimentally.

264
Electronics 2018, 7, 383

Figure 1. Ka-band active antenna array formation which integrates multiple-input multiple-output
(MIMO) and beamforming.

In this work, the problem of antenna element excitation level was not examined; however,
we estimated field distribution on the array aperture. This helps us identify the modifications
of aperture field distribution as a result of factors that cannot be quantified by simple failure of
elements, such as different reflections of the array and its feed. The problem faced in getting more
information about the AUT is the larger number of unknowns. However, in sparse recovery methods,
the required number of measurements increases slowly and logarithmically with the number of
unknowns [10–13]. Hence, the field reconstruction scheme benefits more in sparse recovery-based
mechanisms. Different sparse recovery algorithms used to conduct antenna array diagnosis were
unveiled [14] and compared to the traditional BPA and MMA. In particular, total variation (TV) norm,
mixed 1 /2 norm, and minimization of the 1 norm were used to proffer solutions to the resulting
inversion issues. From the field reconstructed on the antenna aperture, Fuchs et al. [20] acquired a
good antenna diagnosis. The approach was applied to far-field simulation data generated from a
100-element antenna array. The performance of the diagnosis was evaluated and compared to the two
standard techniques (BPA and MMA) under different conditions. The approaches were also applied
to far-field measurement data of an antenna array with failure to justify the practical applicability
of the proposed schemes. Although there were many more works on sparse recovery methods in
applied electromagnetics and microwaves involving the diagnosis processes of antenna arrays [13,14],
experimental data, which are fundamental for testing any procedure, were reported in few of them.
In References [15–20], differential scenarios with sparse recovery algorithms were employed to
perform antenna diagnosis and retrieve element excitations. Reference [21] proposed a joint scheme
for adaptive diagnosis of antenna arrays using communication signal fusion (radar-communication
scheme) and the echoes of probe signals received at the same antenna. This method equally solved
the antenna diagnosis problem at low signal-to-noise ratios (SNRs) to ensure optimal performance
of smart sensors in wireless sensor networks. Also, Reference [22] attempted array diagnosis in
millimeter waves using compressive sensing. This work considered both full and partial blockage,
which occurs from a plethora of particles (such as ice, water droplets, salt, and dirt) and the technique
jointly computed the locations of the blocked elements, and the induced phase-shifts and induced
attenuation provided the prior knowledge of the angles of departure/arrival. Reference [23] proposed a
deterministic sampling strategy for failure detection in uniform linear arrays via compressed sensing or
a sparse recovery approach. This is an extension of the Weyl formula which is basically used for prime

265
Electronics 2018, 7, 383

numbers. The strategy obtained was good for nonprime number (i.e., valid for any number of array
elements). This sampling approach is good for sparse electromagnetic (EM) problems encompassing
Fourier matrices. Reference [24] gives a review of different capacities of sparse recovery by analyzing
how compressive sensing can be applied to antenna array synthesis, diagnosis, and processing.
Illustrations of a set of applicative examples were given, including direction-of-arrival estimation,
along with present challenges and current trends in compressive sensing applications to the solution
of innovative and traditional antenna array challenges. In general, compressive sensing generates
few unknown numbers; however, it needs a comprehensive array model with exact knowledge of the
radiating element patterns to produce useful results. The technique is sparse with respect to the whole
array structure, and requires a priori information to recast it as the function of minimization of 1 norm.
However, these techniques are applicable only if the relationships between the data and the unknowns
satisfy the restricted isometry property (RIP). To overcome this challenge, the Bayesian compressive
sensing (BCS) approach is adopted. This technique was explored in many electromagnetic problems,
such as antenna design and synthesis [25,26], microwave imaging [27–30], and direction-of-arrival
estimation [31–33]. It is employed in this paper to estimate the number, magnitude, and location of
failures in antenna arrays from far-field measurements. The BCS approach was attempted to diagnose
large linear arrays [11], and more recently, planar array configurations [34]; however, no experimental
validation was reported. Hence, there is a need for a more reliable procedure tested experimentally
and via simulation, because experiments are fundamental tests of any given procedure.
Specifically, this work is an extension of that described in References [15,16]. The BCS method is
applied to both the simulated and measured far-field data of a millimeter-wave 100-element microstrip
patch antenna array in which failures were added intentionally. A new regularization technique
was unveiled and applied to field distribution in order to enhance the efficiency and reliability of
antenna array diagnosis. The proposed BCS-based approach is a better choice due to its fast nature
and robustness under different noise conditions. The key contributions of this paper are summarized
as follows:

1. The BCS technique was applied to diagnose a millimeter-wave planar array, and the result was
compared with other approaches reported in the literature.
2. The procedure shows high effectiveness and reliability with fewer measurement points compared
to the other methods, and is highly robust to additive noisy data. This was validated
experimentally and via simulations.

However, some boundary conditions were observed. The BCS-based approach detects diagnostic
problems with few measurements, provided prior knowledge of the reference array radiation pattern,
and the number of faults is relatively smaller than the array size. The remainder of this paper is
arranged as follows: Section 2 contains the problem formulation of antenna array diagnosis. Section 3
presents compressed sparse recovery methods. Resolution via the BCS-based approach is given in
Section 4. The numerical simulations are presented in Section 5. Diagnoses from experimental data are
presented and discussed in Section 6. Finally, some conclusions are drawn in Section 7.

2. Antenna Array Diagnosis Problem Formulation

Consider an antenna array in space (Figure 2a). The antenna radiated far-field is usually quantified
by phase and amplitude. The AUT is depicted in Figure 2b. All the parameters associated with the
AUT are marked with superscript “u”. Specifically, Eu ( x, y) is the tangential field situated on the
antenna aperture, i.e.,
Eu ( x, y) = Exu ( x, y) x̂ + Eyu ( x, y)ŷ, (1)

where Exu ( x, y) x̂ and Eyu ( x, y)ŷ are the x and y planes of the aperture’s electric field,
respectively. Far-field F u (r, θ, ∅) is the measured field on part of the hemispherical surface
(0 ≤ θ ≤ π/2, 0 ≤ φ ≤ 2π ) at radius r from the phase center of the AUT, and r > 2D2 /λ, where D is
the diameter. Also, the amplitude and phase of a reference array (RA; array without failures) shown

266
Electronics 2018, 7, 383

in Figure 2b are assumed to be available. Associated quantities are marked with superscript “o”.
Eo ( x, y) is the field on the aperture Σ of the reference array (RA) and Fo (r, θ, ∅) represents the far-field
radiation. For the differential antenna (DA) shown in Figure 2c, the tangential distribution E( x, y) on
the aperture Σ is equal to the difference between the field distributions of the reference array and the
antenna under test, and the corresponding far-field F(r, θ, φ) is expressed as the difference between the
fields of reference array (RA) and AUT as

E( x, y) = Eu ( x, y) − Eo ( x, y), (2)

F(r, θ, φ) = Fu (r, θ, φ) − Fo (r, θ, φ). (3)

*-+

Figure 2. Antenna array: (a) reference antenna without failures; (b) antenna under test (AUT);
(c) differential antenna (DA). The number of failures is 2 within the total element number N = 21.

267
Electronics 2018, 7, 383

The differential antenna gives a resulting problem in which only the corresponding area to the
field modification radiates as a result of failure. By visually monitoring the field distribution on the
DA, the identification of faulty elements in the AUT can be observed.

2.1. Number of Far-Field Measurement Points Required

BPA and MMA approaches require a large number of measurement points. In the differential
antenna, we assumed the ﬁeld was localized, i.e., the unknown we wanted to retrieve was very sparse,
as shown in Figure 2c. In practice, there are a fewer number of failures than the overall elements
N. Sparse recovery algorithms estimate x from a number of measurement points smaller than the
number of measurements required by the standard mechanisms. Hence, it is possible to theoretically
get a reduction in the number of measurement points. Fuch et al. [20] demonstrated this using total
variation (TV), mixed 1 /2 norm, and minimization of 1 . However, better methods/algorithms that
require fewer numbers of far-ﬁeld measurement points for faster array diagnosis are still in demand.

2.2. Signal-to-Noise Ratios (SNRs)

Total variation (TV), mixed 1 /2 norm, and minimization of 1 techniques are the leading
compressive techniques, and they exhibit low efﬁciency and reliability in antenna diagnosis for low
SNRs. This drawback fosters the need for a more robust diagnosis procedure in the presence of
noisy data.

3. Compressed Sparse Recovery Methods

The essence of matrix inversion regularization is to initiate a priori facts within the inversion.
An adequate approach is needed to get this regularization by approximately reducing the selected
norm q of x solution. Then, the optimization to be solved is

min X q subject to y − AX2 ≤ γ, (4)

where ∗ q represents lq norm, and γ is a function of noise and factors inﬂuencing the data. There are
various routines available to effectively solve the convex optimization problem of Equation (4) such
as References [25–27]. The three norms lq , selected based on a priori knowledge of the differential
antenna set-up with the diagnosis problem, can now be described for regularization of the inversion.
We applied them to conduct diagnosis of both the simulated and measured radiating antennas.

3.1. Total Variation (TV) Norm

Based on a priori knowledge that solution X has small discontinuities as a result of failures
present, in addition to the failures, we expect ﬁeld X to be leveled and almost zero. Hence TV norm is
a smooth function to regularize X [27]. Thus, minimizing TV norm is minimizing its gradient, which is
the effect of smoothing. Consider a two-dimensional complex dataset X ∈ C M× N ; TV norm gives

X TV = ∑ |Xm+1,n − Xm,n | + |Xm,n+1 − Xm,n | (5)

m,n

vec(∇ x X)1 + vec X∇y 1 .
Vec(X) generates vector N × M which holds the columns of X stacked beneath each other.
Gradient matrices ∇ x and ∇y are of M × M and N × N size, respectively, which are expressed as
⎡ ⎤
−1 1 0
⎢ .. .. ⎥
∇x = ⎣ . . ⎦, and
0 −1 1

268
Electronics 2018, 7, 383

⎡ ⎤
−1 0
⎢ .. ⎥
⎢ 1 . ⎥
⎢ ⎥
∇y = ⎢ ⎥.
⎢ .. ⎥
⎣ . −1 ⎦
0 1
Then, the optimization problem in Equation (4) transforms to

minX TV subjects to y − Avec(X)2 ≤ . (6)

3.2. The 1 Norm

Since there is a sparse solution X; then, a space of search could be drastically reduced by the
introduction of a priori knowledge in inversion. Speciﬁcally, the 1 -norm ( X 1 = ∑k | xk |) is the
leading convex surrogate of an acceptable estimate sparsity of the vector (quasi-norm 0 that calculates
nonzero occurrences of a given vector). As a result, 1 norm is an efﬁcient approach to enhance sparse
solutions [2,5,10,19]. The regularization problem is then

min X 1 subject to y − AX 2 ≤ . (7)

Minimizing 1 -norm imposes the pointwise sparsity of solution per sample xk of the ﬁeld on the
aperture of the DA.

3.3. Mixed 1 /2 Norm

The radiating aperture’s position and dimensions can also be taken. The solution X is grouped into
G groups X g , which corresponds to the individual radiating element’s aperture g. For a faulty element,
g
all regions of discretization xk in the aperture will be faulty (nonzero). Let vector X of dimension
M × N be divided into G non-overlapping groups depicted X g of size Ng , such as ∑G g=1 Ng = MN.
Hence, the mixed 1 /2 norm is given as
.
G G
g 2 g 2
X 1,2 = ∑ X g 2 = ∑ x1 + . . . + x Ng . (8)
g =1 g =1

The mixed 1 /2 norm has similar behavior to 1 norm on vector X 1 2 , . . . , X g 2 , . . . , X G 2 ;

it, therefore, induces group sparsity at the radiating aperture level. The regularized inversion problem
is then expressed as
minX1,2 subject to y − AX2 ≤ . (9)
X

4. Resolution via Bayesian Compressive Sensing

For a planar antenna configuration of N elements positioned at coordinates ( xn yn ),
n = 1, . . . , N, with error-free excitations αn , n = 1, . . . , N, beaming a familiar field
E(u, v), (where u = sinθcosϕ and v = sinθsinϕ), referencing a noisy case with element failure,
the estimated far-field radiation pattern of (AUT) is expressed as

N
6(ul , vl ) =
E ∑ β n e j λ ( xn ul +yn vl ) + vl ,
2π
(10)
n =1

where (ul , vl ) for l = 1, . . . , L is the angular location of the l-th angular sample, and vl is the noise effect
considered as Gaussian-distributed with zero mean and variance σ2 . β n , n = 1, . . . , N, is the failed
excitations vector, expressed as

269
Electronics 2018, 7, 383

hαn with probability Φ
βn = , n = 1, . . . , N. (11)
αn otherwise

h ∈ (0, 1) is the failure factor, while Φ is the rate of failure, and αn is the weighting coefﬁcient.
From knowledge of the difference in ﬁeld pattern, W (ul , vl ) = E(ul , vl ) − E 6(ul , vl ), l = 1, . . . , L,
array failures can be estimated by determining the minimum 0 − norm vector

Υ = {Υn = αn − β n ; n = 1, . . . , N }, (12)

which satisﬁes
W − ΨΥ = v. (13)

The aim is to determine the entries of the “failure” vector

W = { W (ul , vl ); l = 1, . . . , L}, v = {vl ; l = 1, . . . , L} from the prior knowledge of the difference
between the ﬁeld samples in Equation (10) on the AUT and those of the golden antenna with coefﬁcients
αn n = 1, . . . , N. Against the deterministic approaches aimed at retrieving the vector γ with minimum
0 − norm satisfying the condition of Equation (13), Ψ is the L × N radiation measurement matrix
expressed as
⎡ [ j2π ( x1 u1 +y1 v1 )] ⎤
e . . . e[ j2π ( x N u1 +y N v1 )]
⎢ .. .. .. ⎥
Ψ=⎣ . . . ⎦. (14)
e[ j2π ( x1 u L +y1 v L )] . . . e[ j2π ( x N u L +y N v L )]
Hence, the BCS technique (summarized in Figure 2) can be employed to determine the sparsest
solution Υ̂ to the problem
Υ̂ = arg max[ P(Υ|W )] , (15)
Υ

which gives
1 ΨT Ψ
Υ̂ = + diag f NP , (16)
σNP σNP

where T is the transpose operator, and σNP and f are the ﬁgures that are used to maximize the
NP
likelihood function
1

L σ, f = − Nlog2π + logC + W T C −1 W . (17)
2
Equation (11) is computed using RVM [29], with C = σ + ΨF −1 Ψ T , where F = diag( f ).
The implementation of the BCS technique (as shown in Figure 3) is summarized in Algorithm 1.

Algorithm 1. Proposed diagnostic procedure

a. Step 1—Array parameter selection and definition of problem: For each element (n = 1, . . . N), accumulate the
array field at N measurement points (reference antenna) and randomly sample the AUT field.
Appropriate the noise variance and threshold for the maximization of L σ, f .
b. Step 2—Radiation pattern measurement matrix Ψ definition. Input the parameters of Ψ.
c. Step 3—Posterior mode estimation. Maximize Equation (10) iteratively to estimate σ and f using the RVM
procedure [29].
d. Step 4—Source difference reconstruction. Determine W(ul , vl ) for l = 1, . . . , L.
e. Step 5—Failed field excitation reconstruction. Determine the vector of failed excitations
β n (n = 1, . . . , N) using Equation (11).

270
Electronics 2018, 7, 383

Figure 3. Flowchart of the proposed Bayesian compressive sensing (BCS) procedure.

5. Simulations and Analysis

Assessing the performance of the BCS algorithm, we consider an RA with N = 316 with Taylor
taper and peak sidelobe level of −25 dB. Assuming a complete failure (h = 0), the given percentage of
failed elements is Φ = 4%. To determine the detection error numerically, the index of detection can be
expressed as
2
∑ N Υn − Υ̂n
ζ = 100 × n=1N 2
, (18)
∑ n =1 | Υ n |
where Υn and Υ̂n , n = 1, . . . , N, are the real and predicted failure entries of vector Υ. The uniformly
sampled (k = 316 samples) far-field radiation pattern is within (u, v) space, with the signal-to-noise ratio
SNR = 30 dB. The configuration of the AUT is presented in Figure 4, while the excitation coefficients
β n of the failed array are equally shown. Figure 5 shows Υ̂n of the failure vector estimated via the
proposed technique. As expected, there is good correlation between the location and the number of
the real and predicted failed elements. Accuracy of the estimation was ascertained by a very small
index of detection figure of ζ = 3.83 × 10−3 .

271
Electronics 2018, 7, 383

Figure 4. Normalized array excitation of the array under test (AUT).

Figure 5. Normalized array excitation with estimated failure vector Υ̂ and location and amplitude
of entries.

To analyze the impact on performance metrics of the technique of the noise on far-ﬁeld patterns,
we ranged the SNR between 0 dB and 100 dB. Figure 6 shows the obtained result. The estimation error
ζ is high for low SNRs irrespective of failed element percentage. Also, for higher SNR, the robustness
of the approach increases, which shows that the best performances are attained at Φ = 3%. The impact
of the percentage of failed elements on the performance of the method proposed was also assessed by
varying the percentage of the failed elements from Φ = 2% to Φ = 20% (see Figure 7). Expectedly,
the performance metrics of the approach reduced for a higher percentage of failed elements, even for
very low noise levels. Conversely, the approach achieved a degree acceptable accuracy until about
10% of damaged elements at any SNR. This result validates the efﬁciency of the BCS technique in the
diagnosis of sparse failures in arrays.

272
Electronics 2018, 7, 383

Figure 6. Detection index against signal-to-noise ratio (SNR) for different failure percentages.

Figure 7. Detection index against failure rate for various SNRs.

Example Using Full-Wave Simulation Set-Up

A 10 × 10 microstrip patch antenna array with an aperture size of 31 × 31 mm2 operating at
28.9 GHz was designed, as shown in Figure 8. We designed and computed the radiation pattern and
S-parameter (Figure 9) of the antenna using full-wave three-dimensional (3D) EM software Ansys
HFSS v. 17. The elements were uniformly spaced along the x and y directions. Each element had an
excitation port. Practically, measurements are made impure by noise; hence, a Gaussian noise n was
q
added to the data on both radiation patterns as yn = yq + nq , with q = {r, d}. The noise level was
determined by signal-to-noise ratio (SNR) defined from the maximum received field magnitude fitting
the dynamic measurement range. The noise was estimated as

N (0, 1) + jN (0, 1)
nq = √ max |yq | × 10−SNRdB /20 , (19)
2

273
Electronics 2018, 7, 383

where N (0, 1) is a Gaussian random vector of mean 0 and standard deviation 1.

The faulty elements cause low gain, high sidelobe level, wider beamwidth, lower front-to-back
ratio, and a boresight pointing error. The effects are shown in Figure 9b. A Rogers 5880 dielectric
substrate with 20 mm thickness and 3.48 dielectric constant was used for the antenna design because it
has low signal loss, low dielectric loss, low outgassing (which is good for space applications), and cheap
circuit fabrication. The total number of elements in the array was 100. The length and width of a single
patch were estimated to be 7.4 mm and 9.5 mm, respectively. Elements were uniformly spaced by 8.947
mm and 6.847 mm along x and y, respectively.
All elements were fed with the same excitation value which equal to one to emulate the array
without failure (reference array). Then K failures (in this case, the estimated percentage of failure rate)
were also initiated intentionally by making the excitation equal to zero in order to model the AUT
effectively. At first, we considered the reference array, i.e., the array without failures. The excitation
coefficients are depicted in Figure 10, and the reconstructed excitation coefficients are presented in
Figure 11. Also, for quantitative knowledge of the error estimated, the computed excitation error in dB
is shown in Figure 12. The result indicates an exact reconstruction in the case of the reference array,
and shows a low probability of a false alarm.

Figure 8. Designed microstrip patch antenna array in Ansys HFSS for diagnosis.

Also, considering an AUT with K = 5 element failures (Φ = 5%) (elements with zero excitation)
because failed elements are usually of small number in practice, the resulted excitation coefﬁcients
are presented in Figure 13, and the estimated excitations by 30 random noisy measurement points are
presented in Figure 14, while the dB excitation error is depicted in Figure 15. The result is an indication
of good estimation of RA excitations and the locations of the faulty elements.

274
Electronics 2018, 7, 383

*-+

Figure 9. The antenna array for diagnostic purposes: (a) S-parameter; (b) RA radiation pattern; (c)
AUT radiation pattern.

275
Electronics 2018, 7, 383

Figure 10. Excitation ﬁeld of reference array.

Figure 11. Reconstructed excitation ﬁeld of reference array employing 30 random noisy
measurement points.

Figure 12. Reference array reconstructed excitation error in dB by 30 random noisy measurement points.

276
Electronics 2018, 7, 383

Figure 13. Reference excitation ﬁeld of AUT with Φ = 5%.

Figure 14. AUT with K = 5 failures Φ = 5%: reconstructed excitation error ﬁeld by 30 random noisy
measurement points.

Figure 15. AUT with K = 5 failures Φ = 5%: reconstructed excitation error in dB by 30 random noisy
measurement points.

277
Electronics 2018, 7, 383

6. Antenna Array Diagnosis from Measured Data

6.1. Measurement Set-Up

The proposed diagnostic technique was subjected to an experimental test, as presented in
Figure 16. Although, this was a controlled environment (anechoic chamber), a more practical condition
(uncontrolled environment) was also possible using the same set-up in Figure 16, without the chamber.
The AUT was a 10 × 10 microstrip patch antenna array (see Figure 17a), reradiating a signal tilted in
the two planes. Figure 17b,c show the measured radiation patterns for an ideal antenna and defected
antenna, respectively. It can be see that the failure causes higher sidelobe level, reduced gain, lower
front-to-back ratio (FBR), wider beamwidth, and a boresight pointing error.

Figure 16. Schematic of experimental set-up.

The antenna was particularly designed and fabricated for this purpose, and each element had its
feeding port which was excited using a power divider. Five radiating elements in the array were not
excited (zero excitation) to successfully emulate the failure of elements. The AUT set-up is depicted in
Figure 18. About 1000 co-polar and cross-polar measurements were taken on the far-field half-sphere
at 28.9 GHz in an anechoic chamber (see Figure 18).
Radiation pattern measurements obtained from the array with five faulty elements were fed into
the proposed algorithm for post-processing. The reconstructed excitation error which identified the
specific faulty elements is depicted in Figure 19, and the corresponding dB equivalence is shown
in Figure 20. Moreover, the performance metrics of the BCS-based approach were experimentally
tested, and are shown in Figure 21. Figure 21a shows the obtained reconstruction error versus
the measurement number at different degrees of failure. The error decreased as the number of
measurement points increased irrespective of failure percentage. The reconstruction error profile
increased with increased failure rate. In Figure 21b, it is demonstrated how the reconstruction error
profile changes with SNR for different failure rates. The reconstruction error degraded exponentially
with increased SNR independent of failure rate. The reconstruction error increased with increased
failure rate. Figure 21c depicts reconstruction error versus different levels of failure for various SNRs.
It can be observed that the error increased with increased failure rate. Also, the reconstruction error
decreased as SNR increased.

278
Electronics 2018, 7, 383

*-+

Figure 17. Photograph of (a) the fabricated antenna array, (b) measured radiation pattern without fault,
and (c) measured radiation pattern with emulated failures.

279
Electronics 2018, 7, 383

Figure 18. Measurement set-up of AUT for diagnostic purpose.

The imperfection of the curves (compared to the simulation) could be attributed to measurement
errors, and errors due to experimental set-ups which provide different conditions from those in the
simulations. The experiment was conducted in an anechoic chamber, which is a controlled environment.
Hence, the results presented here may show a little variation if the experiment is conducted in a
more practical environment (i.e., uncontrolled environment). Moreover, the BCS-based procedure
presented here can be trusted to effectively and reliably address sparse recovery problems, particularly
the detection of faulty radiators in planar arrays for next-generation 5G wireless communications.
Once the suitable data are collected, and used to diagnose the array, then the array feeding network
can be recalibrated to restore the needed radiation features via excitation reconﬁguration of the healthy
antenna elements. However, prior knowledge of the golden array must be provided, and the failure
rate is relatively smaller than the array size. Therefore, from the simulation results veriﬁed by the
experiment, the BCS-based approach is adequate and reliable for noisy data. This technique overcomes
the shortcomings of BPA, MMA, etc., demanding off-line phase training to form accurate mapping
between the response of the AUT and the failure location. Hence, the proposed procedure will be
highly useful for millimeter-wave planar array optimal performance.

Figure 19. Reconstructed excitation error ﬁeld by 30 random noisy measurement points with
20 dB signal-to-noise-ratio.

280
Electronics 2018, 7, 383

Figure 20. Reconstructed excitation error in dB by 30 random noisy measurement points.

6.2. Antenna Array Diagnosis from Simulated and Measured Far-Field Radiation Patterns
There are differences between the simulated and measured antenna patterns due to measurements
errors, uncontrollable array fabrication errors, and experimental set-ups that give different conditions
from the simulations. For example, in our design, a finite flange was employed to feed the ground
plane. Hence, the induced current on flange rescattered and redistributed somehow against the very
large ground plane which was used for the simulations. Antenna array diagnosis procedures based
on simulated radiation pattern (such as References [3,16–18,27,35]) may not be reliable and accurate,
except when tested with the corresponding measured data. Although, in this work, simulation
and measurement data exhibit little difference in field intensity of the identified faulty elements
(in Figures 14 and 19, respectively) caused by the electromechanical coupling effect. In general,
the proposed BCS-based approach shows good reliability and accuracy against both simulated and
measured far-field radiation patterns.

6.3. Number of Measurement Points versus Noisy Measured Data

Measurement points affect field reconstruction fidelity and, hence, the scheme of diagnosis.
The proposed BCS-based procedure performed well despite significantly reduced measurement data
due to added sparse information. According to the study and experiments performed by Fuch et al. [14],
total variation (TV), mixed 1 /2 norm, and minimization of 1 techniques require 64 measurement
points for accurate reconstruction and diagnostics, compared to the BCS technique that requires 30 or
less measurement points. The proposed method significantly reduces the number of measurements
needed for diagnosis as compared to those three approaches. Since the speed of diagnosis is inversely
related to the required number of measurement points, the BCS approach enables a faster diagnosis of
antenna arrays.
Also, according to the diagnostic procedures proposed by Fuch et al. [14], total variation (TV),
mixed 1 /2 norm, and minimization of 1 techniques can only accommodate measured data with
the lowest SNR of 40 dB. A lower SNR results in bad diagnostics for the three procedures. However,
there are practical measurements that exhibit SNRs lower than 40 dB which require diagnosis. To this
advantage, the proposed BCS approach was theoretically and practically used to diagnose antenna
arrays from measurement data with 20 dB SNR. It can equally adapt to measured data with lower
SNR. The comparison is summarized in Table 1. The BCS approach requires a few seconds more
computational time; however, this is very small with respect to the measurement time cost. Therefore,

281
Electronics 2018, 7, 383

the proposed method shows its robustness to noisy measured data, and a reliable diagnosis was
obtained for low SNRs.

*-+

Figure 21. Experimental performance assessment of the proposed BCS-based diagnostic procedure.
(a) Reconstruction error versus number of measurements, (b) Reconstruction error against SNR,
(c) Reconstruction error with different failure rate

282
Electronics 2018, 7, 383

Table 1. Comparison between Bayesian compressive sensing (BCS)-based approach and previous
compressive sensing techniques. SNR—signal-to-noise ratio.

This Work Minimization of 1 Total Variation (TV) Mixed 1 /2 Norm

Approach
(BCS-Based) [14,36] [14,37] [14,38]
Required signal characteristics Amplitude Amplitude or Phase Amplitude or Phase Amplitude or Phase
Minimum SNR (dB) 20 40 40 40
Nature More complex Simpler Simple Complex
Required measurement points 30 64 64 64
Post-processing time (s) 7.6 0.4 2.0 4.9

7. Conclusions
A faster and robust antenna array diagnosis procedure from far-field radiation pattern
measurement points using Bayesian compressive sensing (BCS) approach was proposed in this paper.
Previous compressive sensing procedures exhibit shortcomings based on reliability with noisy data,
and require a large number of far-field measurement points. The proposed method solves these
problems by formulating planar array diagnosis within the concept of the BCS framework, resolved
using fast relevance machine (RVM). We are not the first to apply the BCS approach to antenna
array diagnosis. It was applied only to linear configurations in References [15,16] without practical
measurements, which are fundamental for testing any procedure. To the best knowledge of the
authors, this is the first attempt to apply the BCS approach to planar antenna array diagnosis from
far-field measurement points, validated with experimental measurements. Diagnoses from simulated
and measured far-field points from a designed microstrip patch antenna array show the method’s
robustness to additive data noise, as well as its reconstruction accuracy and faster diagnosis speed,
which is desired in practical applications. Hence, the proposed method is a better practical choice
whenever an efficient, faster, and more reliable antenna array diagnosis (testing) is needed.
Also, it is important to comment on the choice of sampling strategy. We considered a random
selection of measurement points from a uniform lattice. The choice of the sampling technique is not
critical because it affects all the techniques in the same manner at the far-field. However, it was pointed
out, from a non-uniform near-field lattice, that proper non-uniform random sampling (NURS) using
a priori information on the problem provides meaningful reduction in the cardinality of the set of
measured data compared to uniform random sampling and random sampling from a λ/2 equispaced
dataset [39]. Moreover, the BCS-based technique was compared to other methods using the data
reported in the literature. In the future, we will compare different techniques using experimental data
from controlled and uncontrolled environments, and the same parameters in order to quantify the
error affecting the result of different techniques. For example, we will determine what happens if we
use 30 measurements instead of 64 measurements in the experimental data using 1 minimization,
i.e., the same number of data used by the BCS, as well as the error compared to BCS. A complete
comparison among the techniques using real data is still absent in the present literature.

Author Contributions: Conceptualization, methodology, writing-review and editing, original draft preparation,
O.J.F.; resources, Z.Z.; visualization, software, A.F. and R.Z.; supervision, F.L.
Funding: This research received no external funding. The APC was funded by University of Science and
Technology of China.
Acknowledgments: The majority of the work was conducted at MESIC (a joint lab of USTC and IMECAS),
and partially carried out at the USTC Center for Micro and Nanoscale Research and Fabrication. The authors
would like to thank the Information Science Laboratory Center of USTC for software and hardware services.
The authors acknowledge the Applied Electromagnetics and Microwave Engineering Group of Hefei Normal
University for the provision of the anechoic chamber. The support of the Chinese Academy of Science and the
World Academy of Science (CAS-TWAS) is appreciated. The authors appreciate Zhang [40] and the CVX research
group [41] for making useful codes accessible online.
Conﬂicts of Interest: The authors declare no conﬂict of interest.

283
Electronics 2018, 7, 383

References
1. Oliveri, G.; Donelli, M.; Massa, A. Linear array thinning exploiting almost difference sets. IEEE Trans.
Antennas Propag. 2009, 57, 3800–3812. [CrossRef]
2. Oliveri, G.; Manica, L.; Massa, A. ADS-based guidelines for thinned planar arrays. IEEE Trans.
Antennas Propag. 2010, 58, 1935–1948. [CrossRef]
3. Carlin, M.; Oliveri, G.; Massa, A. On the robustness to element failures of linear ADS-thinned arrays. IEEE
Trans. Antennas Propag. 2011, 59, 4849–4853. [CrossRef]
4. 5G Vision White Paper, DMC R&D Center. Samsung Electronics Co., Ltd., 2015. Available online:
http://www.samsung.com/global/business-images/insights/2015/Samsung-5G-Vision-0.pdf (accessed
on 4 October 2018).
5. 5G: A Technological Vision, HUAWEI white Paper. Huawei Technologies Co.: Shenzhen. Available online:
http://www.huawei.com/5gwhitepaper/ (accessed on 4 October 2018).
6. Frenzel, L. Ka-Band Transceiver Implements MIMO and Beamforming for 5G. Available online:
Electronicdesign.com (accessed on 8 October 2018).
7. Lee, J.J.; Ferrer, E.M.; Woollen, D.P.; Lee, K.M. Near-field probe used as a diagnostic tool to locate defective
elements in an array antenna. IEEE Trans. Antennas Propag. 1988, 36, 884–889. [CrossRef]
8. Gattoufi, L.; Picard, D.; Samii, Y.R.; Bolomey, J.C. Matrix method for near-field diagnostic techniques of
phased array antennas. In Proceedings of the IEEE International Symposium on Phased Array Systems and
Technology, Boston, MA, USA, 15–18 October 1996; pp. 52–57.
9. Bucci, O.M.; Migliore, M.D.; Panariello, G.; Sgambato, G. Accurate diagnosis of conformal arrays from
near-field data using the matrix method. IEEE Trans. Antennas Propag. 2005, 53, 1114–1120. [CrossRef]
10. Candes, E.J.; Romberg, J.; Tao, T. Robust uncertainty principles: Exact signal reconstruction from highly
incomplete frequency information. IEEE Trans. Inf. Theory 2006, 52, 489–509. [CrossRef]
11. Donoho, D.L.; Tanner, J. Precise undersampling theorems. Proc. IEEE 2010, 98, 913–923. [CrossRef]
12. Migliore, M.D.; Fuchs, B.; le Coq, L.; Ferro-Famil, L. Compressed sensing approach for reflectarray diagnostic
from far field measurements. In Proceedings of the European Microwave Conference, Paris, France,
9–11 September 2015; pp. 289–292.
13. Massa, A.; Rocca, P.; Oliveri, G. Compressive sensing in electromagnetics—A review. IEEE Antennas
Propag. Mag. 2015, 57, 224–238. [CrossRef]
14. Fuchs, B.; Coq, L.L.; Migiliore, M.D. Fast antenna array diagnosis from a small number of far-field
measurements. IEEE Trans. Antennas Propag. 2016, 64, 2227–2235. [CrossRef]
15. Migliore, M.D. A compressed sensing approach for array diagnosis from a small set of near-field
measurements. IEEE Trans. Antennas Propag. 2011, 59, 2127–2133. [CrossRef]
16. Oliveri, G.; Rocca, P.; Massa, A. Reliable diagnosis of large linear arrays—A Bayesian compressive sensing
approach. IEEE Trans. Antennas Propag. 2012, 60, 4627–4636. [CrossRef]
17. Migliore, M.D. Array diagnosis from far-field data using the theory of random partial Fourier matrices.
IEEE Trans. Antennas Wirel. Propag. Lett. 2013, 12, 745–748. [CrossRef]
18. Fuchs, B.; Migliore, M.D. Accurate array diagnosis from near-field measurements using reweighted
minimization. In Proceedings of the IEEE Antennas Propagation Symposium, Orlando, FL, USA, 7–13 July
2013; pp. 2255–2256.
19. Fuchs, B.; le Coq, L.; Ferro-Famil, L.; Migliore, M.D. Comparison of methods for reflectarray diagnostic from
far field measurements. In Proceedings of the IEEE International Symposium on Antennas Propagation,
Boston, MA, USA, 19–25 July 2015; pp. 398–399.
20. Donoho, D.L. Compressive sensing. IEEE Trans. Inf. Theory 2006, 52, 1289–1306. [CrossRef]
21. Li, Q.; Zhang, Y.; Dai, K.; Huang, L.; Pan, C.; Song, J. An adaptive diagnosis scheme for integrated
radar-communication antenna array with huge noise. IEEE Access 2018, 6, 25785–25796. [CrossRef]
22. Eltayeb, M.E.; Al-Naffouri, T.; Heath, R.W., Jr. Compressive sensing for millimeter wave antenna array
diagnosis. IEEE Trans. Commun. 2018, 66, 2708–2721. [CrossRef]
23. Li, W.; Deng, W.; Migiliore, M.D. A deterministic far-field sampling strategy for array diagnosis using sparse
recovery. IEEE Antennas Wirel. Propag. Lett. 2018, 17, 1261–1265. [CrossRef]

284
Electronics 2018, 7, 383

24. Massa, A.; Bertolli, M.; Gottardi, G.; Hannan, A.; Marcantonio, D.; Oliver, G.; Polo, A.; Robol, F.;
Rocca, P.; Viani, F. Compressive sensing as applied to antenna arrays: Synthesis, diagnosis, and processing.
In Proceedings of the IEEE International Symposium on Circuits and Systems (ISCAS), Florence, Italy,
27–30 May 2018.
25. Oliveri, G.; Massa, A. Bayesian compressive sampling for pattern synthesis with maximally sparse
non-uniform linear arrays. IEEE Trans. Antennas Propag. 2011, 59, 467–481. [CrossRef]
26. Oliveri, G.; Carlin, M.; Massa, A. Complex-weight sparse linear array synthesis by Bayesian Compressive
Sampling. IEEE Trans. Antennas Propag. 2012, 60, 2309–2326. [CrossRef]
27. Jaehyun, P.; Lee, J.; Chun, J.; Kwak, S. Robust monopulse beam synthesis with sparse elements in linear and
planar arrays with element failure detection. IET Radar Sonar Navig. 2017, 11, 1251–1258.
28. Poli, L.; Oliveri, G.; Massa, A. Microwave imaging within the first-order Born approximation by means of the
contrast-field Bayesian Compressive Sensing. IEEE Trans. Antennas Propag. 2012, 60, 2865–2879. [CrossRef]
29. Poli, L.; Oliveri, G.; Rocca, P.; Massa, A. Bayesian compressive sensing approaches for the reconstruction
of two-dimensional sparse scatterers under TE illuminations. IEEE Trans. Geosci. Remote Sens. 2013, 51,
2920–2936. [CrossRef]
30. Poli, L.; Oliveri, G.; Viani, F.; Massa, A. MT-BCS-based microwave imaging approach through
minimum-norm current expansion. IEEE Trans. Antennas Propag. 2013, 61, 4722–4732. [CrossRef]
31. Carlin, M.; Rocca, P. A Bayesian compressive sensing strategy for direction-of-arrival estimation.
In Proceedings of the 6th European Conference on Antennas Propag. (EuCAP 2012), Prague, Czech Republic,
26–30 March 2012; pp. 1508–1509.
32. Carlin, M.; Rocca, P.; Oliveri, G.; Viani, F.; Massa, A. Directions-of-arrival estimation through Bayesian
compressive sensing strategies. IEEE Trans. Antennas Propag. 2013, 61, 3828–3838. [CrossRef]
33. Carlin, M.; Rocca, P.; Oliveri, G.; Massa, A. Bayesian compressive sensing as applied to directions-of-arrival
estimation in planar arrays. J. Elect. Comput. Eng. 2013, 2013, 1–12. [CrossRef]
34. Salucci, M.; Gelmini, A.; Oliver, G.; Massa, A. Planar array diagnosis by means of an advanced Bayesian
compressive processing. IEEE Trans. Antenna Propag. 2018, 66, 5892–5906. [CrossRef]
35. Oliveri, G.; Rocca, P.; Massa, A. A Bayesian-compressive-sampling-based inversion for imaging sparse
scatters. IEEE Trans. Geosci. Remote Sens. 2011, 49, 3993–4006. [CrossRef]
36. Candès, E.J.; Wakin, M.B.; Boyd, S.P. Enhancing sparsity by reweighted minimization. J. Fourier Anal. Appl.
2008, 14, 877–905. [CrossRef]
37. Rudin, L.; Osher, S.; Fatemi, E. Nonlinear total variation based noise removal algorithms. Phys. D 1992, 60,
259–268. [CrossRef]
38. Bach, F.; Jenatton, R.; Mairal, J.; Obozinski, G. Optimization with sparsity-inducing penalties.
Found. Trends Mach. 2012, 4, 1–106. [CrossRef]
39. Migliore, M.D. On the sampling of the electromagnetic field radiated by sparse sources. IEEE Trans.
Antennas Propag. 2014, 63, 553–564. [CrossRef]
40. Zhang, Y.; Yang, J.; Yin, W. YALL1: Your ALgorithms for L1. Available online: Yall1.blogs.rice.edu (accessed
on 8 October 2018).
41. CVX Research, Inc. CVX: Matlab Software for Disciplined Convex Programming, Version 2.0 Beta. 2012.
Available online: http://cvxr.com/cvx (accessed on 8 October 2018).

285
electronics
Article
Mobile-Phone Antenna Array with Diamond-Ring
Slot Elements for 5G Massive MIMO Systems
Naser Ojaroudi Parchin 1, *, Haleh Jahanbakhsh Basherlou 2 , Mohammad Alibakhshikenari 3 ,
Yasser Ojaroudi Parchin 4 , Yasir I. A. Al-Yasir 1 , Raed A. Abd-Alhameed 1 and Ernesto Limiti 3
1 Faculty of Engineering and Informatics, University of Bradford, Bradford BD7 1DP, UK;
[email protected] (Y.I.A.A.-Y.); [email protected] (R.A.A.-A.)
2 Bradford College, Bradford BD7 1AY, UK; [email protected]
3 Electronic Engineering Department, University of Rome Tor Vergata, Via del Politecnico 1, 00133 Rome, Italy;
[email protected] (M.A.); [email protected] (E.L.)
4 Young Researchers and Elite Club, Germi Branch, Islamic Azad University, Germi 63764-56517, Iran;
[email protected]
* Correspondence: [email protected]; Tel.: +44-7341436156

Received: 31 March 2019; Accepted: 7 May 2019; Published: 10 May 2019

Abstract: A design of mobile-phone antenna array with diamond-ring slot elements is proposed for
fifth generation (5G) massive multiple-input/multiple-output (MIMO) systems. The configuration of
the design consists of four double-fed diamond-ring slot antenna elements placed at different corners
of the mobile-phone printed circuit board (PCB). A low-cost FR-4 dielectric with an overall dimension
of 75 × 150 mm2 is used as the design substrate. The antenna elements are fed by 50-Ohm L-shaped
microstrip-lines. Due to the orthogonal placement of microstrip feed lines, the diamond-ring slot
elements can exhibit the polarization and radiation pattern diversity characteristic. A good impedance
bandwidth (S11 ≤ −10 dB) of 3.2–4 GHz has been achieved for each antenna radiator. However, for
S11 ≤ −6 dB, this value is 3–4.2 GHz. The proposed design provides the required radiation coverage
of 5G smartphones. The performance of the proposed MIMO antenna design is examined using both
simulation and experiment. High isolation, high efficiency and sufficient gain-level characteristics
have been obtained for the proposed MIMO smartphone antenna. In addition, the calculated total
active reflection coefficient (TARC) and envelope correlation coefficient (ECC) of the antenna elements
are very low over the whole band of interest which verify the capability of the proposed multi-antenna
systems for massive MIMO and diversity applications. Furthermore, the properties of the design in
Data-mode/Talk-mode are investigated and presented.

Keywords: 5G; diamond-ring slot; dual-polarized antenna; massive MIMO; mobile-phone antenna;
pattern diversity

1. Introduction
Nowadays, there is an increased interest in research on MIMO systems in wireless
communication [1,2]. It has incomparable advantages in improving the wireless link transmission
capacity and reliability. In MIMO systems, multiple antennas are deployed at both transmitter and
receiver sides [3]. This technology is a key component and probably the most established to truly reach
the promised transfer data rates of ﬁfth generation (5G) communication systems [4,5]. MIMO antennas
are important for increasing channel capacity and link reliability [6,7]. Standard MIMO networks tend
to use two or four antennas in a single physical package. However, massive MIMO is a MIMO system
with an especially high number of antennas [8]. A 2 × 2 MIMO system has been successfully applied
for fourth generation (4G) mobile communications and it is expected that the massive MIMO system
with a large number of MIMO antennas is very promising for 5G wireless communications [9]. The

Electronics 2019, 8, 521; doi:10.3390/electronics8050521 287 www.mdpi.com/journal/electronics

Electronics 2019, 8, 521

greater number of elements in a network will make it more resistant to interference and intentional
jamming [10].
Among the antennas which are used for MIMO applications, printed antennas are more appropriate
due to their low cost, easy fabrication and their capability of easily being integrated to small terminal
devices [11]. However, placing multiple antennas in the limited space of a transceiver poses a significant
challenge in the incorporation of the MIMO technique. According to the requirement of cellular
communications, compact, wideband and high isolation MIMO antenna is an urgent demand in the
future mobile terminal and the portable applications [12–17]. Recently, several techniques have been
introduced to design massive MIMO antennas for 4G and sub-6 GHz 5G mobile terminals [18–27].
We propose here a new design of Eight-port mobile-phone antenna with compact dual-polarized
radiation elements providing wide impedance bandwidths for 5G applications. Eight-element MIMO
smartphone antenna can achieve the channel capacity of 37 bps/Hz which is close to eight times
that of a single antenna for single-input/single-output operation. With such a channel capacity and
a wide frequency spectrum (200 MHz, at least), the data rate can be much higher than 1 Gbps.
The antenna is designed to operate at 3.6 GHz, a candidate frequency band for sub-6-GHz 5G
cellular networks, proposed by Ofcom, UK [28]. The design configuration contains four elements of
double-fed/dual-polarized slot-ring antennas placed at corners of the printed circuit board (PCB). The
antenna elements exhibit wide impedance bandwidth with low mutual coupling function providing
pattern and polarization diversity characteristics at different sides of the mobile-phone PCB. As
a result, the design not only can provide full radiation coverage but also it can support different
polarizations. The computer simulation technology (CST) software was used to investigate antenna
characteristics [29]. Fundamental properties of the single-element and its MIMO design in terms of S
parameters, efficiency, radiation pattern, envelope correlation coefficient (ECC), total active reflection
coefficient (TARC) are investigated. In addition, the performance of the designed mobile-phone
antenna in Data-Mode/Talk-Mode are studied.

2. Single-Element Diamond-Ring Slot Antenna

Figure 1 depicts the configuration of the dual-polarized diamond-ring antenna. The antenna is
designed on an FR-4 substrate (h = 1.6 mm, ε = 4.4, and δ = 0.025).
Its configuration contains a diamond-ring slot radiator with a pair of L-shaped microstrip feed
lines. Parameter values of the designed antenna and its MIMO configuration are specified in Table 1.
The motive behind the presented design is to achieve a dual-polarized wideband antenna with
compact-size and capability of integration onto smartphone PCB. This has been achieved by using
the diamond-ring slot antenna with L-shaped microstrip feed lines. The slot antenna is one type
of printed antenna that has been investigated extensively for different wireless systems for several
decades because of its attractive features including light-weight, compactness and ease of integration
with radio frequency (RF) circuit [30]. The ring-slot antenna can excite two orthogonal polarization if it
is fed differently [31]. This makes the printed-ring-slot antennas attractive. The resonant frequency of
the antenna is mainly determined by the circumference length of the employed diamond-ring slot.
Therefore, the circumference length of the diamond-ring slot needs to satisfy the dielectric wavelength
at the corresponding frequency point, where Wx /2 + g = λ. However, the length of feed-line (Lf + L1 )
also has a little effect on the frequency point and impedance-bandwidth of the design.

288
Electronics 2019, 8, 521

( )

(a)

(b) (c)

Figure 1. The antenna schematic, (a) side view, (b) top and (c) bottom layers.

Table 1. Parameter values of the single-element antenna and its MIMO array design.

Value Value Value Value

Parameter Parameter Parameter Parameter
(mm) (mm) (mm) (mm)
WS 75 LS 150 x 1 W 24
L 24 Wf 3 Lf 5.25 W1 2.5
L1 3.9 W2 8.9 L2 9.9 Wx 7.25

Configurations and S parameters of the square-ring slot with rectangular feed line, a diamond-ring
slot with rectangular feed line, and the proposed diamond-ring slot radiator with L-shaped feed line
are illustrated and compared in Figure 2a–c, respectively. It can be observed that by using the proposed
design (Figure 2c), the antenna not only provides wider impedance bandwidth but it also exhibits high
isolation with low mutual coupling characteristic (less than −20 dB) at the desired operation band. As
shown, the operation frequency of the slot radiator with L-shaped feed lines spans from 3.2 to 4 GHz
(800 MHz bandwidth). For S11 ≤ −6 dB, this value is 3–4.2 GHz.
S11 characteristics of varying design parameters including Wx , L1 , x and W1 are illustrated in
Figure 3. Figure 3a depicts the effects of diamond-ring size (Wx ) on the resonance frequency: when
its size decreases from 8 to 6 mm, the antenna resonance varies from 3.2 to 4.6 GHz. The frequency
resonance of the antenna is also affected by the size of the L-shaped feed line arm (L1 ). As shown
in Figure 3b, the antenna operation frequency tunes to lower frequencies (without any changes on
its bandwidth or isolation). Figure 3c illustrates the S11 results for various values of x (width of the
diamond-ring slot-line). As shown, it mainly affects the impedance bandwidth of the antenna: when its
size changes from 0.5 to 2 mm, the antenna operation bandwidth varies from 0.3 to 1.2 GHz. Another
important parameter of the dual-polarized diamond-ring slot antenna design is the length of the feed
line arm (L1 ) which tunes the isolation characteristic. As can be observed from the results shown in
Figure 3d, the antenna reflection coefficient tunes from −18 to less than −40 dB.

289
Electronics 2019, 8, 521

(a)

(b)

(c)

Figure 2. Various structures and S parameter results of: (a) square-ring slot with rectangular feed line,
(b) diamond-ring slot with rectangular feed line, and (c) the proposed diamond-ring slot radiator with
L-shaped feed line.

(a) (b)

Figure 3. S11 results of the diamond-ring antenna for various values of (a) Wx , (b) L1 , (c) x, and (d) W1 .

290
Electronics 2019, 8, 521

Figure 4 shows the surface current distributions in the ground plane of the antenna at 3.6 GHz. As
shown, the surface currents are mainly distributed around the diamond-ring slot radiator. In addition,
for the different feeding ports of the antenna, the currents densities are equal and opposite due to the
polarization diversity function [32–34]. Figure 5 illustrates the 3D radiation patterns of the antenna
when it is fed differently (Port 1 and Port 2). As seen, the antenna exhibits similar radiation patterns
with different orthogonal polarizations and more than 3 dB realized-gain. Radiation characteristics
of the dual-polarized diamond-ring slot antenna in terms of radiation efficiency, total efficiency, and
maximum gain are illustrated in Figure 6. As seen, the antenna provides high efficiencies. More than
80% radiation and total efficiency properties are obtained over the entire operation band (3.2–4 GHz).
It can be observed that the antenna exhibits almost similar radiation and total efficiency. In addition,
the antenna has around 2.5 dBi directivity.

(a) (b)

Figure 4. Simulated current densities at 3.6 GHz for (a) 1st feeding port and (b) 2nd feeding port.

(a) (b)

Figure 5. 3D views of the dual-polarized radiation patterns from (a) feeding port 1 and (b) feeding
port 2.

A prototype of the design was fabricated and its S parameters were tested. Figure 7 shows a
photograph of the fabricated prototype in the measurement setup. Figure 8 illustrates the measured
and simulated S parameter results of the fabricated antenna. It is observed that the fabricated antenna
works properly at the desired frequency range.

291
Electronics 2019, 8, 521

Figure 6. Radiation, total eﬃciencies, maximum gain of the diamond-ring slot antenna.

Figure 7. Photograph of the fabricated prototype in the measurement setup.

Figure 8. Measured and simulated S parameter results of the fabricated antenna.

292
Electronics 2019, 8, 521

3. Mobile-Phone Antenna Design

The simulated design layout of the mobile-phone antenna design is shown in Figure 9. It was
implemented on an FR4 substrate with an overall dimension of 75 × 150 mm2 . Four elements of the
dual-polarized diamond-ring slot radiators are employed at the corner of the mobile-phone PCB. As
can be observed, due to the compact size of the employed radiator, there is enough spaces in the
conﬁguration of smartphone antenna PCB to add other antennas covering diﬀerent frequencies of
3G/4G mobile terminals. Figure 10 illustrates the simulated S parameters (including Snn and Smn ) of
the design over its operation band. It is evident that the proposed mobile-phone antenna exhibits good
S parameters with wide bandwidth and low mutual coupling characteristics. As mentioned above, the
dual-polarized radiation elements provide similar performances. 3D radiation patterns of antennas 1
and 2 at 3.6 GHz is represented in Figure 11. As seen, the antenna elements have quasi-omnidirectional
radiation patterns covering the top and bottom portions of the mobile-phone PCB.

(a)

(b) (c)

Figure 9. Designed mobile-phone antenna (a) transparent view, (b) top-layer and (c) bottom layer.

293
Electronics 2019, 8, 521

(a)

(b)

Figure 10. (a) Snn and (b) Smn results of the mobile-phone antenna.

(a) (b)

Figure 11. 3D transparent views of the radiation patterns for (a) feeding port 1 and (b) feeding port 2.

Top-views of the radiation patterns for the proposed mobile-phone antenna design are displayed
in Figure 12. It can be seen that each side of the mobile-phone PCB has been covered with differently
polarized radiation patterns. Thus, the MIMO antenna exhibited good radiation coverage and
can support different polarizations which make it more suitable to be used in future smartphones.
Furthermore, the antenna provides high radiation and total efficiencies over the operation band, as

294
Electronics 2019, 8, 521

illustrated in Figure 13: more than 70% radiation and total efficiencies were obtained for the radiation
elements at 3.6 GHz.
The proposed mobile-phone antenna was fabricated and its characteristics were tested in the
Antenna Laboratory at the University of Bradford. Top and bottom views of the prototype are shown
in Figure 14a,b, respectively. The mobile-phone antenna is constructed on a cheap FR4 dielectric
with an overall dimension of 75 × 150 × 1.6 mm3 . During the measurement process, 50-Ω RF loads
are employed for the elements not under test to avoid their effects, as shown in Figure 14c. The
measured and simulated S parameters (Snn : S11 –S88 and Smn : S21 –S81 ) of the fabricated design are
illustrated in Figure 15. As illustrated, the diamond-ring slot resonators achieve good S parameters
with sufficient impedance bandwidth and low mutual coupling characteristic in the desired frequency
range. Some deviations from the measurements arise from the errors in fabrication, feeding and
experiment processes.

Figure 12. 3D radiation patterns of the ﬁfth generation (5G) mobile-phone antenna.

295
Electronics 2019, 8, 521

Figure 13. Eﬃciencies of the antenna elements (Ant. 1–Ant. 8) for the proposed design.

(a) (b)

(c)

Figure 14. (a) Top, (b) bottom views of the fabricated design and (c) the prototype connected to the
cables and 50-Ohm loads.

296
Electronics 2019, 8, 521

(a)

(b)

Figure 15. Measured and simulated (a) Snn (S11 –S88 ) and (b) Smn (S21 –S81 ) of the fabricated prototype.

According to the point that the radiation elements with the same placements and polarizations
provide similar radiation patterns, 2D polar radiation patterns of the adjacent resonators (including
Ant. 1 and Ant. 2) were measured at center operating frequency (3.6 GHz) and illustrated in Figure 16.
As shown, the sample prototype exhibits good radiation patterns and provides acceptable agreement
with simulations. In addition, the antenna elements with diﬀerent polarizations provide suﬃcient gain
values at the center frequency of the operation band.

(a) (b)

Figure 16. Measured and simulated 2D radiation patterns for (a) Ant.1 and (b) Ant.2.

297
Electronics 2019, 8, 521

In order to ensure that the MIMO antenna can work properly, ECC and TARC characteristics
are two important parameters which should be considered in MIMO antennas [35,36]. The ECC and
TARC of two elements can be calculated from the S parameters using the formula described as:

2
S∗mm Snm + S∗mn Snn
ECC = ∗ , (1)
1 − |Smm |2 − |Smn |2 1 − |Snm |2 − |Snn |2
1
(Smm + Smn )2 + (Snm + Snn )2
TARC = − . (2)
2
Figure 17 shows the calculated ECC and TARC results from simulated and measured S parameters
of the mobile-phone antenna design. As evident from figures, the calculated ECC and TARC results
are very low over the whole band of interest. The design provides less than 0.01 ECC over the
entire operating band and proves that two adjacent antenna elements are irrelevant. In addition,
its TARC value is less than −30 at 3.6 GHz. Table 2 summarizes and compares the fundamental
characteristics of the presented mobile-phone antenna with the recently reported 5G mobile-phone
antenna designs [16–25]. It can be observed that the proposed design can provide better performances
in terms of efficiency, isolation and ECC. In addition, it exhibits wider bandwidth with pattern and
polarization diversity characteristics to cover different sides of the mobile-phone PCB.

(a)

(b)

Figure 17. Calculated (a) envelope correlation coefficient (ECC) and (b) total active reflection coefficient
(TARC) from measured S parameters.

298
Electronics 2019, 8, 521

Table 2. Comparison between the presented and recently reported 5G mobile-phone antennas.

Reference Bandwidth (GHz) Eﬃciency (%) Size (mm2 ) Isolation (dB) ECC
[18] 3.4–3.6 55–60 100 × 50 10 -
[19] 3.4–3.6 - 136 × 68 15 -
[20] 3.55–3.65 52–76 150 × 75 11 -
[21] 3.4–3.6 35–50 150 × 75 11 <0.40
[22] 3.4–3.6 30–50 145 × 70 15 <0.2
[23] 3.4–3.6 40–60 136 × 68 14 <0.2
[24] 3.4–3.6 60–75 150 × 80 17 <0.05
[25] 3.4–3.6 50–70 150× 73 17 <0.07
[26] 3.4–3.6 50–80 150 × 75 15 <0.2
[27] 3.4–3.6 60–70 150 × 75 18 <0.015
Proposed 3.3–3.9 60–80 150 × 75 17 <0.01

4. User-Hand/User-Head Impacts on the Mobile-Phone Antenna Performance

The impact of human-hand/human-head on the characteristics of the design in terms of total
efficiency and antenna realized-gain were investigated in this section [37–39]. As illustrated in Figure 18,
different scenarios including right-hand and left-hand modes for top-layer and back-layer of the design
are studied in simulations. According to the obtained results, the mobile-phone antenna design and
its radiators exhibit good performances and provide sufficient total efficiencies in the presence of the
human hand. Due to the symmetric configuration of the proposed design, it performs almost similarly
for different hand scenarios. The maximum reductions of the radiation properties are observed for
the radiation elements partially covered by the user hand which is due to the nature of hand tissue
properties which can highly absorb the radiation power. As can be observed, the antenna elements
provide 25–55% total efficiencies over the operation band of 3.2–4 GHz.

(a) (b)

Figure 18. Placement and total eﬃciencies of the design for diﬀerent user-hand scenarios
(a) right-hand/top-layer, (b) right-hand/back-layer, (c) left-hand/top-layer and (d) left-hand/back-layer.

3D radiation patterns of the mobile-phone antenna in Talk-Mode at 3.6 GHz are illustrated in
Figure 19a. It should be noted that the radiation performance of each antenna element mainly depends
on its locations in the Talk-Mode scenario. As shown, the realized gain characteristic of the design
varies from 3.2 to 4.9 dB. Compared with the radiation patterns in free space (Figure 12), due to the
existence of the user’s head and hand, radiation patterns are a bit distorted and become weaker. One

299
Electronics 2019, 8, 521

can see that antenna elements are touched by diﬀerent parts of the hand and head phantoms in the
presented Talk-Mode.

(a)

(b)

Figure 19. (a) 3D and (b) 2D linearly-scaled radiation patterns of the mobile-phone antenna in
Talk-Mode scenario.

The maximum reductions of the radiation properties are observed for the elements that are located
near to user-head (Ant. 3 and Ant. 4) [40]. However, the diﬀerence is not very signiﬁcant. The 2D-polar

300
Electronics 2019, 8, 521

(linear-scaling) radiation patterns of the design are illustrated in Figure 19b. As can be observed, the
directivity of the antenna radiation pattern is maximum in the opposite direction of the head, which is
most important part of the body to protect from the radiation. The main lobe of each single-element
directs most of the power while the other lobes should be negligible.
Figure 20 depicts the total efficiencies and reflection coefficient (Snn ) of the antenna elements in
the presence of the user-head and user-hand in Talk-Mode scenario. As seen, the diversion of the Snn
characteristic of the design is not significant. In addition, the proposed MIMO design exhibit sufficient
efficiency in its operation bandwidth. Based on the above analysis, we can conclude the MIMO design
provides sufficient efficiency, radiation coverage and gain levels for diamond-ring slot radiators.

(a) (b)

Figure 20. (a) Snn and (b) total eﬃciencies of the proposed smartphone antenna in Talk-Mode scenario.

5. Conclusions
A mobile-phone antenna design with dual-polarized radiators is proposed for 5G massive MIMO
communications. The antenna configuration contains eight-port/four elements of diamond-rings
slot radiators with L-shaped microstrip feed lines deployed at four corners of the PCB. The antenna
elements exhibit wide bandwidth with the center frequency of 3.6 GHz. S parameters, radiation
patterns, efficiency, ECC and TARC results of the design are studied and sufficient results are achieved.
In addition, a prototype of the mobile-phone antenna was fabricated and measured. Moreover, the
performances of the antenna in Hand-Mode and Talk-Mode scenarios are investigated. The obtained
results demonstrated that the proposed smartphone antenna provides good characteristics and meets
the requirements for use in future mobile handsets.

Author Contributions: Writing—original draft preparation, N.O.P., H.J.B., M.A., Y.O.P., Y.I.A.A.-Y., R.A.A.-A.,
and E.L.; writing—review and editing, N.O.P. and R.A.A.-A.; investigation, N.O.P., H.J.B., M.A., Y.O.P., Y.I.A.A.-Y.;
resources, N.O.P., R.A.A.-A., E.L. and; For other cases, all authors have participated.
Funding: This project has received funding from the European Union’s Horizon 2020 research and innovation
programme under grant agreement H2020-MSCA-ITN-2016 SECRET-722424.
Acknowledgments: The authors wish to express their thanks to the support provided by the innovation
programme under grant agreement H2020-MSCA-ITN-2016 SECRET-722424.
Conﬂicts of Interest: The authors declare no conﬂict of interest.

References
1. Jensen, M.; Wallace, J. A review of antennas and propagation for MIMO wireless communications. IEEE
Trans. Antennas Propag. 2004, 52, 2810–2824. [CrossRef]
2. Gesbert, D.; Shaﬁ, M.; Shiu, D.S.; Smith, P.J.; Naguib, A. From theory to practice: An overview of MIMO
space-time coded wireless systems. IEEE J. Sel. Areas Commun. 2003, 21, 281–302. [CrossRef]
3. Votis, C.; Christoﬁlakis, V.; Raptis, V.; Tatsis, G.; Chronopoulos, S.K.; Kostarakis, P. Design and analysis of
a multiple-output transmitter based on DDS architecture for modern wireless communications. AIP Conf.
Proc. 2010, 1203, 421–426.

301
Electronics 2019, 8, 521

4. Kammoun, A.; Debbah, M.; Alouini, M.S. Design of 5G full dimension massive MIMO systems. IEEE Trans.
Commun. 2018, 66, 726–740.
5. Osseiran, A.; Boccardi, F.; Braun, V.; Kusume, K.; Marsch, P.; Maternia, M.; Queseth, O.; Schellmann, M.;
Schotten, H.; Taoka, H.; et al. Scenarios for 5G mobile and wireless communications: The vision of the METIS
project. IEEE Commun. Mag. 2014, 52, 26–35. [CrossRef]
6. Angelis, C.T.; Chronopoulos, S.K. System performance of an LTE MIMO downlink in various fading
environments. In Proceedings of the Ambient Media and Systems, Berlin, Germany, 27–28 January 2011;
pp. 36–43.
7. Ojaroudiparchin, N.; Shen, M.; Pedersen, G.F. Multi-layer 5G mobile phone antenna for multi-user MIMO
communications. In Proceedings of the 23rd Telecommunications Forum (TELFOR), Belgrade, Serbia, 24–26
November 2015; pp. 559–562.
8. Hassan, N.; Fernando, X. Massive MIMO Wireless Networks: An Overview. Electronics 2017, 6, 63. [CrossRef]
9. Larsson, E.; Edfors, O.; Tufvesson, F.; Marzetta, T. Massive MIMO for next generation wireless Systems. IEEE
Commun. Mag. 2014, 52, 186–195. [CrossRef]
10. Yang, H.H.; Quek, T.Q.S. Massive MIMO Meet Small Cell; Springer Briefs in Electrical and Computer
Engineering; Springer International Publishing: Basel, Switzerland, 2017.
11. Sharawi, M.S. Printed MIMO Antenna Engineering; Artech House: Norwood, MA, USA, 2014.
12. Parchin, N.O.; Basherlou, H.J.; Al-Yasir, Y.I.A.; Abd-Alhameed, R.A.; Abdulkhaleq, A.M.; Noras, J.M. Recent
developments of reconfigurable antennas for current and future wireless communication systems. Electronics
2019, 8, 128. [CrossRef]
13. Parchin, N.O.; Al-Yasir, Y.; Abdulkhaleq, A.M.; Elfergani, I.; Rayit, A.; Noras, J.M.; Rodriguez, J.;
Abd-Alhameed, R.A. Frequency reconfigurable antenna array for mm-Wave 5G mobile handsets. In
Proceedings of the 9th International Conference on Broadband Communications, Networks, and Systems,
Faro, Portugal, 19–20 September 2018.
14. Al-Yasir, Y.; Abdullah, A.; Ojaroudi Parchin, N.; Abd-Alhameed, R.; Noras, J. A new
polarization-reconfigurable antenna for 5G applications. Electronics 2018, 7, 293. [CrossRef]
15. Hussain, R.; Alreshaid, A.T.; Podilchak, S.K.; Sharawi, M.S. Compact 4G MIMO antenna integrated with a
5G array for current and future mobile handsets. IET Microw. Antennas Propag. 2017, 11, 271–279. [CrossRef]
16. Chen, Q.; Lin, H.; Wang, J.; Ge, L.; Li, Y.; Pei, T. Single ring slot based antennas for metal-rimmed 4G/5G
smartphones. IEEE Trans. Antennas Propag. 2019, 67, 1476–1487. [CrossRef]
17. Ojaroudi Parchin, N.; Abd-Alhameed, R.A. A compact Vivaldi antenna array for 5G channel sounding
applications. In Proceedings of the 12th European Conference on Antennas and Propagation, London, UK,
9–13 April 2018.
18. Al-Hadi, A.A.; Ilvonen, J.; Valkonen, R.; Viikan, V. Eight-element antenna array for diversity and MIMO
mobile terminal in LTE 3500MHz band. Microw. Opt. Technol. Lett. 2014, 56, 1323–1327. [CrossRef]
19. Abdullah, M. Compact 4-Port MIMO antenna system for 5G mobile terminal. In Proceedings of the
International Applied Computational Electromagnetics Society Symposium, Florence, Italy, 26–30 March
2017.
20. Ojaroudi Parchin, N.; Al-Yasir, Y.I.A.; Abd-Alhameed, R.A.; Noras, J.M. Dual-polarized MIMO antenna array
design using miniaturized self-complementary structures for 5G smartphone applications. In Proceedings of
the EuCAP Conference, Krakow, Poland, 31 March–5 April 2019. accepted.
21. Wong, K.L.; Lu, J.Y.; Chen, L.Y.; Li, W.Y.; Ban, Y.L. 8-antenna and 16-antenna arrays using the quad-antenna
linear array as a building block for the 3.5-GHz LTE MIMO operation in the smartphone. Microw. Opt.
Technol. Lett. 2016, 58, 174–181. [CrossRef]
22. Rao, L.-Y.; Tsai, C.-J. 8-loop antenna array in the 5 inches size smartphone for 5G communication the
3.4 GHz–3.6 GHz band MIMO operation. In Proceedings of the Progress in Electromagnetics Research
Symposium (PIERS), Toyama, Japan, 1–4 August 2018.
23. Abdullah, M.; Ban, Y.L.; Kang, K.; Li, M.Y.; Amin, M. Eight-element antenna array at 3.5GHz for MIMO
wireless application. Prog. Electromagn. Res. C 2017, 78, 209–217. [CrossRef]
24. Li, Y.; Luo, Y.; Yang, G. High-isolation 3.5-GHz 8-antenna MIMO array using balanced open slot antenna
element for 5G smartphones. IEEE Trans. Antennas Propag. 2019. [CrossRef]
25. Sun, L.; Feng, H.; Li, Y.; Zhang, Z. Compact 5G MIMO mobile phone antennas with tightly arranged
orthogonal-mode pairs. IEEE Trans. Antennas Propag. 2018, 66, 6364–6369. [CrossRef]

302
Electronics 2019, 8, 521

26. Li, M.Y.; Ban, Y.L.; Xu, Z.Q.; Guo, J.; Yu, Z.F. Tri-polarized 12-antenna MIMO array for future 5G smartphone
applications. IEEE Access 2018, 6, 6160–6170. [CrossRef]
27. Zhao, A.; Ren, Z. Size reduction of self-isolated MIMO antenna system for 5G mobile phone applications.
IEEE Antennas Wirel. Propag. Lett. 2019, 18, 152–156. [CrossRef]
28. Statement: Improving Consumer Access to Mobile Services at 3.6 GHz to 3.8 GHz. Available online:
https://www.ofcom.org.uk/consultations-and-statements/category-1/future-use-at-3.6-3.8-ghz (accessed on
21 October 2018).
29. CST Microwave Studio; ver. 2018; CST: Framingham, MA, USA, 2018.
30. Chang, K. Microwave Ring Circuits and Antennas; Wiley: New York, NY, USA, 1996.
31. Batchelor, J.C.; Langley, R.J. Microstrip annular ring slot antennas for mobile applications. Electron. Lett.
2019, 32, 1635–1636. [CrossRef]
32. Yoshimura, Y. A microstripline slot antenna. IEEE Trans. Microw. Theory Tech. 1972, 20, 760–762. [CrossRef]
33. Elfergani, I.; Hussaini, A.S.; Rodriguez, J.; Abd-Alhameed, R. Antenna Fundamentals for Legacy Mobile
Applications and Beyond; Springer Nature: Basingstoke, UK, 2017; pp. 1–659.
34. Al-Nuaimi, M.K.T.; Whittow, W.G. Performance investigation of a dual element IFA array at 3 GHz for
MIMO terminals. In Proceedings of the Antennas and Propagation Conference (LAPC), Loughborough, UK,
14–15 November 2011; pp. 1–5.
35. Nguyen, Q.D.; Le, T.T.; Le, D.T.; Tran, X.N.; Yamada, Y. A compact MIMO ultra-wide band antenna with low
mutual coupling. Appl. Comput. Electromagn. Soc. J. 2018, 31, 252–260.
36. Parchin, N.O.; Al-Yasir, Y.I.; Ali, A.H.; Elfergani, I.; Noras, J.M.; Rodriguez, J.; Abd-Alhameed, R.A.
Eight-element dual-polarized MIMO slot antenna system for 5G smartphone applications. IEEE Access 2019,
9, 15612–15622. [CrossRef]
37. Syrytsin, I.; Zhang, S.; Pedersen, G.F. Performance investigation of a mobile terminal phased array with user
eﬀects at 3.5 GHz for LTE advanced. IEEE Antennas Wirel. Propag. Lett. 2017, 16, 1847–1850. [CrossRef]
38. Sharawi, M.S. Printed multi-band MIMO antenna systems and their performance metrics [wireless corner].
IEEE Antennas Propag. Mag. 2013, 55, 218–232. [CrossRef]
39. Ojaroudi Parchin, N.; Alibakhshikenari, M.; Jahanbakhsh Basherlou, H.; AAbd-Alhameed, R.; Rodriguez, J.;
Limiti, E. mm-Wave phased array quasi-Yagi antenna for the upcoming 5G cellular communications. Appl.
Sci. 2019, 9, 978. [CrossRef]
40. Basar, M.R.; Malek, M.F.; Juni, K.M.; Saleh, M.I.; Idris, M.S.; Mohamed, L.; Saudin, N.; Mohd Aﬀendi, N.A.;
Ali, A. The use of a human body model to determine the variation of path losses in the human body channel
in wireless capsule endoscopy. Prog. Electromagn. Res. 2013, 133, 495–513. [CrossRef]

303
electronics
Article
High-Isolation Leaky-Wave Array Antenna Based on
CRLH-Metamaterial Implemented on SIW with ±30o
Frequency Beam-Scanning Capability
at Millimetre-Waves
Mohammad Alibakhshikenari 1, *, Bal Singh Virdee 2 , Chan H. See 3,4 , Raed A. Abd-Alhameed 5 ,
Francisco Falcone 6 and Ernesto Limiti 1
1 Electronic Engineering Department, University of Rome “Tor Vergata”, Via del Politecnico 1, 00133 Rome,
Italy; [email protected]
2 Center for Communications Technology, School of Computing & Digital Media, London Metropolitan
University, London N7 8DB, UK; [email protected]
3 School of Engineering & the Built Environment, Edinburgh Napier University, 10 Colinton Rd.; Edinburgh,
EH10 5DT, UK; [email protected]
4 School of Engineering, University of Bolton, Deane Road, Bolton BL3 5AB, UK
5 Faculty of Engineering & Informatics, University of Bradford, Bradford, West Yorkshire BD7 1DP, UK;
[email protected]
6 Electrical and Electronic Engineering Department, Public University of Navarre, 31006 Pamplona, Spain;
[email protected]
* Correspondence: [email protected]

Received: 22 April 2019; Accepted: 5 June 2019; Published: 6 June 2019

Abstract: The paper presents a feasibility study on the design of a new metamaterial leaky-wave
antenna (MTM-LWA) used in the construction of a 1 × 2 array which is implemented using
substrate-integrated waveguide (SIW) technology for millimetre-wave beamforming applications.
The proposed 1 × 2 array antenna consists of two LWAs with metamaterial unit-cells etched on the top
surface of the SIW. The metamaterial unit-cell, which is an E-shaped transverse slot, causes leakage
loss and interrupts current flow over SIW to enhance the array’s performance. The dimensions
of the LWA are 40 × 10 × 0.75 mm3 . Mutual-coupling between the array elements is suppressed
by incorporating a metamaterial shield (MTM-shield) between the two antennas in the array. The
LWA operates over a frequency range of 55–65 GHz, which is corresponding to 16.66% fractional
bandwidth. The array is shown to exhibit beam-scanning of ±30◦ over its operating frequency range.
Radiation gain in the backward (−30◦ ), broadside (0◦ ), and forward (+30◦ ) directions are 8.5 dBi,
10.1 dBi, and 9.5 dBi, respectively. The decoupling slab is shown to have minimal effect on the array’s
performance in terms of impedance bandwidth and radiation specifications. The MTM-shield is
shown to suppress the mutual coupling by ~25 dB and to improve the radiation gain and efficiency
by ~1 dBi and ~13% on average, respectively.

Keywords: Metamaterials (MTM); leaky-wave antenna (LWA); antenna arrays; substrate integrated
waveguide (SIW); transverse slots; beam-scanning; mutual coupling isolation; millimetre-wave;
composite right/left-handed transmission line (CRLH-TL)

1. Introduction
Leaky-wave antennas (LWAs) are travelling wave antennas with electrically large radiating
aperture [1,2]. Such antennas can provide high gain directive beam without using a complex feeding
network [3,4]. The advantage of LWA over a conventional array antenna is a simple feed structure

Electronics 2019, 8, 642; doi:10.3390/electronics8060642 305 www.mdpi.com/journal/electronics

Electronics 2019, 8, 642

with low loss [5–7]. Conventional planar LWAs radiate higher order modes in the forward direction [8].
However, the periodic structure-based LWA can radiate in both forward and backward directions. It
has been shown that metamaterial-based LWA designs can achieve a continuous main beam scanning
from the backward to the forward direction as a function of frequency [9]. Various LWA designs based
on metamaterial structures have been considered in the past. Such designs include (1) an LWA with
a composite right/left-hand (CRLH)-folded substrate-integrated waveguide (SIW) structure that is
shown to provide beam scanning from of −58◦ to 65◦ with a gain of 1 dBi [10]; (2) an interdigital-shaped
slotted-SIW-based LWA that is reported to achieve a scanning angle of −60◦ to 70◦ with gain of around
8 dBi [11]; (3) a CRLH LWA based on a rectangular waveguide structure that has been demonstrated
for a continuous main beam scanning range from −70◦ to 70◦ with gain of 8.64 dBi [12]; and (4) a planar
slotted SIW LWA that provides a scanning range of −66◦ to 78◦ with consistent gain [13].
In this paper, a new type of LWA in array conﬁguration is proposed based on SIW with metamaterial
inclusions for millimetre-wave applications. Mutual coupling between the closely-spaced antennas
in the array can undermine the array’s performance. To circumvent this, a metamaterial shield is
embedded between the two LWAs. With this approach, mutual coupling is shown to reduce by an
average of ~25 dB over the array’s operating frequency range. Engraved on the upper layer of the SIW
LWA are several metamaterial unit-cells comprising of transverse E-shaped slots. Dimensions of the
LWAs were modiﬁed for optimum array performance.

2. Design Process of The Proposed Mtm-Lwa Array Based On Siw

The proposed 1 × 2 array antenna based on MTM-LWA implemented on SIW technology is
designed on a RO3003 dielectric substrate with εr = 3.0, tangδ = 0.0010 and thickness of 0.75 mm.
Figure 1 displays the layout of the proposed array structure that is constructed with two MTM-LWA
on SIW. Engraved on the top of each SIW antenna are several metamaterial unit-cells consisting of
transverse E-shaped slots. Leakage loss at the slots interrupts the current ﬂow over SIW, which is shown
to enhance the array’s impedance bandwidth and beamwidth that scans as a function of frequency. In
the structure, the transverse slots behave as series left-handed capacitance and the grounded via holes
acts as shunt left-handed inductors.

Figure 1. Cont.

306
Electronics 2019, 8, 642

Figure 1. Proposed 1 × 2 array antenna based on MTM-LWA using SIW technology. (a) Top view; (b)
view to show the Substrate integrated waveguide slots; (c) back side to show the ground plane.

The structural parameters of the MTM-LWA array are summarized in Table 1. Each antenna has
dimensions of 40 × 10 × 0.75 mm3 . The overall ground plane dimensions are 50 × 35 × 0.75 mm3 . The
S-parameter responses of the proposed array antenna are exhibited in Figure 2, which shows it operates
throughout the frequency range of 55–65 GHz, which corresponds to 16.66% fractional bandwidth.

Table 1. Structural parameters of the array.

Structural Parameters Dimensions (Units in mm)

LA L A 40
W A WA 10
LG L G 50
W G WG 35
LS L S 5
W S WS 0.5
DviaDvia 0.5
SviaSvia 0.5
SA SA 5
LM LM 5
W M WM 1
hh 0.75

307
Electronics 2019, 8, 642

Figure 2. S-parameter responses of the proposed array antennas. Since the structure is symmetrical,
we have not plotted all curves.

Radiation gain patterns of the proposed array antenna at three spot frequencies within its operating
frequency range are plotted in Figure 3. It is evident that the array antenna is capable of beam-scanning
◦
from −30◦ to +30◦ with backward radiation at −30◦ , broadside radiation at 0 , and forward radiation
at +30◦ . In backward, broadside, and forward directions, the gain is 8.5, 10.1, and 9.5 dBi, respectively.

Figure 3. Cont.

308
Electronics 2019, 8, 642

Figure 3. Radiation characteristics of the proposed 1 × 2 MTM-LWA array at 55 GHz, 60 GHz, and 65
GHz. (a) Backward-radiation at 55 GHz; (b) broadside-radiation at 60 GHz; (c) forward-radiation at
65 GHz.

3. Suppress the Mutual Coupling Between the Closely Spaced Mtm-Lwa Arrays
Mutual coupling between closely-spaced radiation elements can severely undermine the array’s
radiation performances. Here the isolation between the two MTM-LWAs is increased by introducing
a metamaterial shield which is based on the SIW structure, as indicated in Figure 4. It comprises of
transverse slots that are tapered. The slots have a width of 0.5 mm and essentially play the role of
the series left-handed capacitances (CL ), where the metallic via-holes with diameter of 0.25 mm act as
shunt left-handed inductances (LL ). The MTM-shield suppresses surface waves created by the LWAs
to increase isolation between the two antennas in the array. The overall dimensions of the shield are
40 × 4 mm2 .
Figure 5 shows the S-parameter response before and after applying the MTM-shield. After
applying MTM-shield, the minimum, average and maximum suppression observed are 8 dB, ~25 dB,
and 42.5 dB, respectively. This shows the effectiveness of the proposed isolation technique. The shield
has no influence on the reflection coefficient response which is S11 ≤ −10 dB.
Radiation patterns of the proposed antenna arrays with no and with MTM-shield through its
operational bandwidth at spot frequencies of 55 GHz, 60 GHz & 65 GHz are plotted in Figure 6. It is
clear from this figure that with the shield the cross-polarized radiation over its operating range is
substantially reduced. The average gain of the co-polarized radiation is only marginally affected.
All details are tabulated in Table 2.
In addition, the radiation gain and efficiency curves over frequency bandwidth for both antennas
without and with the proposed shield are shown in Figure 7. Obviously, after realizing the metamaterial
shield based on SIW, the radiation gain and efficiency performances improved by ~1dBi and ~13% on
average, respectively.
Surface current density distributions without and with the MTM-shield are exhibited in Figure 8.
This figure shows that the MTM-shield is an effective EM band-gap structure to remarkably block
surface currents from EM interacting with adjacent radiation elements in the antenna array. Destructive
influences of surface currents in the antenna are dramatically suppressed from effecting the far-field of
the array antennas.

309
Electronics 2019, 8, 642

Figure 4. Proposed SIW-based leaky-wave antenna array with MTM-shield. (a) Proposed MTM-SIW
shield located between the array antennas; (b) top-view of the leaky-wave array antennas; (c) back-side
to show ground plane.

Figure 5. Reﬂection and transition coeﬃcients of the proposed antenna array before and after applying
the MTM-shield. Since the structure is symmetrical, we have not plotted all curves.

310
Electronics 2019, 8, 642

Figure 6. Co- and cross-polarized radiation gain patterns of the proposed structure without (WO) and
with (W) metamaterial-shield.

311
Electronics 2019, 8, 642

Table 2. Radiation properties.

Radiation Gain
Without MTM With MTM
Minimum 8.69 dBi 9.6 dBi
Maximum 9.11 dBi 10.16 dBi
Average 8.90 dBi 9.86 dBi
Improvement in Average ~1 dBi
Radiation Eﬃciency
Without MTM With MTM
Minimum 62.53% 73.44%
Maximum 65.21% 78.12%
Average 63.5% 76.38
Improvement in Average ~13%

Figure 7. Radiation gain and eﬃciency curves over frequency band for both cases with no and with
MTM shield. (a) Radiation-gain; (b) Radiation-eﬃciency.

Figure 8. Surface current density distributions without and with the metamaterial shield at 60 GHz. (a)
without the metamaterial shield; (b) with the metamaterial shield.

4. Circuit Model of The Proposed MTM-Lwa Array and its Dispersion Phenomenon
One way to explain the metamaterials (MTMs) is the transmission-line theory in terms of the circuit
models. The concept of the composite right/left-handed metamaterial transmission lines (CRLH-MTM
TLs) is investigated and realized based on this approach. This solution has been broadly recognized
and adopted as a powerful analysis tool for the understanding and modelling of MTM devices. By

312
Electronics 2019, 8, 642

considering the right-handed (RH) effects within a purely left-handed (LH) circuit, it demonstrates
a general configuration of a practical MTM-TL. The circuit model of a generic symmetrical CRLH
transmission-line unit-cell has exhibited in Figure 9 where the loss is neglected for simplicity. The
series capacitance (CL ) and the shunt inductance (LL ), which have been realized by the slots and
via-holes, respectively, contribute to the left-handedness while the series inductance (LR ) and the shunt
capacitance (CR ), which have been implemented by the unwanted currents flowing on the patches and
the gap space between the patches and ground plane, respectively, actualize its right-handed (RH) dual
counterpart. The one indicated in Figure 9a is called T-type model with the LH capacitance placed
at the two ends. The mushroom unit cell belongs to this type [14]. The circuit exhibited in Figure 9b
is called π-type model with the LH capacitance in the centre. One example is the CRLH-SIW unit
cell [15–17]. Therefore, each unit cell of the proposed leaky wave array antennas is based on the π-type
model, which has been identified in Figure 4b.

Figure 9. Equivalent circuit models for the symmetrical CRLH-metamaterial unit-cells. (a) T-type
circuit model. (b) π-type circuit model.

By using the periodic boundary conditions corresponding to the Bloch–Floquet theorem, these two
CRLH transmission line unit-cells basically become equal and their dispersion relevance is determined
to be [9]:
1 ω ω2 ω ω2
2 2
1
β(ω) = cos−1 (1 − ( L2 + 2 − 2L − 2L )) (1)
p 2 ω ωR ωse ωsh
where p = 5 mm is the length of the unit-cell and

1
ωL = √ (2)
CL LL

1
ωR = √ (3)
CR LR
1
ωse = √ (4)
C L LR
1
ωsh = √ (5)
CR LL
Seemingly, there are two frequency spots referred to as the inﬁnite-wavelength points (β = 0)
with a bandgap in between. In the balanced case (ωse = ωsh ), the bandgap vanishes. Generally, just
one particular zeroth-order resonance will be excited that depends on the boundary conditions and the
circuit values. For the short-ended resonator, it is determined by ωse , while for the open-ended case, it
is represented by ωsh [9]. Multiplex resonances containing the negative-, zeroth-, and positive- order
resonances can be produced by cascading more than one unit-cell. Those resonance frequencies of

313
Electronics 2019, 8, 642

diﬀerent order modes for an M-stage CRLH-transmission line can be discovered on the dispersion
diagram when the following condition is satisﬁed [9]:

θM = βpM = βMp = nπ (6)

(
nπ n = 0, ±1, . . . , ±{M − 1 , f or T type unice cell
βp = (7)
M n = 0, ±1, . . . , ±M, f or π type unice cell .
The proposed leaky wave antenna (LWA) array applying the CRLH-SIW unit-cell as shown in
Figures 1 and 4 has an π-type circuit model with the two ends terminated by the LH inductances
(LL ) realized by the metallic via-holes. The E-shaped transverse and the tapered transverse slots have
presented within the circuit model as the LH capacitances (CL ). RH contribution comes from the
distributed shunt capacitor (CR ) and the series inductor (LR ). Figure 10 shows the dispersion curves of
the proposed leaky wave array antennas based on the CRLH-SIW unit-cell that has been extracted
by the CST Microwave Studio package and the equivalent circuit model exhibited in Figure 9b. It
has been observed that the results achieved from CST Microwave Studio package and the circuit
elements are in an excellent coherence with each other, and also it illustrates the dispersion relation
of the unit-cell very well. There is one zeroth-order resonance frequency occurring at 61 GHz that
defines the upper and lower edges of the stop-band. This LW antenna array falls into the short-ended
case because it is operated below the original waveguide cutoff-frequency and the metallic via-holes
offer the short-ended condition. According to Equations (6) and (7), there is one resonance that can
be excited comprising the zeroth-order resonance at fse = 61 GHz. Figure 5 shows the reflection
coefficient which has verified the proposed model. Notice that this resonance frequency can be simply
controlled by engineering the dispersion diagram. Magnitudes of the equivalent circuit parameters,
which were determined from full-wave EM simulation using CST Microwave Studio package, are LL =
6.45 nH, CL = 8.69 pF, LR = 1.53 nH, and CR = 4.12 pF. This data was then utilized to determine the
equivalent circuit-model of the antenna, displayed in Figure 9b, which was verified applying ADS
(RF-circuit solver).

Figure 10. Dispersion diagrams for the proposed LWA array based on SIW-MTM extracted by CST
Microwave Studio package and the corresponding equivalent circuit shown in Figure 9b.

The proposed LWA is exhibited as a traveling-wave antenna, where the current propagates along
a guiding structure. Since that, the perturbations are introduced along the structure by implementing
the E-shaped transverse and the traveling-wave leaves the structure and radiates into free-space.
Therefore, in the ideal case, no energy reaches the end of the structure. In a practical scenario, any
energy that reaches the end is absorbed by a matching load. Usually, LWA is designed, in which
at the least 90% of the power at the structures leaks away before the traveling-waves reach the end

314
Electronics 2019, 8, 642

of the antenna. Leaky-wave phenomenon is demonstrated with fast propagating waves only. The
propagating wave number Kp is deﬁned by [18,19]

Kp = K02 − Kz2 (8)

In this case, Kz = − jα is for surface-wave or slow-wave, or Kz = β is for leaky-wave or fast-wave,

where Kz is the longitudinal wave-number and K0 refers to the free-space propagating wave-number.
The complexity of radiation Kz is given by

Kz = β − jα (9)

where α and β are the attenuation and phase constants respectively. Supposing that there is a standard
free-space wave equation for the above wave, the waves outside the leaking-structure are given by:

Ψ(r) = Ψ0 e−j(Kp p+βz ) eαz (10)

If β < Kp i.e., if the phase velocity is smaller than the free-space velocity of light, Vp < C, then it is a
slow-wave and Kp is imaginary. The wave decays exponentially in amplitude along the length of the
structure and it is a bounded wave. If, β > Kp i.e., if the phase velocity is greater than the free space
velocity of light, Vp > C, then it is a fast-wave and Kp is purely real; therefore, the real power at an
angle is radiated with respect to the normal deﬁned by [20]:
* + * +
β cβ
sin(θ) = sin−1 = sin−1 (11)
K0 ω

Since all of the abovementioned terms are functions of the angular frequency, the angle changes with
frequency; hence, this shows frequency scanning behaviour. The main beam-width is

0.91
Δθ0 = (12)
l
λ cosθ0

If the above equation is applied for large antenna lengths, high directivity can be speciﬁed as

4πAe
D= (13)
λ0

However, the eﬀect of enhancing directivity is negligible if there is no power left near the end of the
structures. To specify this parameter, the attenuation/leakage constant is determined as [20]:

A2 ( z )
2
er
αz = / l /z (14)
0
A2 (z)dz − er 0
A2 (z)dz

Thus, if α is suﬃciently small so that (1 − e−2αl > 0), the improvement of directivity is perceptible as
length l enhances.

5. Comparison between This Work and The Literature

Performance parameters of the proposed 1 × 2 MTM-LWA array antenna based on SIW is compared
with the recent works employing various mutual coupling suppression techniques. The comparison
in Table 3 is for array antennas composed of two radiation elements. Most of the arrays listed in
Table 3 exhibit narrow band performance, and to increase isolation between the radiation elements they
employ defected ground structures (DGS) to complement their suppression technique. The proposed
array antenna presented here has the advantage of (i) symmetry; (ii) very wide bandwidth from 55

315
Electronics 2019, 8, 642

GHz to 65 GHz; (iii) simple design; (iv) improved radiation patterns; (v) enhanced radiation gain; (vi)
low cross-polarization levels; and (vii) mutual coupling suppression on average of ~25dB over its
operating band.

Table 3. Performance parameters of the proposed lw array antennas in comparison with the
recent papers.

Max. Isolation Bandwidth Rad. Gain Pattern No. of Application of Symmetry and
Ref. Method
Improvement (FBW) Deterioration Elements DGS Simplicity
[21] SCSRR 10 dB Narrow Yes 2 Yes No
[22] SCSSRR 14.6 dB Narrow Yes 2 Yes No
[23] Compact EBG 17 dB Narrow Yes 2 Yes No
[24] Fractal MTM-EMBG 37 dB Wide (~15%) No 4 No Yes
[25] Meander Line 10 dB Narrow No 2 Yes No
[26] EBG 8.8 dB Narrow - Yes No
[27] EBG 13 dB Wide (~12%) Yes 2 Yes No
Substrate Integrated
[28] Waveguide (SIW) with 15 dB Wide (22.22%) - 1 No Yes
Transverse Slots
Substrate Integrated
[29] Waveguide (SIW) and 20 dB Wide (16.32%) - 1 No Yes
Slots
Periodic Space-Time
[30] - Wide (58.82%) - 1 No Yes
Modulation
Space Time Modulation
[31] (External Linear - - - 1 No Yes
Momentum)
[32] MTM-DS 57 dB Wide No 2 No Yes
[33] Fractal load + DGS 16 dB Narrow (2.5%) No 2 Yes No
[34] Slotted Meander-Line 16 dB Narrow Yes 2 Yes No
[35] Slots >26 dB Wide No 4 No Yes
[36] I-Shaped Resonator 30dB Narrow Yes 2 Yes No
[37] W/g MTM 18 dB Narrow No 2 Yes No
[38] MSWI 13.5 dB Wide No 2 No Yes
[39] Metamaterial Superstrate 25 Narrow No 2 No Yes
Slot Combined
[40] Complementary Split 19 Narrow Yes 2 No No
Ring Resonator
[41] Metamaterial 40 Wide (8.8%) No 4 No Yes
[42] Fractal Load 37 Wide No 2 No Yes
Metamaterials and
This
Substrate Integrated 42.5 dB Wide (16.66%) No 2 No Yes
work
Waveguide

6. Conclusion
A feasibility study of a new metamaterial leaky-wave array antenna based on substrate-integrated
waveguide (SIW) technology with transverse slots and metallic via-holes for operation over 55 GHz to
65 GHz was proposed and investigated. The array antenna provides beam-scanning capability of ±30◦
◦
with the gain of 8.5, 10.1, and 9.5 dBi at backward (-30◦ ), broadside (0 ), and forward (+30◦ ) directions,
respectively. To increase the isolation between the array’s elements, a metamaterial shield based on
SIW was introduced between the antennas, which has reduced the mutual coupling by an average
value of ~25 dB. In addition, the proposed MTM shield has increased the radiation gain and eﬃciency
by an average value of ~1dBi and ~13%, respectively.

Author Contributions: Conceptualization, M.A.; B.S.V.; F.F.; E.L.; methodology, M.A.; C.H.S.; F.F.; E.L.; software,
M.A.; B.S.V; C.H.S.; validation, M.A.; B.S.V.; C.H.S.; R.A.A.-A.; F.F.; E.L.; formal analysis, M.A.; B.S.V.; F.F.; E.L.;
investigation, M.A.; B.S.V.; C.H.S.; R.A.A.-A.; F.F.; E.L.; resources, M.A.; C.H.S.; R.A.A.-A.; E.L.; data curation,
M.A.; C.H.S.; R.A.A.-A.; F.F.; writing—original draft preparation, M.A.; writing—review and editing, B.S.V.;
C.H.S.; R.A.A.-A.; F.F.; E.L.; visualization, M.A.; B.S.V.; C.H.S.; F.F.; E.L.; supervision, E.L.; project administration,
R.A.A.-A.; F.F.; E.L.; funding acquisition, R.A.A.-A.; F.F.; E.L..

316
Electronics 2019, 8, 642

Funding: This work is partially supported by grant agreement H2020-MSCA-ITN-2016 SECRET-722424 and the
UK EPSRC under grant EP/E022936/1.
Conﬂicts of Interest: The authors declare no conﬂict of interest. The funders had no role in the design of the
study; in the collection, analyses, or interpretation of data; in the writing of the manuscript, or in the decision to
publish the results.

References
1. Jackson, D.R.; Oliner, A.A. Leaky-wave antennas. In Modern Antenna Handbook; John Wiley & Sons: New
York, NY, USA, 2008; pp. 325–367.
2. Sievenpiper, D.F. Forward and backward leaky wave radiation with large effective aperture from an
electronically tunable textured surface. IEEE Trans. Antennas Propag. 2005, 53, 236–247. [CrossRef]
3. Jackson, D.R.; Caloz, C.; Itoh, T. Leaky-wave antennas. Proc. IEEE 2012, 100, 2194–2206. [CrossRef]
4. Lim, S.; Caloz, C.; Itoh, T. Metamaterial-based electronically controlled transmission-line structure as a novel
leaky-wave antenna with tunable radiation angle and beamwidth. IEEE Trans. Microw. Theory Tech. 2004, 52,
2678–2690. [CrossRef]
5. Caloz, C.; Itoh, T.; Rennings, A. CRLH metamaterial leaky-wave and resonant antennas. IEEE Antennas
Propag. Mag. 2008, 50, 25–39. [CrossRef]
6. Paulotto, S.; Baccarelli, P.; Frezza, F.; Jackson, D.R. A novel technique for open-stopband suppression in 1-D
periodic printed leaky-wave antennas. IEEE Trans. Antennas Propag. 2009, 57, 1894–1906. [CrossRef]
7. Chen, Z.N.; Qing, X. Slotted SIW Leaky-Wave Antenna with Improved Backward Scanning Bandwidth
and Consistent Gain. In Proceedings of the 2017 11th European Conference on Antennas and Propagation
(EUCAP), Paris, France, 19–24 March 2017; pp. 752–755.
8. Qian, Y.; Chang, B.C.C.; Itoh, T.; Chen, K.C.; Tzuang, C.K.C. High efficiency and broadband excitation
of leaky mode in microstrip structures. In Proceedings of the 1999 IEEE MTT-S International Microwave
Symposium Digest (Cat. No.99CH36282), Anaheim, CA, USA, 13–19 June 1999; pp. 1419–1422.
9. Caloz, C.; Itoh, T. Electromagnetic Metamaterials: Transmission Line Theory and Microwave Applications;
Wiley-IEEE Press: New York, NY, USA, 2005; 376p.
10. Yang, T.; Chi, P.L.; Xu, R.M. Novel composite right/left-handed leaky-wave antennas based on the folded
substrate-integrated waveguide structures. Prog. Electromagn. Res. 2012, 29, 235–248. [CrossRef]
11. Dong, Y.; Itoh, T. Composite right/left-handed substrate integrated waveguide and half mode substrate
integrated waveguide leaky-wave structures. IEEE Trans. Antennas Propag. 2011, 59, 767–775. [CrossRef]
12. Weitsch, Y.; Eibert, T.F. Analysis and design of a composite left/right-handed leaky wave antenna based on
the H10 rectangular waveguide mode. Adv. Radio Sci. 2008, 6, 49–54. [CrossRef]
13. Chen, Z.N.; Qing, X. Multilayered composite right/left-handed leaky-wave antenna with consistent gain.
IEEE Trans. Antennas Propag. 2012, 60, 5056–5062.
14. Lai, A.; Leong, K.M.; Itoh, T. Infinite wavelength resonant antennas with monopolar radiation pattern based
on periodic structures. IEEE Trans. Antennas Propag. 2007, 55, 868–876. [CrossRef]
15. Dong, Y.; Itoh, T. Miniaturized substrate integrated waveguide slot antennas based on negative order
resonance. IEEE Trans. Antennas Propag. 2010, 58, 3856–3864. [CrossRef]
16. Dong, Y.; Itoh, T. Substrate integrated waveguide negative order resonances and their applications. IEEE Trans.
Antennas Propag. 2010, 4, 1081–1091. [CrossRef]
17. Dong, Y.; Itoh, T. Metamaterial-Based Antennas. Proc. IEEE 2012, 100, 2271–2285. [CrossRef]
18. Johnson, R.C.; Jasik, H. Antenna Engineering Handbook, 4th ed.; McGraw-Hill Education: New York, NY, USA,
2007.
19. Mohsen, M.K.; Isa, M.S.M.; Isa, A.A.M.; Zin, M.S.I.M.; Saat, S.; Zakaria, Z.; Ibrahim, I.M.; Abu, M.; Ahmad, A.;
Abdulhameed, M.K. The Fundamental of Leaky Wave Antenna. J. Telecommun. Electron. Comput. Eng. 2018,
10, 19–127.
20. Frank, B.G. Frontiers in Antennas: Next Generation Design & Engineering; McGraw Hill Professional: New
York, NY, USA, 2010; 512p.
21. Bait-Suwailam, M.M.; Siddiqui, O.F.; Ramahi, O.M. Mutual coupling reduction between microstrip patch
antennas using slotted-complementary split-ring resonators. IEEE Antennas Wirel. Prop. Lett. 2010, 9,
876–878. [CrossRef]

317
Electronics 2019, 8, 642

22. Shafique, M.F.; Qamar, Z.; Riaz, L.; Saleem, R.; Khan, S.A. Coupling suppression in densely packed microstrip
arrays using metamaterial structure. Microw. Opt. Tech. Lett. 2015, 57, 759–763. [CrossRef]
23. Islam, M.T.; Alam, M.S. Compact EBG Structure for Alleviating Mutual Coupling between Patch Antenna
Array Elements. Prog. Electromagn. Res. 2013, 137, 425–438. [CrossRef]
24. Alibakhshikenari, M.; Virdee, B.S.; See, C.H.; Abd-Alhameed, R.; Ali, A.H.; Falcone, F.; Limiti, E. Study
on Isolation Improvement Between Closely Packed Patch Antenna Arrays Based on Fractal Metamaterial
Electromagnetic Bandgap Structures. IET Microw. Ant. Propag. 2018, 12, 2241–2247. [CrossRef]
25. Ghosh, T.; Ghosal, S.; Mitra, D.; Bhadra Chaudhuri, S.R. Mutual Coupling Reduction Between Closely Placed
Microstrip Patch Antenna Using Meander Line Resonator. Prog. Electromag. Res. Lett. 2016, 59, 115–122.
[CrossRef]
26. Yang, F.; Rahmat-Samii, Y. Microstrip Antennas Integrated with Elctromagnetic Band-Gap (EBG) Structures:
A Low Mutual Coupling Design for Array Applications. IEEE Trans. Antennas Propag. 2003, 51, 2936–2946.
[CrossRef]
27. Mu’ath, J.; Denidni, T.A.; Sebak, A.R. Millimeter Wave Compact EBG Structure for Mutual Coupling
Reduction Applications. IEEE Trans. Antennas Propag. 2015, 63, 823–828.
28. Liu, J.; Jackson, D.R.; Long, Y. Substrate integrated waveguide (SIW) leaky-wave antenna with transverse
slots. IEEE Trans. Antennas Propag. 2012, 60, 20–29. [CrossRef]
29. Ettorre, M.; Sauleau, R.; Le Coq, L. Multi-beam multi-layer leaky-wave SIW pillbox antenna for
millimeter-wave applications. IEEE Trans. Antennas Propag. 2011, 59, 1093–1100. [CrossRef]
30. Taravati, S.; Caloz, C. Mixer-duplexer-antenna leaky-wave system based on periodic space-time modulation.
IEEE Trans. Antennas Propag. 2017, 65, 442–452. [CrossRef]
31. Taravati, S.; Caloz, C. Space-time modulated nonreciprocal mixing, amplifying and scanning leaky-wave
antenna system. In Proceedings of the 2015 IEEE International Symposium on Antennas and Propagation &
USNC/URSI National Radio Science Meeting, Vancouver, BC, Canada, 19–24 July 2015; pp. 639–640.
32. Alibakhshikenari, M.; Virdee, B.S.; Shukla, P.; See, C.H.; Abd-Alhameed, R.; Khalily, M.; Falcone, F.; Limiti, E.
Interaction Between Closely Packed Array Antenna Elements Using Metasurface for Applications Such as
MIMO Systems and Synthetic Aperture Radars. Radio Sci. 2018, 53, 1368–1381. [CrossRef]
33. Yang, X.; Liu, Y.; Xu, Y.X.; Gong, S.X. Isolation Enhancement in Patch Antenna Array with fractal UC-EBG
Structure and Cross Slot. IEEE Antennas Wirel. Propag. Lett. 2017, 16, 2175–2178. [CrossRef]
34. Alsath, M.G.N.; Kanagasabai, M.; Balasubramanian, B. Implementation of Slotted Meander Line Resonators
for Isolation Enhancement in Microstrip Patch Antenna Arrays. IEEE Antennas Wirel. Propag. Lett. 2013, 12,
15–18. [CrossRef]
35. Alibakhshikenari, M.; Virdee, B.; Shukla, P.; See, C.; Abd-Alhameed, R.; Khalily, M.; Falcone, F.; Limiti, E.
Antenna Mutual Coupling Suppression Over Wideband Using Embedded Periphery Slot for Antenna Arrays.
Electronics 2018, 7, 198. [CrossRef]
36. Ghosh, C.K.; Parui, S.K. Reduction of Mutual Coupling Between E-Shaped Microstrip Antennas by Using a
Simple Microstrip I-Section. Microw. Opt. Tech. Lett. 2013, 55, 2544–2549. [CrossRef]
37. Qamar, Z.; Park, H.C. Compact Waveguided Metamaterials for Suppression of Mutual Coupling in Microstrip
Array. Prog. Electromagn. Res. 2014, 149, 183–192. [CrossRef]
38. Alibakhshikenari, M.; See, C.H.; Virdee, B.S.; Abd-Alhameed, R.A. Meta-surface Wall Suppression of Mutual
Coupling between Microstrip Patch Antenna Arrays for THz-band Applications. Prog. Electromagn. Res.
Lett. 2018, 75, 105–111. [CrossRef]
39. Qamar, Z.; Naeem, U.; Khan, S.A. Mutual coupling reduction for high performance densely packed patch
antenna arrays on finite substrate. IEEE Trans. Antennas Propag. 2016, 64, 1653–1660. [CrossRef]
40. Qamar, Z.; Riaz, L.; Chongcheawchamnan, M.; Khan, S.A.; Shafique, M.F. Slot combined complementary
split ring resonators for mutual coupling suppression in microstrip phased arrays. IET Microw. Antenna
Propag. 2014, 8, 1261–1267. [CrossRef]

318
Electronics 2019, 8, 642

41. Alibakhshikenari, M.; Khalily, M.; Virdee, B.S.; See, C.H.; Abd-Alhameed, R.A.; Limiti, E. Mutual-Coupling
Isolation Using Embedded Metamaterial EM Bandgap Decoupling Slab for Densely Packed Array Antennas.
IEEE Access 2019, 7, 5182–51840. [CrossRef]
42. Alibakhshikenari, M.; Khalily, M.; Virdee, B.S.; See, C.H.; Abd-Alhameed, R.A.; Limiti, E. Mutual Coupling
Suppression Between Two Closely Placed Microstrip Patches Using EM-Bandgap Metamaterial Fractal
Loading. IEEE Access 2019, 7, 23606–23614. [CrossRef]

319
MDPI
St. Alban-Anlage 66
4052 Basel
Switzerland
Tel. +41 61 683 77 34
Fax +41 61 302 89 18
www.mdpi.com

Electronics Editorial Ofﬁce

E-mail: [email protected]
www.mdpi.com/journal/electronics
MDPI
St. Alban-Anlage 66
4052 Basel
Switzerland
Tel: +41 61 683 77 34
Fax: +41 61 302 89 18
www.mdpi.com ISBN 978-3-03936-017-8

MIMO Text Book PDF
100% (2)
MIMO Text Book PDF
240 pages
Large MIMO Systems PDF
100% (1)
Large MIMO Systems PDF
336 pages
Simulation of Digital Communication Systems Using Matlab
From Everand
Simulation of Digital Communication Systems Using Matlab
Mathuranathan Viswanathan
3.5/5 (22)
2012 60 02 P1
No ratings yet
2012 60 02 P1
288 pages
Part I of Two Parts: February 2012 Number 2 Ietpak (ISSN 0018-926X)
No ratings yet
Part I of Two Parts: February 2012 Number 2 Ietpak (ISSN 0018-926X)
288 pages
Advances in Massive MIMO Antenna Design A Review
100% (1)
Advances in Massive MIMO Antenna Design A Review
2 pages
Massive MIMO in 5G Networks Selected Applications Long Zhao - Own the ebook now with all fully detailed chapters
No ratings yet
Massive MIMO in 5G Networks Selected Applications Long Zhao - Own the ebook now with all fully detailed chapters
57 pages
MIMO Text Book PDF
No ratings yet
MIMO Text Book PDF
240 pages
Antenna Designs For 5GIoT and Space Applications
No ratings yet
Antenna Designs For 5GIoT and Space Applications
270 pages
New Trends in Telecommunications Engineering
No ratings yet
New Trends in Telecommunications Engineering
164 pages
[FREE PDF sample] Massive MIMO in 5G Networks Selected Applications Long Zhao ebooks
100% (1)
[FREE PDF sample] Massive MIMO in 5G Networks Selected Applications Long Zhao ebooks
55 pages
towards_6G
No ratings yet
towards_6G
36 pages
Novel Millimetre Wave Antennas for MIMO and 5G Applications Koul pdf download
No ratings yet
Novel Millimetre Wave Antennas for MIMO and 5G Applications Koul pdf download
53 pages
An Overview of Enhanced Massive MIMO With Array Signal Processing Techniques
No ratings yet
An Overview of Enhanced Massive MIMO With Array Signal Processing Techniques
16 pages
Frontmatter
No ratings yet
Frontmatter
17 pages
Full Massive MIMO in 5G Networks Selected Applications Long Zhao PDF All Chapters
100% (3)
Full Massive MIMO in 5G Networks Selected Applications Long Zhao PDF All Chapters
62 pages
Massive_MIMO_An_Overview_Recent_Challenges_and_Future_Research_Directions
No ratings yet
Massive_MIMO_An_Overview_Recent_Challenges_and_Future_Research_Directions
6 pages
Where can buy Massive MIMO in 5G Networks Selected Applications Long Zhao ebook with cheap price
100% (1)
Where can buy Massive MIMO in 5G Networks Selected Applications Long Zhao ebook with cheap price
55 pages
Research of Frequency Multiplexing in 5G Networks
No ratings yet
Research of Frequency Multiplexing in 5G Networks
6 pages
IEEE MTT V065 I11B 2017 11 Microwave theory and techniques Fantomasping - Read the ebook online or download it as you prefer
No ratings yet
IEEE MTT V065 I11B 2017 11 Microwave theory and techniques Fantomasping - Read the ebook online or download it as you prefer
70 pages
08lwc02 2905315 Completeissue APRIL 2019
No ratings yet
08lwc02 2905315 Completeissue APRIL 2019
328 pages
Toward Extra Large-Scale MIMO New Channel Properties and Low-Cost Designs
No ratings yet
Toward Extra Large-Scale MIMO New Channel Properties and Low-Cost Designs
26 pages
Electronics: Massive MIMO Wireless Networks: An Overview
No ratings yet
Electronics: Massive MIMO Wireless Networks: An Overview
29 pages
Guest Editorial Special Issue On Multiple Antenna Technologies For Beyond 5G-PartI
No ratings yet
Guest Editorial Special Issue On Multiple Antenna Technologies For Beyond 5G-PartI
4 pages
OCTOBER 2018 Number 5 Iwclaf ISSN (2162-2345) : (Contents Continued On Page 686)
No ratings yet
OCTOBER 2018 Number 5 Iwclaf ISSN (2162-2345) : (Contents Continued On Page 686)
3 pages
Performance Analysis and Low-Complexity Design For XL-MIMO With Near-Field Spatial Non-Stationarities
No ratings yet
Performance Analysis and Low-Complexity Design For XL-MIMO With Near-Field Spatial Non-Stationarities
18 pages
Khaleghi-MIMO Systems Theory and Applications
No ratings yet
Khaleghi-MIMO Systems Theory and Applications
500 pages
Hossein Khaleghi Bizaki - MIMO Systems, Theory and Applications PDF
100% (1)
Hossein Khaleghi Bizaki - MIMO Systems, Theory and Applications PDF
500 pages
Efficient DSP Circuit Architectures For Massive MIMO 1807.05882
No ratings yet
Efficient DSP Circuit Architectures For Massive MIMO 1807.05882
21 pages
Ieee Mttv055i08 200708 08th Edition Fantomasping instant download
No ratings yet
Ieee Mttv055i08 200708 08th Edition Fantomasping instant download
80 pages
Towards Massive MIMO 2.0: Understanding Spatial Correlation, Interference Suppression, and Pilot Contamination
No ratings yet
Towards Massive MIMO 2.0: Understanding Spatial Correlation, Interference Suppression, and Pilot Contamination
27 pages
Massive MIMO Detection Techniques: A Survey
No ratings yet
Massive MIMO Detection Techniques: A Survey
25 pages
IEEE MTT V051 I12 2003 12 12th Edition Fantomasping pdf download
No ratings yet
IEEE MTT V051 I12 2003 12 12th Edition Fantomasping pdf download
84 pages
Pilot Decontamination Approach in Massive Mi Mo Systems
No ratings yet
Pilot Decontamination Approach in Massive Mi Mo Systems
11 pages
IEEE MTT V051 I12 2003 12 12th Edition Fantomasping - The ebook is ready for instant download and access
No ratings yet
IEEE MTT V051 I12 2003 12 12th Edition Fantomasping - The ebook is ready for instant download and access
86 pages
Yadav 2022 J. Phys. Conf. Ser. 2273 012007
No ratings yet
Yadav 2022 J. Phys. Conf. Ser. 2273 012007
9 pages
MIMO wireless communications over generalized fading channels 1st Edition Brijesh Kumbhani download
100% (1)
MIMO wireless communications over generalized fading channels 1st Edition Brijesh Kumbhani download
68 pages
Aperture Antennas For Millimeter and Sub-Millimeter Wave Applications
No ratings yet
Aperture Antennas For Millimeter and Sub-Millimeter Wave Applications
494 pages
Hybrid Precoding for Massive MmWave MIMO Systems 2
No ratings yet
Hybrid Precoding for Massive MmWave MIMO Systems 2
10 pages
27350556
No ratings yet
27350556
82 pages
MIMO wireless communications over generalized fading channels 1st Edition Brijesh Kumbhani pdf download
100% (1)
MIMO wireless communications over generalized fading channels 1st Edition Brijesh Kumbhani pdf download
57 pages
JSTSP SI Array Signal
No ratings yet
JSTSP SI Array Signal
1 page
Deep Learning For PHY Layer 5G Challenges
No ratings yet
Deep Learning For PHY Layer 5G Challenges
18 pages
RF and Microwave Engineering: Fundamentals of Wireless Communications
From Everand
RF and Microwave Engineering: Fundamentals of Wireless Communications
Frank Gustrau
No ratings yet
Stretchable Electronics
From Everand
Stretchable Electronics
Takao Someya
No ratings yet
Telephone Communication System Essentials
From Everand
Telephone Communication System Essentials
S. Sudhananthan
No ratings yet
Guidelines for the Determination of Standardized Semiconductor Radiation Hardness Parameters
From Everand
Guidelines for the Determination of Standardized Semiconductor Radiation Hardness Parameters
IAEA
No ratings yet
Wireless Speed Boost
From Everand
Wireless Speed Boost
Mei Gates
No ratings yet
Recent Advances in Electrical Engineering: Applications Oriented
From Everand
Recent Advances in Electrical Engineering: Applications Oriented
SUMAN DEBNATH
No ratings yet
Reconfigurable Radio Systems: Network Architectures and Standards
From Everand
Reconfigurable Radio Systems: Network Architectures and Standards
Maria Stella Iacobucci
No ratings yet
Fiber Optics: Networking And Data Transmission In Action
From Everand
Fiber Optics: Networking And Data Transmission In Action
Rob Botwright
No ratings yet
Innovations Beyond the Wires: Exploring the Frontiers of Electrical Engineering
From Everand
Innovations Beyond the Wires: Exploring the Frontiers of Electrical Engineering
Farzam Mohammadiazar
No ratings yet
The Fibernet Way
From Everand
The Fibernet Way
Pasquale De Marco
No ratings yet
Application of Wireless Technologies in Nuclear Power Plant Instrumentation and Control Systems
From Everand
Application of Wireless Technologies in Nuclear Power Plant Instrumentation and Control Systems
IAEA
No ratings yet
Telecommunications Engineering: Integrating Communication Systems for Autonomous Robotics
From Everand
Telecommunications Engineering: Integrating Communication Systems for Autonomous Robotics
Fouad Sabry
No ratings yet
Analog Revolution: The Wave of the Future
From Everand
Analog Revolution: The Wave of the Future
Pasquale De Marco
No ratings yet
6G Wireless Communications and Mobile Networking
From Everand
6G Wireless Communications and Mobile Networking
Xianzhong Xie
No ratings yet
"The Future of Connectivity: Unleashing the Power of 5G and Beyond": GoodMan, #1
From Everand
"The Future of Connectivity: Unleashing the Power of 5G and Beyond": GoodMan, #1
Patrick Mukosha
No ratings yet
Carbon Nanotube Field Effect Transistor: Advancements in Nanoelectronics for Next Generation Devices
From Everand
Carbon Nanotube Field Effect Transistor: Advancements in Nanoelectronics for Next Generation Devices
Fouad Sabry
No ratings yet
Analog, Digital and Multimedia Telecommunications: Basic and Classic Principles
From Everand
Analog, Digital and Multimedia Telecommunications: Basic and Classic Principles
Omar Fakih Hamad
4/5 (1)
TVS90.digita_TVT_0090_Oct_24
No ratings yet
TVS90.digita_TVT_0090_Oct_24
21 pages
SCN376.Digital April Guide 2025
No ratings yet
SCN376.Digital April Guide 2025
8 pages
Tieline Codec Applications and Solutions v3!0!20240506
No ratings yet
Tieline Codec Applications and Solutions v3!0!20240506
29 pages
SCN371.Digital November Guide 2024
No ratings yet
SCN371.Digital November Guide 2024
5 pages
SCN370.Digital October Guide 2024
No ratings yet
SCN370.Digital October Guide 2024
8 pages
Bullet Journal Basics and Ideas
No ratings yet
Bullet Journal Basics and Ideas
1 page
Mac Terminal Commands
100% (1)
Mac Terminal Commands
3 pages
SVC501.Digital Sept Guide 2024
No ratings yet
SVC501.Digital Sept Guide 2024
25 pages
Funny Interview Questions, 2024
No ratings yet
Funny Interview Questions, 2024
12 pages
RWE69.Digital Sept 2024
No ratings yet
RWE69.Digital Sept 2024
22 pages
RWM1276.digital Sept 01 2024
No ratings yet
RWM1276.digital Sept 01 2024
34 pages
SVC500.Digital August Guide 2024v1
No ratings yet
SVC500.Digital August Guide 2024v1
23 pages
MIXE03.Digital September Ebook 2024
No ratings yet
MIXE03.Digital September Ebook 2024
15 pages
SVCE11.Digital August 2024
No ratings yet
SVCE11.Digital August 2024
22 pages
SCN367.Digital July Guide 2024
No ratings yet
SCN367.Digital July Guide 2024
14 pages
Video Delivery Landscape 2024
No ratings yet
Video Delivery Landscape 2024
1 page
Sim8300g-M2 Mmwave Antenna Module Application Notes v1.00
No ratings yet
Sim8300g-M2 Mmwave Antenna Module Application Notes v1.00
22 pages
Huawei AP6750-10T Access Point
No ratings yet
Huawei AP6750-10T Access Point
12 pages
Huawei AirEngine 5573 23H Access Point PDF en 2024
No ratings yet
Huawei AirEngine 5573 23H Access Point PDF en 2024
10 pages
Chua - Meadmat Term Paper
No ratings yet
Chua - Meadmat Term Paper
6 pages
Communication: Side-Edge Frame Printed Eight-Port Dual-Band Antenna Array For 5G Smartphone Applications
No ratings yet
Communication: Side-Edge Frame Printed Eight-Port Dual-Band Antenna Array For 5G Smartphone Applications
6 pages
LTE Air Interface Interactive
No ratings yet
LTE Air Interface Interactive
18 pages
Adaptive Antenna Systems
No ratings yet
Adaptive Antenna Systems
35 pages
A Simplified Adaptive Neuro-Fuzzy Inference System - ANFIS - Controller Trained by Genetic Algorithm To Control Nonlinear Multi-Input Multi-Output Systems
No ratings yet
A Simplified Adaptive Neuro-Fuzzy Inference System - ANFIS - Controller Trained by Genetic Algorithm To Control Nonlinear Multi-Input Multi-Output Systems
12 pages
Zhang2007 - Ling - MIMO Channel Estimation and Equalization Using Three-Layer Neural Networks With Feedback
No ratings yet
Zhang2007 - Ling - MIMO Channel Estimation and Equalization Using Three-Layer Neural Networks With Feedback
5 pages
MIMO Channel Capacity - File Exchange - MATLAB Central
No ratings yet
MIMO Channel Capacity - File Exchange - MATLAB Central
2 pages
Base Station Antenna Systems PDF
No ratings yet
Base Station Antenna Systems PDF
5 pages
Quadriga Documentation v2.6.1-0
100% (1)
Quadriga Documentation v2.6.1-0
260 pages
Thesis Report On Image Denoising
100% (3)
Thesis Report On Image Denoising
8 pages
4x4 MIMO to optimize Max Throughput
No ratings yet
4x4 MIMO to optimize Max Throughput
3 pages
TOP 200 AMR MF-05 - 5 - 5G MIMO 4x4
No ratings yet
TOP 200 AMR MF-05 - 5 - 5G MIMO 4x4
2 pages
DSP Ieee Papers: Sahasra Software Solutions
No ratings yet
DSP Ieee Papers: Sahasra Software Solutions
13 pages
WWW Ncbi NLM Nih Gov PMC Articles PMC9061263
No ratings yet
WWW Ncbi NLM Nih Gov PMC Articles PMC9061263
64 pages
ec6801 qp
No ratings yet
ec6801 qp
2 pages
Chapter 8. Wi-Fi 7 Network Planning - Wi-Fi 7 in Depth - Your Guide To Mastering Wi-Fi 7, The 802.11be Protocol, and Their Deployment
No ratings yet
Chapter 8. Wi-Fi 7 Network Planning - Wi-Fi 7 in Depth - Your Guide To Mastering Wi-Fi 7, The 802.11be Protocol, and Their Deployment
43 pages
A 4x4 MIMO-OFDM System With MRC in A Rayleigh Multipath Channel For WLAN
No ratings yet
A 4x4 MIMO-OFDM System With MRC in A Rayleigh Multipath Channel For WLAN
4 pages
ZXRAN A9825 S26 Product Description
No ratings yet
ZXRAN A9825 S26 Product Description
15 pages
How To Calculate Peak Data Rate in LTE
No ratings yet
How To Calculate Peak Data Rate in LTE
2 pages
Ber Analysis For Downlink Mimo-Noma Systems Over Rayleigh Fading Channels
No ratings yet
Ber Analysis For Downlink Mimo-Noma Systems Over Rayleigh Fading Channels
16 pages
Ip20c Mimo
100% (1)
Ip20c Mimo
45 pages
OmLondhe - SeminarReport - Wi-Fi 6
No ratings yet
OmLondhe - SeminarReport - Wi-Fi 6
53 pages
Airborne Mesh Radio XK-M353
No ratings yet
Airborne Mesh Radio XK-M353
5 pages
TubeStar Introduction
No ratings yet
TubeStar Introduction
12 pages
Study of Different Types of Microwave Antenna and Its Applications
No ratings yet
Study of Different Types of Microwave Antenna and Its Applications
4 pages
1 - Nokia LTE Solution and Roadmaps
100% (1)
1 - Nokia LTE Solution and Roadmaps
41 pages
ETSI Version PDF
No ratings yet
ETSI Version PDF
169 pages

Massive MIMO Systems, Paper 2020

Uploaded by

Massive MIMO Systems, Paper 2020

Uploaded by

Massive MIMO

Kazuki Maruta and Francisco Falcone

Special Issue Editors

ISBN 978-3-03936-016-1 (Hbk)

About the Special Issue Editors . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . vii

Kazuki Maruta and Francisco Falcone

Seoyoung Yu and Jeong Woo Lee

Ahmed S. Al-hubaishi, Nor Kamariah Noordin, Aduwati Sali

Hieu Trong Dao and Sunghwan Kim

Jong-Gyu Ha, Jae-Hyun Ro and Hyoung-Kyu Song

Qiuna Yan, Yu Sun and Dian-Wu Yue

Hui Feng, Xiaoqing Zhao, Zhengquan Li and Song Xing

Naser Ojaroudi Parchin, Haleh Jahanbakhsh Basherlou, Mohammad Alibakhshikenari,

Mohammad Alibakhshikenari, Bal Singh Virdee, Chan H. See, Raed A. Abd-Alhameed,

Received: 17 February 2020; Accepted: 21 February 2020; Published: 26 February 2020

2. The Present Issue

Electronics 2020, 9, 385; doi:10.3390/electronics9030385 1 www.mdpi.com/journal/electronics

In millimeter-wave communication, almost line-of-sight (LoS) channel or Ricean fading channel

MIMO Multiple-Input Multiple-Output

Received: 25 October 2018 ; Accepted:10 November 2018 ; Published: 12 November 2018

Abstract: To achieve the advantages provided by massive multiple-input multiple-output (MIMO),

Electronics 2018, 7, 317; doi:10.3390/electronics7110317 5 www.mdpi.com/journal/electronics

We consider the downlink transmission of a single-cell multi-user distributed massive MIMO

Figure 1. System Model.

3. Downlink Spectral Efﬁciency Analysis

3.1. Channel Estimation

ĝk (t) = Λk Hk (t)Σ−1 Ψ, (6)

Hk (t) = [Hk,1 (t), · · · , Hk,τ (t)] ,

Under these assumptions, we give a deﬁnition of

then we can rewrite the LMMSE estimation ĝk (t) in (6) as

ĝk (t) = Λk Hk (t)Σ−1/2 ĥk (8)

3.2. Downlink Signal Model

3.3. Downlink Achievable Rates

The strength of the estimated channel from user k to all RAUs is

According to Lemma 1, β2m,k (t)ĥH

k ( t ) ĝk ( t ) can be approximated as Γ ( χk ( t ), θk ( t )), wherein

Proof of Theorem 1. Please refer to Appendix A.

Proof of Theorem 2. Please refer to Appendix B.

Table 1. Basic simulation parameters.

Figure 3. Spectral efﬁciency against M with MRT and ZF precoders.

Figure 5. Spectral efﬁciency against t with MRT and ZF precoders.

Appendix A. Proof of Theorem 1

that E e− j(θk (t)−θk (τ )) = e−

The ﬁrst term of (A2) can be calculated as

where ( a) results from (26).

where ( a) can be obtained by exploiting the fact that eH

For the last term B(t), we ﬁrst calculate

where x  y means lim N →∞ ( x − y) = 0, ( a) results from Lemma 4 (ii) of [32].

Appendix B. Proof of the Theorem 2

The ﬁrst term (A8) can be given by

which results from (27).

Next the second term can be given by

which results from (25) and the fact that eH

For the term B(t), we can have

where ( a) results from the property of ZF precoder.

Received: 15 November 2018; Accepted: 3 December 2018; Published: 4 December 2018

Electronics 2018, 7, 391; doi:10.3390/electronics7120391 21 www.mdpi.com/journal/electronics

Figure 1. System conﬁguration.

2.1. Quantization Noise Model

gi,l,k = ĝi,l,k + g̃i,l,k (5)

2.2. The Energy Efﬁciency Model

ηl,m,i,j = ηl,m,i,j − β l.m,i,j .

Pl = PTC + PLP + PT + PBH . (15)

PTC = M( NPBS + ρPSYN ) + (1 − ρ) PSYN + KPUE + MPADC (16)

where ξ is the ampliﬁed efﬁciency.

where PIND is the power consumption independent of p and can be given by

PIND = PTC + PLP + MP0 . (21)

B ∑lL=1 ∑kK=1 Rl,k (p)

3. Energy Efﬁciency Analysis

Proof. The proof is given in Appendix B.

Proof. The proof is given in Appendix C.

Table 1. Power consumption parameters.

Appendix A. Lemmas for Proof

Appendix B. Proof of Theorem 1

l,l,k ĝl,l,k ∼ Γ ( k̂ l,l,k , θ̂l,l,k ) and Lemma A1.

For the last term, we ﬁrst calculate

where x y means lim N →∞ ( x − y) = 0, ( a) results from Lemma 4 (ii) of [32].

q = q̄ + 0.5 + v(−1)2q̄ ,