2018 Smart Sorting in Massive MIMO Detection
2018 Smart Sorting in Massive MIMO Detection
The research was carried out at Skoltech and supported by the Russian MMSE with ordered successive interference cancellation
Science Foundation (project no. 18-19-00673). (OSIC) MIMO detector is performed from the QRD of the
978-1-5386-6876-4/18/$31.00 ©2018
Authorized licensed use limited to: Ural FederalIEEE
University. Downloaded on June 21,2021 at 03:45:05 UTC from IEEE Xplore. Restrictions apply.
2018 14th International Conference on Wireless and Mobile Computing, Networking and Communications (WiMob)
permuted channel matrix H perm , which is defined as in [5], 1) variable extra complexity of cLLL implementation;
[6] and [7]: 2) slicing (rounding to the nearest point as shown in [1]
y ext = H perm xperm + n, and [5]) and candidates list generation in LR domain
since T transformation results in non-rectangle constel-
xperm = P x, lation and fixed candidates numeration is less efficient;
H perm = P H ext , (3) 3) log-likelihood ratio (LLR) calculation in LR domain
because most modern decoders require soft decision;
where matrix H ext is calculated by equation (2) and used 4) interpolation-based cLLL (i.e. how to interpolate T
instead of H for regularization reasons. QR factorization matrix from pilot symbol to the current one, such
plays a key role in the OSIC detection [6]: approach is described in [9] for QRD without LRA).
H perm = QR, Many researchers have proposed solutions to decrease
LRA complexity or fix the processing iteration of the LR
QH y ext = QH Q Rxperm + QH n, algorithm in their hardware implementation, for example [1]
and [6], but LRA still requires a lot of computations and,
QH y ext = Rxperm + QH n, moreover, performance gap between ML and LRA solution
where R is a [(N + M ) × M ] upper triangular matrix; P is still big in highly correlated UEs scenarios. Nevertheless,
is the permutation matrix, required to reorder UEs before most researchers still ignore problems 1-4, only considering
the SIC detection. Iterative detection starts from xperm (M ) performance gain.
amplitude detection and stops with xperm (1) calculation Using MMSE or MMSE-OSIC MIMO detector after LR
due to upper triangle matrix R property. After that initial cannot achieve satisfying performance for 48×64 MIMO sys-
vector x can be calculated as x = P T xperm . MMSE-OSIC tem. Therefore, the processing K-best detector can be utilized
detection demonstrates improved performance compared to to achieve satisfying performance in adoptable complexity
the MMSE detection, but the improvement is limited due to [1]. Although the proposed distributed K-best (D-K-best, [8])
error propagation issues, caused by non-ideal permutation (or algorithm greatly reduces the sorting complexity compared
sorting). Most known sorting methods are too complex or with a common breadth-first K-best algorithm, the D-K-best
based on power reordering, resulting in performance losses in also results in significant performance losses in correlated
correlated channel. channel. In [1] a variant of D-K-best detector called SR-K-
Lattice reduction aided (LRA) type of methods used in best algorithm was proposed. Sorting the best K survivors
[1] and [8] are pre-processing procedures that operate on the from KM (M = 4) candidates is reduced to sorting the
channel matrix H only rather than pure detection methods best S candidates while other K − S survivors are called
that operate on the received data vector y as well. The LRA ”most expected” and selected without sorting. Thus, paper
MIMO detection algorithms build upon the idea of converting [1] proposes an SR-K-best with index (K, S, p) to reduce the
an ill-conditioned problem into an equivalent well-conditioned sorting operation from K to S times. The vector p defines
problem via a linear transform T that fulfills certain con- positions of the ”most expected” candidates. However, its
ditions. Lattice reduction reorganizes the extended channel architecture is not optimal in candidates selection for the next
matrix to more orthogonal one, lowering the likelihood of iteration step.
noise propagation: In addition, the MIMO detection problem can be refor-
mulated as how to achieve best performance in the fixed
y = H ext T T −1 x + n = H LR z + n, complexity? Alternatively, how to find optimal architecture
z = T −1 x, in fixed complexity? A general approach to this problem is
a machine learning application. Machine learning asymptoti-
H LR = H ext T , cally converges to ML solution and can increase performance
1
in the receiver. There are many machine learning approaches
T
x = aP round (T z) + 1 (1 + i) , to MIMO detection problem: support vector regression (SVR)
2
is proposed in [10], convolutional neural networks (CNN) are
where H LR is the lattice-reduced channel matrix; T is the proposed in [11] and [12]. Authors achieve performance gain
uni-modular matrix; a is the scaling coefficient (a = √210 in high correlated scenarios and their solution requires less
for QAM16 and a = √242 for QAM64) . Thus, many LRA calculations. However, all simulations done with modulation
detectors use function round() to slice the invalid vector order up to QPSK, since phase modulation fits to neural
T z to the nearest point of x. Once the problem is well- networks architecture. With QAM16 or higher the machine
conditioned, common MMSE or MMSE-OSIC detectors can learning algorithm complexity grows significantly. Another
be used to achieve near-optimal performance. The complex problem appears when in uplink channel there are several
Lenstra–Lenstra–Lovász (cLLL) algorithm, described in [1], users with different modulation scheme, for example, N1 UEs
[6] and [8], is a well-known polynomial time LRA algorithm. with QPSK, N2 UEs with QAM16 and N3 UEs with QAM64.
However, it is inefficient for direct implementation because It is clear that the number of combinations N1 N2 N3
of several problems: is rather big when Ni ∈ [1...64] and to learn all such scenarios
Authorized licensed use limited to: Ural Federal University. Downloaded on June 21,2021 at 03:45:05 UTC from IEEE Xplore. Restrictions apply.
2018 14th International Conference on Wireless and Mobile Computing, Networking and Communications (WiMob)
for each signal-to-noise ratio (SNR) and for each correlation III. D ETECTION APPROACH
between UEs the CNN should have huge size. Therefore, the
advantage of direct machine learning application in MIMO A functional scheme of the proposed approach is shown
detection is not evident; however, learning ideas could be use- in Fig. 3. It consists of pre-processing and processing parts.
ful to optimize existing algorithms structures. Our simulations Preprocessing is required to calculate sorted QRD in two
with CNN deep learning in QuaDRiGa channel show that steps.
learning error doesn’t go down when the number of layers
Step 1: QRD interpolation as described in [14].
exceeds the number of UEs. Therefore, CNN demonstrates
performance gain in MIMO detection due to clever iterative For QRD calculation in the MIMO system, we have to esti-
OSIC. mate the channel response of each subcarrier and then perform
In this paper, we show by simulations in QuaDRiGa the QRD for each subcarrier. In practice, the interpolation-
channel that MU-MIMO detection is very sensitive to sorting based QRD only computes the Q and R matrixes for the
operation and propose smart sorting optimization. In fact, pilot subcarriers to reduce a computational complexity. Then,
clever UE sorting before QRD and candidates sorting in SR- the Q and R of the data subcarriers are interpolated from
K-best stage demonstrate final performance similar to LRA those of the pilot subcarriers.
application, but in much less complexity, and the proposed Step 2: UE sorting (strings permutation in matrix H ext )
solution is more feasible in application. to find the P matrix for equation (3).
UE sorting problem is well-known and there are many ap-
II. S IMULATION TOOL proaches to overcome it, for example a post-sorting algorithm
and pre-sorting solution are analyzed in [6]. However, most
QuaDRiGa, short for QUAsi Deterministic Radio chan- methods require many computation resources or result in poor
nel GenerAtor [13], is a Matlab software used for gener- performance. We propose to use P matrix from QRD of the
ating realistic radio channel impulse responses for system- pilot symbols as the first step of P matrix calculation for the
level simulations of mobile radio networks. These simula- data symbol. Therefore, sorting tracking is considered. The
tions are used to determine the performance of new tech- P matrix slightly changes from one symbol to another and a
nologies in order to provide an objective indicator for the low sorting complexity is required to update it. Loss function
standardization process in bodies like the third generation L = L{diag (R)} of diagonal entries of interpolated R
partnership program 3GPP. Besides being a fully-fledged matrix is optimized to guarantee the least number of matrix P
three dimensional geometry-based stochastic channel model, updates (low complexity updates) from one symbol to another
QuaDRiGa contains a collection of channel models along with without performance losses. Processing part is represented by
novel modelling approaches, which provide features to enable
quasi-deterministic multi-link tracking of users movements in
changing environments. QuaDRiGa contains a couple of new
features and is furthermore calibrated against 3GPP channel
models like 3GPP-3D and the latest New Radio channel
model. The QuaDRiGa approach can be understood as a
statistical raytracing model. In our simulation we used non-
line of sight (NLOS) scenario with 48 single antenna UEs
moving with averaged speed of 3km per hour and 64 receiving
antennas of BS (MIMO 48×64). A short fragment (magnitude
spectrum) of generated channel model is shown in Fig. 2 for
600 subcarriers in 90 symbols.
Authorized licensed use limited to: Ural Federal University. Downloaded on June 21,2021 at 03:45:05 UTC from IEEE Xplore. Restrictions apply.
2018 14th International Conference on Wireless and Mobile Computing, Networking and Communications (WiMob)
IV. S MART UE SORTING BEFORE QRD 1) Given an interpolated pre-sorted QRD for the current
Let H be a N = 64 by M = 48 matrix of full rank M . symbol (sorting comes from the pilot symbol);
|R |2 +|Rn+1,n+1 |2
We consider the QRD of matrix H perm = P H ext according 2) We compute the ratios n,n+1 Rn,n 2 for all n
to equations (2) and (3) with permuted strings, i.e. equation and check if there are any less than 1 so that we can
H perm = QR. We assume without loss of generality that the decrease the loss;
matrix R in this decomposition is square. We are interested 3) If yes, we perform the respective elementary permuta-
in finding permutations such that an absolute value of the tion of neighboring strings, i.e. updating the matrices Q
diagonal entries |Rnn | tend to increase as n grows. In general, and R for the current symbol;
it is not possible to find a permutation achieving a complete |R |2 +|Rn+1,n+1 |2
4) After that, we update the ratios n,n+1 Rn,n 2
ordering, but we can fix a specific loss function for new matrix R;
L = L{diag (R)}, 5) Repeat until there are no ratios less than 1.
Let us estimate the complexity of one step of this algorithm.
penalizing the lack of order, and then consider the problem Updating the matrices Q and R requires O(N ) arithmetic
of optimizing the permutation with respect to this loss func- |R |2 +|Rn+1,n+1 |2
tion. One obvious choice for the loss function is the number operations. Updating the ratios n,n+1 Rn,n 2 requires
of pairs of diagonal entries with the wrong order: O(2) operations, since we only need to recompute it at
X n = k − 1 and n = k + 1, where k corresponds to the
L= (Rkk ≥ Rnn ). (4) elementary permutation of the previous step. Checking if there
1≤k<n≤M are ratios less than 1 requires O(M ) comparisons. Thus, the
However, equation (4) is a poor choice for several reasons. total number of one-step comparisons is O(N +M ), i.e. O(N )
First, this loss is excessively (discontinuously) sensitive to arithmetic operations. At each step of the algorithm, we can
the changes in R whenever two diagonal entries become ensure without changing the O(N ) complexity that we choose
equal, and is entirely insensitive otherwise. Second, even if n with the lowest currently available ratio, thus obtaining
we perform only some simple permutation on H ext (say the optimal gain. This can be done simply by minimizing
exchange a pair of neighboring strings), its effect on the loss the ratios over all n’s at each step or, more efficiently, by
is relatively hard to compute: we need to check for each maintaining (and keeping updated between steps) the sorted
diagonal entry, even if it is unaffected by the permutation, list of ratios smaller than 1.
whether any pairs of this entry with the affected diagonal
entries change their status (correct/wrong order). A much
more sensible choice for the loss function is defined as:
M
X
L=− (2n − M − 1)lnRnn . (5)
n=1
Authorized licensed use limited to: Ural Federal University. Downloaded on June 21,2021 at 03:45:05 UTC from IEEE Xplore. Restrictions apply.
2018 14th International Conference on Wireless and Mobile Computing, Networking and Communications (WiMob)
Authorized licensed use limited to: Ural Federal University. Downloaded on June 21,2021 at 03:45:05 UTC from IEEE Xplore. Restrictions apply.
2018 14th International Conference on Wireless and Mobile Computing, Networking and Communications (WiMob)
Authorized licensed use limited to: Ural Federal University. Downloaded on June 21,2021 at 03:45:05 UTC from IEEE Xplore. Restrictions apply.