Study_of_Clustering_Solutions_for_Scalable_Cell-Free_Massive_MIMO
Study_of_Clustering_Solutions_for_Scalable_Cell-Free_Massive_MIMO
ABSTRACT In response to the requirements of 5G and beyond, cell-free systems have emerged to ensure
uniform service throughout the area. However, the idea of having no cells leads to a considerably large
monolithic system, which is not scalable. In this context, common sense suggests the creation of clusters
of Access Points (APs) and User Equipments (UEs) that allow the system to be managed locally to some
extent. The clustering procedure has to ensure certain performance for the global system and, at the same
time, ensure its scalability. In this paper, first, scalability is analyzed. Subsequently, we study and compare
two clustering approaches for uplink and downlink. Finally, we propose a combination of clustering and
resource allocation techniques that outperforms the rest of the analyzed state-of-the-art solutions.
approaches depending on how interference is reduced. The (i) decoding the signal of the first UE considering the signal
first approach creates a different cluster for each UE, locating of the rest as interference, (ii) subtracting the signal decoded
the UEs at the center of their clusters. In this case, the cluster from the first UE from the received signal, (iii) decoding
border is distanced as much as possible. The second approach the signal of the second UE considering the signal from the
includes many UEs in the same cluster and uses interference third UE onwards as interference, and so on. This reception
cancellation techniques to reduce the interference. technique is known to be capacity-achieving if all the sources
In this work, we will analyze these two approaches for of interference have a known covariance. Specifically, since
uplink and downlink under different scenarios in order to the decoding order does not affect the sum rate [5], we assume
figure out which would be the most suitable in terms of that the first signal to be decoded is the one corresponding
achievable data rate under certain circumstances. Based on to the UE with the highest index, i.e., the UE k̃(m) of the
the obtained results, we will propose a combination of clus- m-th cluster, and then we follow the decreasing order of the
tering and resource allocation techniques that outperforms the indices. Therefore, the achievable data rate of the k-th UE in
current solutions in the literature. the uplink with an identity noise covariance is as in (1), shown
The reminder of this paper is organized as follows. at the bottom of the page, where P is the available power at the
In Section II, the system model is presented. Afterward, the UEs. In (1), the interference has been divided into two terms
scalability concept is analyzed in Section III. Section IV according to its origin. In the numerator, the first term is the
describes the two previously mentioned clustering approa- noise covariance; the second term includes the interference
ches. The performance of particular clustering solutions from from UEs in clusters with indices m < m̃(k), the interference
the two approaches with the corresponding resource alloca- from the UEs in the same cluster, and the signal of the
tion techniques is evaluated in Section V for the uplink, and k-th UE; and the third term includes the interference from the
in Section VI for the downlink. Finally, Section VII draws the UEs in clusters with indices m > m̃(k). In the denominator,
conclusion. the terms are similar except for the fact that the signal of the
k-th UE is not included.
II. SYSTEM MODEL Analogously, in the case of the downlink, Dirty Paper Cod-
In this work, a cell-free massive MIMO system with clus- ing (DPC) [6] is considered. This coding technique consists
ters is considered. In particular, we define the system S = of adapting the codebooks of the transmitter to a known
{{Lm }Mm=1 , {Km }m=1 } as a system with M clusters of single-
M
interference in such a way that the interference does not affect
antenna APs and UEs. The APs and UEs in the m-th cluster the achievable rate. Therefore, the CPU of the AP cluster can
are those in Lm and Km , respectively. We assume that the UEs (i) encode the signal of the first UE, (ii) treat this signal as a
are exactly in one of those clusters, although APs can be in known interference for the second UE, and so on. In this case,
several clusters, i.e., Km ∩ Kn = ∅ for all m ̸ = n, but this the first UE sees the signal from the other UEs as interference,
is not necessarily true for any pair of sets in {Lm }M m=1 . The the second UE sees the signal from the third UE onwards
amount of APs and UEs in the system is L = | ∪M m=1 Lm | and as interference, etc. As in the case of SIC, this encoding
K = | ∪M m=1 mK |, respectively. technique is known to be capacity-achieving if all the sources
Let m̃(k) be the cluster the k-th UE belongs to. Without of interference have a known covariance. Some works found
loss of generality, we index the UEs in the same cluster an uplink-downlink duality when SIC and DPC are used [7].
continuously, i.e., the UEs in the m-th cluster are those with In particular, the achievable rates are the same. In [7], it is
indices k̃(m − 1) + 1, k̃(m − 1) + 2, . . . , k̃(m), where k̃(m) is shown that the DPC encoding order that achieves certain
the largest index of the UEs in the m-th cluster, and k̃(0) = 0. achievable rates is the reversed SIC decoding order of the dual
The channel between the k-th UE and the APs of the m-th uplink system. Following this principle, we assume that the
cluster is denoted by Hkm ∈ C|Lm | , where |Lm | is the amount encoding order is the reversed order considered for the uplink,
of APs in the m-th cluster. i.e., the encoding follows the increasing order of the UEs’
Interference cancellation techniques are used for the set indices. Therefore, the achievable data rate of the k-th UE
of UEs in each cluster. In particular, some interference can- in the downlink with an identity noise covariance is as in (2),
cellation is performed between the UEs 1, 2, . . . , k̃(1), also shown at the bottom of the next page, where Qk is the transmit
between the UEs k̃(1) + 1, . . . , k̃(2), etc. covariance matrix of the k-th UE. In (2), the interference has
As a performance measure of the system, the total sum data also been divided into two terms according to its origin, and
rate is calculated for both uplink and downlink. In the case in a similar manner as in (1). Since single-antenna UEs are
of the uplink, Successive Interference Cancellation (SIC) [4] considered, it is not needed to include determinants in the
is considered in the decoding. This algorithm consists of equation.
Pk ∗
PK
|I + P i=1 Him̃(k) Him̃(k) + P H H∗ |
i=k̃(m̃(k))+1 im̃(k) im̃(k)
RUL
k = log . (1)
+ P i=1 Him̃(k) Hi∗m̃(k) + P K
Pk−1
H∗ |
P
|I H
i=k̃(m̃(k))+1 im̃(k) im̃(k)
III. ANALYSIS OF THE SCALABILITY the amount of UEs, specially considering deployments with
Cell-free massive MIMO systems in their canonical form rely more APs than UEs like in cell-free massive MIMO systems.
on the cooperation of the L APs to serve the K UEs, i.e., Second, real systems have access control mechanisms to
M = 1. This implies that, as the network grows, so do the avoid this situation. In this sense, we wonder whether it is
computational requirements of the network and the volume reasonable to criticize the scalability of the solution in [3]
of data and signaling that has to be exchanged between each because the amount of UEs served by an AP is not explicitly
AP and each UE. Due to this, these systems are said to be limited.
non-scalable. It is therefore important to find solutions with
superior performance and good scalability with the network B. PROPOSED DEFINITION
size. To do this, it is important to agree on the main features From our point of view, the definition of scalability should
that a scalable system should satisfy. Then, we could design take into account the system viability in terms of computation
systems with such features. In this section, we analyze a scal- and resource requirements as the system increases in size.
ability definition from the state of the art, and subsequently, To do this, we have to discuss first when a system is viable.
we propose a new definition. In this context, we can define viability using the system denial
of service probability, i.e., the probability that a UE cannot be
A. STATE-OF-THE-ART DEFINITION served by the system due to the lack of computational, front-
In [8], the authors formally defined that the network is scal- haul or radio resources. Let Dk be the denial of service prob-
able if the following tasks have finite complexity and resource ability for the k-th UE. Then, the denial of service probability
requirements for each AP as the number of UEs, K , tends of the system S is defined as
to ∞: ′
K
1) Signal processing for channel estimation; 1 X
2) Signal processing for data reception and transmission; PD (S) = lim Dk . (3)
′
K →K K′
k=1
3) Front-haul signaling for data and CSI sharing;
4) Power control optimization. This definition is valid for both the case K = ∞, and the
In [3], a solution was proposed and considered to be fully case K < ∞.PIn the latter, this definition can be rewritten as
scalable. The solution is based on the definition of fixed PD (S) = K1 K k=1 Dk . We say that a system is viable if the
clusters of APs, each of them connected to a different CPU. denial of service probability is below a maximum threshold,
Each UE selects the best-server AP and a set of serving i.e., PD (S) ≤ Pmax < 1.
APs, up to a maximum quantity of APs. The CPU of the Let Sm = {{Lm }, {Km }} be a subsystem of the system S
best-server is in charge of the channel estimation and the composed of the m-th cluster alone, whose denial of service
reception/transmission processing. This processing and chan- probability is PD (Sm ). Considering that the m-th cluster is
nel estimation is done independently for each UE and takes viable, i.e., PD (Sm ) ≤ Pmax , more clusters can be added with-
into account only the serving APs. Therefore, even in the out affecting the viability of the system if they are designed to
worst case (all UEs require the maximum amount of serving meet the same criterion. This is explained by the fact that the
APs), the total resource requirements of this solution increase availability of computational, front-haul, and radio resources
linearly with the amount of UEs in the network. When the of a cluster does not depend on the rest of the system. It is
network grows, all, the amount of UEs, APs, and CPUs grow worth noticing that with the increase in UEs, these may
with the same proportion and, hence, both the amount of UEs experience a deterioration of the received signal but this is
and the available resources grow with the same proportion. caused not by a reduction in the radio resources available, but
As a consequence, the system is scalable. However, this by interference. Considering that for the k-th UE the denial
scalability claimed in [3] was questioned by [8], since [3] of service probability is Dk = PD (Sm̃(k) ), we say that the
does not explicitly limit the number of UEs that can connect system is viable since following (3), we get, PD (S) ≤ Pmax .
to one AP. In other words, if we design viable clusters, the overall system
This raises the question of whether all the criteria in the composed of those clusters will also be viable, and certainly,
scalability definition in [8] are necessary in practice. For scalable.
instance, although one can argue the necessity of limiting the We now look at the canonical cell-free massive MIMO
amount of UEs that can be served by one AP, the reality is that case, where a unique CPU manages the reception and trans-
many works in the literature dealing with resource allocation mission to all UEs, and collects CSI information from all APs.
problems do not consider this limitation. This is due to two Taking into account that the CPU computational capabilities,
main reasons. First, it is unlikely to saturate the APs by as well as the front-haul resources, are limited, the number
Pk̃(m̃(k)−1) ∗
Hk m̃(i) Qi Hk m̃(i) + K ∗
P
1 + i=1 i=k Hk m̃(i) Qi Hk m̃(i)
RDL
k = log Pk̃(m̃(k)−1) ∗ . (2)
Hk m̃(i) Qi Hk m̃(i) + K ∗
P
1 + i=1 H Q H
i=k+1 k m̃(i) i k m̃(i)
previous
√ section,
√ the AP clusters are assumed to be squares of
√ |L m |× |L
√ m | APs, whereas the cluster cores are squares of preserve the scalability of the system, suboptimal strategies
|Lm |/2 × |Lm |/2 APs at the center of the corresponding should be considered. One of the options is to allocate power
cluster, for m = 1, . . . , M . equally as will be detailed later in Section VI-A for the UaC
In Figure 6 the achievable data rate per UE for a cluster solution. Another option would be to design a scheme to
average size of 256 APs is shown for the UaC, the DC and the make a fixed pre-allocation of power and then optimize the
mixed solutions. The curve of the UaC solution was obtained power allocation in each cluster. However, there is no clear
following the procedure explained in Figure 4. As can be criterion for such pre-allocation. Due to this and in order
noticed, the performance with the mixed solution is better to simplify the subsequent analysis, we will not consider a
than the UaC and the DC solutions. Therefore, in this case, mixed solution, which requires the APs to pre-allocate power
the best performance is not obtained by only (i) avoiding the to all the clusters they belong to.
cluster border effects or (ii) canceling all the interference of
UEs with the master AP in the cluster. In this case, we need to A. USER-AT-CENTER
use a solution that mixes the two clustering approaches. In any According to [8], in the downlink, the AP clusters are created
case, the DC solution is the most similar to the mixed solution. as explained in Section V-A. We recall that this clustering
This highlights the importance of canceling interference. solution produces overlapping AP clusters that serve only one
UE each. Moreover, since the APs serve a maximum of α
VI. ANALYSIS OF THE CLUSTERING SOLUTIONS UEs (or belong to a maximum of α clusters), they split their
FOR THE DOWNLINK available power into this amount of UEs. This implies that the
In the uplink, the APs can be considered passive receivers, APs pre-allocate a certain amount of power for the different
since they just need to send the signal received to the clusters they belong to. Letting P be the available power at the
corresponding CPUs. However, in the downlink, the APs are APs, they use a maximum of P/α for each UE or cluster [8].
actively transmitting so the available power needs to be allo- The actual power used to transmit to the UEs is obtained by
cated to each of those transmissions. This power allocation multiplying P/α by a normalized precoding vector, which,
becomes more challenging if the clusters are overlapping. in the case of the k-th UE, is computed as
In that case, as each AP could be connected to more than
one CPU, all the CPUs involved would have to cooperate Hkk
wk = . (6)
and this could lead to a non-scalable system. In order to |Hkk |
Then, the power used by the l-th AP of the k-th cluster to limited available power in the APs. Mathematically, for the
transmit to the k-th UE is cluster m = 1, we want to solve
P P k̃(1)
|wkl |2 ≤ , (7) X k̃(1)
α α max RDL-2
k ({Qi }i=1 ),
k̃(1)
which indicates that the total power used to transmit to the {Qk }k=1 k=1
UEs by all APs is, indeed, P/α. This power allocation, which s.t. Qk ⪰ 0, k = 1, . . . , k̃(1),
is proposed in [8], ensures the APs do not use more power
k̃(1)
than P, although in a very conservative way. In particular, X
on average, the APs use significantly less power than P qkl ≤ P, l = 1, . . . , N , (11)
to transmit to all the UEs. In order to illustrate this fact, k=1
we assume that each AP serves exactly α UEs. Then, the where qkl is the l-th element of the main diagonal of Qk .
average amount of APs serving each UE is To find a solution for (11), we can use the algorithm proposed
in [10], which is based on [11]. With the optimum transmit
Lα
Lserving = . (8) covariance matrices in (11), the APs use all their available
K power. It is worth recalling that this solution is optimum
As mentioned before, the total power used to transmit to
locally, i.e., for one isolated cluster. The use of all the avail-
each UE is P/α, which is split into the APs serving this
able power can be dangerous in terms of the interference
UE. Therefore, the average power used per AP and UE is
caused to other clusters. Due to this, we also consider a
P/αLserving . Since the APs serve α UEs, the average total
different resource allocation.
power used per AP is
For the alternative resource allocation, we consider that
P PK the available power can be shared among all the APs in
α= . (9)
αLserving Lα the cluster. By doing this, the optimum covariance matrices
We consider here two numerical examples. If α = 4, contain, in the eigenvectors, the optimum precoding matri-
K = 256, and L = 1024, the average total power used per ces without the limitation of the power constraints. These
AP would be P/16, and if α = 8, K = 64, and L = 1024, precoding matrices define the optimum power distribution
the average drops to P/128. This conclusion is quite striking, to maximize the data rate. Due to this, we propose to use a
as it shows that a large part of the available power is unused. scaled version of these covariance matrices. The scaling is
necessary to ensure that the power constraints are met. More
B. DISJOINT CLUSTERS specifically, we solve
As in the case of the uplink, we can also consider the creation
k̃(1)
of disjoint clusters in the downlink with size |Lm | = N for X k̃(1)
all m. As discussed before, the resource allocation in each max RDL-2
k ({Qi }i=1 ),
k̃(1)
{Qk }k=1 k=1
cluster should not be conditioned to the particular allocation
performed in neighboring clusters. If not, the effect of each s.t. Qk ⪰ 0, k = 1, . . . , k̃(1),
cluster would propagate throughout the scenario, making the k̃(1)
X
resource allocation not scalable. In order to avoid this scala- Qk ≤ NP. (12)
bility problem, we propose to perform the resource allocation k=1
in each cluster as if they were isolated from the rest, i.e., k̃(1)
assuming that the interference from the other clusters do not Let {Q̄k }k=1 be the optimum solution of (12). In general,
affect their UEs. Without loss of generality, we are going to these covariance matrices do not satisfy the individual power
particularize this resource allocation for the first cluster, i.e., constraints in each AP. Due to that, we compute
m = 1. In this case, and taking into account that we neglect P
the interference from other clusters, the achievable rate of the
Qk = P Q̄k , k = 1, . . . , k̃(1), (13)
k̃(1)
maxl k=1 q̄kl
k-th UE is, using (2),
P
1 + Hk1∗ k̃(1) where q̄kl is the l-th element of the main diagonal of Q̄k .
k̃(1) i=k i Hk1
Q
k̃(1)
RDL-2
k ({Qi }i=1 ) = log Pk̃(1) , (10) The covariance matrices {Qk }k=1 in (13) satisfy the power
∗
1 + Hk1 i=k+1 Qi Hk1 constraints in each AP and, since they are scaled versions
k̃(1)
of {Q̄k }k=1 , they have the same eigenvectors and, hence,
for k = 1, . . . k̃(1). The rate expression in (10) is the one used
they define the same precoding matrices. The optimization
for the resource allocation, i.e., the computation of the trans-
k̃(1) problem in (12) can be solved with the algorithm presented
mit covariances {Qi }i=1 . However, the actual achievable rate in [10], or with other techniques that make use of the uplink-
is shown in (2), which takes all the interference into account. downlink duality like in [12] and [7].
The expression in (10) is the one used in the following section
to compute the achievable rate in the figures. C. COMPARISON OF SOLUTIONS FOR DOWNLINK
To obtain the transmit covariances, we propose to maxi- In this section, we study the performance of the two pre-
mize the sum data rate in the cluster taking into account the vious clustering solutions in the same scenario described in