Congestion Control Prediction Model For 5G Environment Based On Supervised and Unsupervised Machine Learning Approach

Received 17 May 2024, accepted 27 May 2024, date of publication 19 June 2024, date of current version 9 July 2024.

Digital Object Identifier 10.1109/ACCESS.2024.3416863

Congestion Control Prediction Model for 5G Environment Based on Supervised and Unsupervised Machine Learning Approach
MOHAMMED B. M. KAMEL 1,2, IHAB AHMED NAJM 3, AND ALAA KHALAF HAMOUD 4
1 Department of Computer Science, University of Kufa, Najaf 54003, Iraq
2 Department of Computer Algebra, Eötvös Loránd University (ELTE), 1053 Budapest, Hungary
3 Department of Mathematics, University of Tikrit, Tikrit 43000, Iraq
4 Department of Cybersecurity, University of Basrah, Basrah 61004, Iraq

Corresponding author: Mohammed B. M. Kamel

ABSTRACT With the emergence of 5G technology, congestion control has become a vital challenge that must be
addressed to achieve efficient communication. Several congestion control models have been proposed to
control and predict possible congestion in 5G networks. However, finding the optimal congestion control
model is an important yet challenging task. In this paper, we examine supervised and unsupervised machine
learning approaches to the task of predicting the node that may cause congestion in the 5G environment.
Due to the huge variance in the domains of the data set columns, measuring the prediction's consistency was
not an easy task. During our study, we tested twenty-six supervised and seven clustering algorithms. Finally,
based on the performance criteria, we identified the five best algorithms among those studied.

INDEX TERMS Machine learning, congestion control, 5G, supervised ML, unsupervised ML.

The associate editor coordinating the review of this manuscript and approving it for publication was Bilal Khawaja.
2024 The Authors. This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 License. For more information, see https://creativecommons.org/licenses/by-nc-nd/4.0/
VOLUME 12, 2024

I. INTRODUCTION

Compared to previous network generations, 5G networks have higher speeds, lower latency, and improved coverage. These features, and the resulting superiority over previous generations, have led to widespread adoption [1]. Because of this adoption and the large number of nodes joining the network, many new challenges have arisen, especially in the area of congestion control [2]. The goal of a routing algorithm is to choose the best possible path and avoid any potential congestion; yet, it may result in additional costs during the routing process [3]. Because it can cause severe delays and lower throughput, congestion during 5G routing decisions is critical.

Several studies have implemented various congestion control approaches in the 5G environment [4], [5]. Among the features of 5G networks that reduce congestion is the ability to dynamically distribute available resources, including frequencies and bandwidth. Network slicing [6] and edge computing [7], which allow traffic-based optimization of 5G networks, may be utilized to achieve this. Implementing Quality of Service (QoS) techniques ensures that critical services remain unaffected by current traffic [8]. Important traffic, like emergency services, is assigned a higher priority, and resources in the 5G network are allocated appropriately. Another congestion control mechanism is traffic offloading, which transfers the data traffic to Wi-Fi [9] or other networks. The offloading is done to decrease the load on 5G networks, thus minimizing congestion.

In addition to the discussed approaches, applying machine learning (ML) algorithms has shown positive results in controlling network congestion [10]. While unsupervised ML algorithms are trained with unlabeled data, supervised ML algorithms are trained with labeled data [11]. In order to control congestion, both supervised and unsupervised algorithms are trained to identify possible congestion nodes as well as the optimal congestion control window. Classification is an essential part of supervised ML, where data items are grouped

into classes based on the class label information. On the other hand, clustering is an essential part of unsupervised ML, in which similar data items are categorized into clusters without the information of class labels. The adoption of supervised and unsupervised algorithms is dependent on several factors including data type and size, complexity, and accuracy.

Our paper highlights the importance of adopting and utilizing machine learning algorithms in the process of congestion control in the 5G environment and identifies the top algorithms in the process of congestion control prediction. The task of finding the optimal algorithm to be adopted for congestion control is challenging. We aimed to find the algorithm that best predicts the node causing congestion during the congestion control process in 5G networks. In our study, we tested twenty-six supervised and seven unsupervised algorithms. Unsupervised machine learning algorithms have been used for classification. The approach of classification via clustering is utilized to improve the accuracy of congestion control prediction by clustering data to identify distinct groups of data, which can then be used to enhance the classification process. Cronbach's alpha has been used to measure the consistency, unidimensionality, or homogeneity of datasets. The studied algorithms were evaluated based on performance criteria, including True Positive (TP) and False Positive (FP) rates, precision, recall, Receiver Operating Characteristic (ROC), and Area Under Curve (AUC). Fig. 1 shows the main steps of the congestion control prediction model.

FIGURE 1. Main steps in ML-based congestion prediction.

The rest of this paper is structured as follows. In Section II, we review the related works in the field of congestion control. Section III discusses machine learning and congestion control in detail. The model setting is stated in Section IV. Our findings are explained and analyzed in Section V. We point out our observations in Section VI. Finally, Section VII includes our conclusions.

II. LITERATURE REVIEW

Many studies have handled the congestion control approach. Sangeetha et al. [12] proposed a model based on data loss and energy reduction since congestion appears in all WSNs. The sensor nodes' topology is adjusted regularly based on node degree and time interval to enhance the node's power consumption and interference and to provide a better and more effective energy congestion-aware technique for routing in WSN, which is called survival path routing (SPR). This protocol is used by IoT applications in high-traffic networks where all nodes try to send their packets simultaneously to destination nodes [13]. A new algorithm for congestion control for WSNs is developed by Singh et al. [14], where a simplified Poisson process is used and the optimal rate is obtained by retransmitting with congestion control, while the old algorithm had a high complexity and high power usage. Subsequently, many studies evaluated the performance of congestion control mechanisms over the 5G network [15] in terms of resource allocation [16], network selection, network scalability [17], and distributed telemetry [18]. To reduce network congestion, enhance the lifetime of the network and individual nodes, and reduce network divisions, Shelke et al. [19] proposed a routing algorithm that selects the best route by combining appropriate sleep scheduling mechanisms based on the opportunistic theory. Godoy et al. [20] analyzed and investigated the communication channel congestion in the environment based on configuration parameters of nodes such as the generation rate of the data packet, intervals of transmission time, and power level of transmitter output. Najm et al. [21] proposed a multi-criteria decision-making mechanism to improve congestion control in 4G networks. Braham et al. [22] proposed an efficient and fair distributed algorithm for congestion control in tree-based communication WSNs to assign transmission rates for each node. The study lacked a performance comparison with previous traditional algorithms to see if it was optimal or not.

Although the next scenario was poor and simple, applying machine learning algorithms, especially supervised ML, to improve congestion control in wireless or wired networks is considered a vital approach. Machine learning algorithms can be adopted in many fields [23], [24], [25], [26], [27], [28], [29], [30], [31], [32], [33] to predict the required knowledge. Geurts et al. [34] proposed a model based on an automatic loss classifier based on a simulated database of random topologies of networks. Jagannathan and Almeroth [35] proposed a model called TopoSense for multi-cast congestion control. Many enhancements were required, such as the poor calculation of link capacity and the need for calculating interval size. Moreover, there was a need to minimize control traffic and burst traffic.

Following the trend, machine learning capabilities have been utilized with congestion control algorithms in 5G environments. Several attempts have been presented, for instance, in an open radio access network, a fast increase in data based on artificial intelligence, and an adaptive routing control approach to obtain effective congestion avoidance [36], [37], [38]. A controller is proposed by Sunny et al. [39] to ensure the efficient and fair work of WLAN that has multi-cochannel access and improvement of long-lived multi-TCP AP transfer. Next, many researchers adopted DT in their studies of network applications. Katuwal et al. [40] proposed a model to solve the problem of multi-class classification based on the multi-classifier system. An efficient NN with oblique random forest DT is used to build the model. The model proved its efficiency based on the evaluation of 65 multi-class datasets compared with the evaluation of large or medium datasets. Gomez et al. [41] compared many ensemble algorithms of DT and proposed a new classifier based on its performance. The computed capacity for devices of a small network is not a limitation. A new model is proposed by Leng et al. [42] to solve the problem of congestion control flow table in a software-defined network (SDN) based on C4.5 DT. The flow entries are compared based on C4.5 DT to reduce the time and matching cost. Using the DT approach with an SDN flow

table was the first online machine learning model. Next, the clustering machine learning approach is used for localization and AP reselection. Liang et al. [43] proposed a model for WLAN that adopts a clustering algorithm and AP reflection. A review of the communication technology of machine-to-machine was conducted by Hasan et al. [44] to list all challenges and solutions for diverse standards of developing organizations. Liu and Wu [45] utilized the random forest algorithm for congestion control prediction, where some variables were utilized to build the model, such as type of day, road quality, time period, and weather conditions. Park et al. [46] proposed an approach utilizing Bayesian neural network and DT to predict the occurrence of incidents. Since the aim of the model is to reduce the potential incidents or any events that may cause these incidents, the model could not be implemented in real systems since it needs realistic parameters for training and building a dataset. An improved route based on the support vector machine (SVM) with DT is used to estimate the link quality over WSN, where Shu et al. [47] used two estimation parameters: link quality and the strength of the received signal. SVM is used in the model due to its ability to handle binary classifications.

For network infrastructure and data centers, DT was adopted as an energy-saving solution [48]. Soltani and Mutka [49] proposed an approach utilizing DT for best path selection in the cognitive radio networks. In this model, the nodes can find better nodes to send data to after analyzing the tree and removing the choices that reduce node gain. DT is utilized to interpret the routing path of cognitive video over a dynamic radio network. The optimal path from the leaf node to the root is determined based on background induction to construct and receive the transmitted video. The DT algorithm is also used by Stimpfling et al. [50] to build a model to enhance data structure size and memory access. DT is considered a strength since it reduces the searching time. Moreover, DT is used by Singh et al. [51] to build a model for vehicular traffic noise prediction. Four machine learning algorithms are used for model implementation: DT, ANN, generalized linear model, and random forest. The random forest approach was found to be a better algorithm for prediction compared with other algorithms. Xia et al. [52] used DT with a proposed delegation schema (CP-ABE) to enhance the efficiency of decryption for VANETs. The purpose of DT is to improve the factors used for estimating vehicle decryption overhead.

Based on the results, DT was a better option than K-nearest and SVM for prediction because of its higher precision and accuracy. Many researchers have highlighted network protection by utilizing the DT notion. For example, researchers in [53], [54] presented an unknown detection threat approach in the network via recognition threat features. Following the trends, Mohamed et al. [55] developed a flexible scheme for reducing the quantity of data transmitted across the smart grid, but the intended scheme missed mentioning the outcome of paradigm updates. Pham and Yeo [56] presented an adaptive and protected scheme for cars to control both confidentiality and trust in the utilized recognition scheme. Next, Fadlullah et al. [57] highlighted and explored the survey requirements of propagation techniques related to deep learning utilizations concerning numerous traffic network control characteristics. The leading edge of peak network communications, which are compromised by algorithms and architectures in deep learning, also encourages the motivation to facilitate deep learning to compromise the network's challenges. Nevertheless, their viewpoints did not include the 5G environment.

Furthermore, Kong, Zang, and Ma [58] developed dual machine-learning approaches to address TCP congestion control issues in under-buffered connections over the wired environment. A supportive and adaptive loss prediction was assigned to obtain a superior tradeoff delay. Taherkhani and Pierre [59] used the K-means algorithm to control congestion in VANET networks. It contains three sections for directing, detecting data congestion, and clustering communications. Next, the issue of prediction traffic status was settled by Chen et al. [60] by permitting DT and SVM to depend on enabling online data; however, the set value of both services was overridden.

Tariq et al. [61] presented a detection of botnet attacks by using the machine learning technique, regardless of the explanation of the carried packet. However, comprehensive calculations missed the stats plan. Wu et al. [62] implemented a developable machine learning method to predict or expose the limps of online video via feature extraction of monitored data in the network. The method defines characteristic features depending on diverse scale windows. The criterion