Deep Reinforcement Learning Based Resource Allocation in Delay-Tolerance-Aware 5G
Industrial IoT Systems
Problem Solved:
This paper investigates the resource allocation problem under delay tolerance constraints in
5G-enabled IIoT. Specifically, the authors propose a traffic prediction algorithm that supplies
future traffic information to the allocation algorithms. They then design two allocation
algorithms: one that minimizes PRB usage, and one that jointly optimizes PRB and power
allocation; in both cases, delay tolerance is treated as a constraint.
Introduction
Industrial Internet of Things (IIoT) is a network that deeply integrates communication technology
with traditional industrial manufacturing. In IIoT, nodes have strict quality of service (QoS)
requirements, and 5G technology provides important technical support for nodes to guarantee QoS.
Since there are usually many nodes with different QoS in IIoT, a reasonable resource allocation
algorithm is needed to guarantee the QoS of each node [2]. Using network slicing technology, nodes
are divided into different slices according to their QoS so that network resources can be allocated
and managed effectively, which addresses the problem of heterogeneous services. IIoT has stricter requirements on the
delay and reliability of data transmission. In the industrial production process, the failure of data to
arrive in time or even packet loss will cause unpredictable and serious consequences to the
production process and even personal safety. Besides ensuring the reliability of data transmission,
reducing power consumption is also crucial for IIoT systems. Keeping the network transmission
power low is an effective way to reduce power consumption in industrial production, and
maintaining low power consumption is an important way to extend the battery life of remote
industrial devices. According to the Shannon formula, both power and physical resource blocks (PRBs)
can directly affect the data transmission speed, thereby affecting the transmission delay. There is
strong coupling among reliability of data transmission, power, and PRBs. By designing resource
allocation algorithms related to PRBs and power, the reliability requirements of the IIoT system can
be met.
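Since power and PRBs both enter the Shannon rate, their coupling with the transmission delay can be illustrated with a small sketch (the numbers and the simplified SNR model are illustrative assumptions, not values from the paper):

```python
import math

def transmission_delay(packet_bits, num_prbs, prb_bandwidth_hz,
                       power_w, channel_gain, noise_w):
    """Illustrative Shannon-capacity model: the rate grows with both the
    number of PRBs (bandwidth) and the transmit power, so either resource
    can be spent to meet a delay deadline."""
    snr = power_w * channel_gain / noise_w
    rate_bps = num_prbs * prb_bandwidth_hz * math.log2(1.0 + snr)
    return packet_bits / rate_bps  # seconds

# Doubling the PRBs halves the transmission delay at fixed power...
d1 = transmission_delay(1e4, 2, 180e3, 0.1, 1.0, 1e-3)
d2 = transmission_delay(1e4, 4, 180e3, 0.1, 1.0, 1e-3)
# ...while doubling the power also helps, but only logarithmically.
d3 = transmission_delay(1e4, 2, 180e3, 0.2, 1.0, 1e-3)
```

This is why the two resources are strongly coupled: extra bandwidth scales the rate linearly, while extra power gives diminishing returns inside the logarithm.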
System Model
The downlink (the direction in which data is transmitted from a higher-level network
component, such as a cellular base station, to a lower-level device, such as a smartphone)
of a 5G IIoT system is considered, which consists of a single small-cell Base Station (BS)
(a small base station that provides 5G network coverage to a specific area and serves as a
hub for communication with the devices within its range), N mobile nodes and M network
slices.
The downlink communication bandwidth is partitioned into K PRBs (the smallest units of
bandwidth that can be allocated to different communications in the network), and each
block has a bandwidth of B MHz.
Each node in the BS corresponds to a queue, and the data to be sent to the node is buffered
in the queue. Nodes with the same delay tolerance are grouped into the same slice, and the
delay deadline of slice m is defined as D_m^max. If the delay of data in the queue exceeds
the deadline D_m^max, the data is discarded.
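The deadline-based discard rule can be sketched as a simple per-node queue (a hypothetical illustration; the paper's slot and packet model may differ in detail):

```python
class NodeQueue:
    """Per-node buffer at the BS: packets whose queueing delay exceeds the
    slice deadline D_m^max are discarded (a reliability violation)."""
    def __init__(self, deadline_slots):
        self.deadline = deadline_slots
        self.packets = []          # list of (arrival_slot, size)
        self.dropped = 0

    def enqueue(self, slot, size):
        self.packets.append((slot, size))

    def expire(self, current_slot):
        """Drop packets that have waited longer than the deadline."""
        fresh = [(t, s) for (t, s) in self.packets
                 if current_slot - t <= self.deadline]
        self.dropped += len(self.packets) - len(fresh)
        self.packets = fresh
```

The allocation algorithms then try to assign enough PRBs that packets are served before `expire` would drop them.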
At each slot, k_n PRBs are assigned to node n, under the constraint that the total number of
assigned PRBs does not exceed K. In addition, the channels between the BS and the nodes are
assumed to be block fading: the channel gain remains constant within each slot but varies
independently between slots. In this work, the channel gain h_n(t) between the BS and node n
at slot t is modeled as a Rayleigh fading channel gain, and there is only one traffic flow
between a node and the BS.
The total end-to-end delay of slice m includes the transmission delay, propagation delay,
queueing delay and processing delay. The BS is assumed to support 5G communication and to
be equipped with powerful servers, so the propagation delay and processing delay can be
ignored.
In this paper, delay tolerance is used to measure the reliability of the network. It reflects
the minimum requirement for data to meet the end-to-end delay deadline, which ensures the
normal operation of the system. In order to buffer the incoming traffic data to be sent and
to measure the queueing delay, a queue is maintained for each node.
The objective is to minimize the number of PRBs used while keeping slices isolated from each
other and meeting the delay tolerance constraints. Reducing PRB usage improves resource
utilization; in addition, the saved PRBs can allow more nodes to access the network or be
scheduled for other tasks.
1. They combine CNN, Bi-LSTM and an attention mechanism to design the traffic prediction algorithm CBL-A.
B. Attention Mechanism:
Without an attention mechanism, all features are given the same weight, so the neural
network needs more training time to learn which features matter. In this work, the
Squeeze-and-Excitation (SE) block from [26] is used as the attention mechanism. The SE
block consists of two main parts: squeeze and excitation.
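A minimal NumPy sketch of the SE block's two stages, assuming a (channels, timesteps) feature map and a two-layer bottleneck (the weight shapes and reduction ratio are illustrative, not taken from [26]):

```python
import numpy as np

def se_block(features, w1, b1, w2, b2):
    """Squeeze-and-Excitation over a (channels, timesteps) feature map.
    Squeeze: global average pooling per channel.
    Excitation: two FC layers (ReLU then sigmoid) yield per-channel weights."""
    squeezed = features.mean(axis=1)                   # (C,) channel summary
    hidden = np.maximum(0.0, w1 @ squeezed + b1)       # ReLU bottleneck
    scale = 1.0 / (1.0 + np.exp(-(w2 @ hidden + b2)))  # sigmoid in (0, 1)
    return features * scale[:, None]                   # reweight channels
```

Because the sigmoid outputs lie in (0, 1), the block can only attenuate less-informative channels relative to the important ones, which is what lets the downstream network focus its capacity.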
LSTM is widely used in time-series forecasting because of its gate structure, which controls
the time scale of information flow and effectively alleviates the vanishing and exploding
gradient problems of traditional recurrent neural networks (RNNs). There are three gates in
an LSTM cell.
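The three gates can be illustrated with a single NumPy LSTM step (the textbook formulation, not the paper's exact network):

```python
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_step(x, h, c, W, U, b):
    """One LSTM step. The stacked rows of W, U, b hold the forget, input and
    output gates plus the candidate cell update; the gates control how long
    information is retained, which is what mitigates vanishing/exploding
    gradients."""
    z = W @ x + U @ h + b          # (4H,) pre-activations
    H = h.shape[0]
    f = sigmoid(z[0:H])            # forget gate: how much old state to keep
    i = sigmoid(z[H:2*H])          # input gate: how much new input to admit
    o = sigmoid(z[2*H:3*H])        # output gate: how much state to expose
    g = np.tanh(z[3*H:4*H])        # candidate cell state
    c_new = f * c + i * g
    h_new = o * np.tanh(c_new)
    return h_new, c_new
```

Because the cell state `c` is updated additively rather than through repeated matrix multiplication, gradients can flow over many slots of traffic history.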
2. Aiming at the optimization goal of minimizing the number of PRBs used while meeting the
delay tolerance constraints, a hierarchical resource allocation algorithm is proposed that
combines a deep reinforcement learning (DRL) algorithm with a heuristic algorithm. A
hierarchical architecture is a common way to reduce algorithmic complexity in dynamic
resource allocation.
Deep Q-Networks (DQN), a deep reinforcement learning method, are commonly used for
discrete action scheduling and optimize the policy through interaction between the agent
and the environment. A DQN agent contains two networks, the evaluation network and the
target network. The two networks have the same structure, and the parameters of the
evaluation network are copied to the target network every few steps; this two-network
design makes training iterate in a more stable direction. Besides, an ε-greedy strategy is
used to select actions in DQN. Dueling network architectures for DRL adopt an advantage
function, which makes Dueling DQN more accurate when estimating Q-values, so Dueling DQN
can learn faster than DQN.
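The dueling aggregation step can be sketched as follows (this is the standard formulation; the paper's network layout is not detailed here):

```python
import numpy as np

def dueling_q_values(value, advantages):
    """Dueling aggregation: Q(s, a) = V(s) + A(s, a) - mean_a A(s, a).
    Subtracting the mean makes V and A identifiable, so the state value is
    learned separately from the per-action advantages."""
    return value + advantages - advantages.mean()

q = dueling_q_values(2.0, np.array([0.5, -0.5, 1.0]))
```

Note that the subtraction leaves the action ranking untouched (it depends only on the advantage stream), while the mean of Q over actions recovers the state value V.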
The size of the action space equals the number of ways of allocating between 0 and K PRBs
across all nodes. As the number of nodes increases, the action space grows rapidly, which is
detrimental to DRL training: it increases the training time and difficulty and can even
prevent convergence. Therefore, a heuristic algorithm called the PRBs scheduling policy (PSP)
is proposed and combined with D3QN to effectively reduce the size of the action space.
D3QN is responsible for allocating PRBs to each slice, and the PSP then distributes each
slice's PRBs to its nodes.
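Since the PSP's internal rules are not spelled out here, the hierarchical split can be illustrated with a hypothetical greedy policy: the D3QN agent would pick only the per-slice budget `slice_prbs`, and a PSP-like heuristic hands the blocks to nodes inside the slice (the function name and the one-PRB-drains-one-backlog-unit assumption are illustrative):

```python
def psp_distribute(slice_prbs, node_backlogs):
    """Hypothetical sketch of a PRB scheduling policy (PSP): within one
    slice, hand out the slice's PRB budget one block at a time to the node
    with the largest remaining backlog. The DRL agent only chooses
    slice_prbs, so its action space no longer grows with the node count."""
    remaining = list(node_backlogs)
    alloc = [0] * len(node_backlogs)
    for _ in range(slice_prbs):
        n = max(range(len(remaining)), key=lambda i: remaining[i])
        if remaining[n] <= 0:
            break                  # nothing left to send in this slice
        alloc[n] += 1
        remaining[n] -= 1          # assume one PRB drains one backlog unit
    return alloc
```

The key design point is the factorization: the exponential per-node action space is replaced by a small per-slice action for the agent plus a cheap deterministic rule per slice.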
3. They propose a dynamic allocation algorithm that simultaneously allocates PRBs and power to
minimize the weighted sum of PRB usage and power consumption, achieving a balance between
resource utilization and power consumption.
The proposed algorithm minimizes the weighted sum of the PRBs and power allocated to
each node n, under the delay tolerance constraints.
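Under the description above, the joint objective can be written, with assumed notation (weight ω, PRBs k_n, power p_n, per-node delay d_n, slice deadline D_m^max), as something like:

```latex
\min_{\{k_n,\ p_n\}} \; \sum_{n=1}^{N} \bigl( \omega\, k_n + (1-\omega)\, p_n \bigr)
\quad \text{s.t.} \quad d_n \le D_m^{\max} \ \ \forall n \in \text{slice } m,
\qquad \sum_{n=1}^{N} k_n \le K .
```

This is a plausible formalization consistent with the text, not the paper's exact formulation.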
Simulation Results
Simulation Results and Analysis for Traffic Prediction:
As the figures and tables show, the prediction algorithm proposed in this paper has the
smallest prediction error and the best overall prediction performance, reducing the RMSE
by up to 3%.
It can also be seen that the algorithm follows the expected trend: when the bandwidth of
each PRB is large, PRB usage is low, and when the bandwidth is small, PRB usage is high.
BDQ is compared only with MDQN. Fig. 8 shows that when the weight is large, i.e., power
usage dominates the objective, the resulting power values are generally low; and when the
weight is small, i.e., PRB usage dominates, the PRB usage is small. This demonstrates that
the proposed algorithm can balance resource utilization and power consumption through a
weight that can be adjusted to the scenario.
Conclusion
The traffic prediction algorithm CBL-A is composed of CNN, attention mechanism and Bi-
LSTM.
A two-layer dynamic resource allocation algorithm is proposed based on the traffic
prediction results. The first layer uses D3QN to allocate PRBs for each slice, and the second
layer uses a heuristic algorithm to allocate PRBs for nodes, to minimize the usage of PRBs.
A dynamic resource allocation algorithm based on branch structure is proposed, which
divides PRBs and power allocation into different branches, reducing the complexity of the
action space, to minimize the weighted sum of PRBs usage and power.
Simulation results indicate that the traffic prediction algorithm achieves higher accuracy
than the baseline algorithms. The D3QN-PSP algorithm leads to higher resource utilization
and faster convergence. The BDQ algorithm can adapt to the dynamic resource allocation
problem under a large action space and realize the balance between resource utilization
and power consumption.
In addition to the proposed algorithms, DRL methods that support continuous action spaces
are also a potential way to solve the optimization problem, e.g., designing resource
allocation algorithms using Proximal Policy Optimization.