Diagnosis Based On Genetic Fuzzy Algorithms For LTE Self-Healing
Diagnosis Based On Genetic Fuzzy Algorithms For LTE Self-Healing
fully edited. Content may change prior to final publication. Citation information: DOI
10.1109/TVT.2015.2414296, IEEE Transactions on Vehicular Technology
1
Self-Organizing Networks (SON) mechanisms reduce Operational Expenditure (OPEX) in cellular networks, whilst enhancing
the offered quality of service. Within SON, self-healing aims to autonomously solve problems in the radio access network and to
minimize their impact on the user. Self-healing comprises automatic fault detection, root cause analysis, fault compensation and
recovery. This paper presents a root cause analysis system based on fuzzy logic. A genetic algorithm is proposed for learning the
rule base. The proposed method is adapted to the way of reasoning of troubleshooting experts, which ease knowledge acquisition
and system output interpretation. Results show that the obtained results are comparable or even better than those obtained when
the troubleshooting experts define the rules, with the clear benefit of not requiring the experts to define the system. In addition, the
system is robust, since fine tuning of its parameters is not mandatory.
Index TermsSelf-Organizing Networks, self-healing, troubleshooting, root cause analysis, genetic algorithms, supervised learning,
fuzzy systems
I. I NTRODUCTION
In the last decade, cellular mobile networks have grown
rapidly in size and complexity. Operation and maintenance
of these networks is a difficult challenge because of the evergrowing number of users and a demand for reliability. For this
reason, the NGMN Alliance [1] came up with the definition
of Self-Organizing Networks (SON) as a set of principles and
concepts to add automation to mobile networks so that they
require less maintenance than traditional networks while improving service quality. Subsequently, 3GPP proposed a set of
use cases and specific SON functionalities for next generation
mobile communication networks, that is Long Term Evolution
(LTE) [2], and identified SON as one of the key features
of LTE-Advanced (LTE-A) [3]. Three main aspects conform
the SON definition: self-configuration, self-optimization and
self-healing. This study contributes to the research effort in
self-healing [4] [5]. Self-healing encompasses the automatic
functionality aimed to solve problems (traditionally called
troubleshooting) as they emerge in the network. It is a substitute of manual troubleshooting, which currently is the only
way network problems are being confronted. Automatic troubleshooting is a great competitive advantage since offloading
the Operation and Maintenance (O&M) experts from the task
of solving common problems, especially related to the access
networks, greatly reduces Operational Expenditure (OPEX).
The reduced response time of automatic troubleshooting also
increases the quality of service.
Self-healing consists of fault detection, root cause analysis
(diagnosis), compensation and fault recovery. Recently, some
research effort has started to be carried out on these topics. However, so far there are no commercial products that
implement a totally automated troubleshooting tool, although
there is an emerging research effort into the topic. In [6] fault
detection is based on monitoring the throughput. A sector is
considered problematic if it falls under a certain threshold that
is calculated according to the recorded average throughput.
0018-9545 (c) 2015 IEEE. Translations and content mining are permitted for academic research only. Personal use is also permitted, but republication/redistribution requires IEEE
permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI
10.1109/TVT.2015.2414296, IEEE Transactions on Vehicular Technology
2
0018-9545 (c) 2015 IEEE. Translations and content mining are permitted for academic research only. Personal use is also permitted, but republication/redistribution requires IEEE
permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI
10.1109/TVT.2015.2414296, IEEE Transactions on Vehicular Technology
3
0018-9545 (c) 2015 IEEE. Translations and content mining are permitted for academic research only. Personal use is also permitted, but republication/redistribution requires IEEE
permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI
10.1109/TVT.2015.2414296, IEEE Transactions on Vehicular Technology
4
0018-9545 (c) 2015 IEEE. Translations and content mining are permitted for academic research only. Personal use is also permitted, but republication/redistribution requires IEEE
permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI
10.1109/TVT.2015.2414296, IEEE Transactions on Vehicular Technology
5
(1)
(2)
D. Parameters
The algorithm has 7 parameters:
Uncommon case sensitivity () (0, +): adjusts the
sensitivity of the algorithm to uncommon cases. A too
low value may disregard cases that are important although
they are uncommon. A too high value may give relevance
to cases that are wrongly diagnosed.
Minimum inclusion degree (M ID) (0, 1): minimum
score a rule must have to be part of the final rule set.
Minimum degree of activation (M DA) (0, 1): indicates
the minimum DoA of a rule on a case to consider that
it covers it.
Initial rules [1, +): Number of random initial rules.
If none of the initial rules survives the first iteration of the
algorithm, an extinction occurs and the algorithm stops.
0018-9545 (c) 2015 IEEE. Translations and content mining are permitted for academic research only. Personal use is also permitted, but republication/redistribution requires IEEE
permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI
10.1109/TVT.2015.2414296, IEEE Transactions on Vehicular Technology
6
A. Case study
In cellular networks, the O&M can provide historical
records containing the value of PIs and configuration parameters. However, after troubleshooting experts perform a
root cause analysis of problematic cells, the diagnosed cause
is normally not saved together with the value of the PIs.
Therefore, there are no existing databases of classified fault
cases in which to test new diagnosis systems, especially in
LTE, which is starting to be deployed in current networks.
Due to these reasons, and in order to be as close as possible
to the behavior of real networks (consequently, discarding
using a simulator), a network emulator was built based on the
knowledge of troubleshooting experts. The network emulator
is simply a case generator, i.e. it produces fault causes and
their associated PIs (labeled cases). The advantage of using
the network emulator is that it can provide as many cases
as needed, still being close to the real network behavior.
In order to define the LTE network emulator, the following
methodology was followed. Firstly, troubleshooting experts
defined the most common categories of fault causes and
related PIs for LTE. Secondly, the frequency of occurrence
of each fault cause was defined (prior probabilities). Finally,
the probability density function (PDF) of each PI conditioned
to each fault cause was modeled either as a normal or a beta
probability density function, which has been demonstrated to
be an adequate distribution in cellular networks [8]. The parameters of the distributions were defined by the experts. The
network emulator works by generating new cases according to
the defined prior and conditional probabilities. The network
emulator has been used to generate both training cases and
testing cases.
The defined fault categories, each covering a broader group
of causes, are the following:
TABLE I
P RIOR PROBABILITIES
Fault category
SW Problem
Coverage
Quality
Mobility
Proportion (%)
13
25
34
28
TABLE II
PI MODELING . PDF OF EACH PI CONDITIONED TO AN EXISTING PROBLEM
PI
Type
Acc.
beta
Ret.
beta
HOSR
beta
RSRP
norm
RSRQ
norm
avg
avg
Nor
2
0.1
17
0.5
4
0.02
-70
3
-6.5
1.1
Parameters/Cause
SW
Cov
Qual
12
1.391
450.3
3
0.028
23. 7
11.756
10
11
1.306
1.5
1.9
4.62
3
5
0.024
0.02
0.04
-75
-107
-72
6
5
7
-6
-10
-13
2
5
2
Mob
2
0.5
9
2
42.5
7.5
-80
10
-11
3
0018-9545 (c) 2015 IEEE. Translations and content mining are permitted for academic research only. Personal use is also permitted, but republication/redistribution requires IEEE
permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI
10.1109/TVT.2015.2414296, IEEE Transactions on Vehicular Technology
7
TABLE III
PARAMETER VALUES
Test
Nun
Np
(5)
Default
value
24
MID
0.2
MDA
0.4
0.5
reproduction ratio
MMF
Training cases
Only
problematic
Expert
elicited
rules
Live network example
Variable
Scalability test
Tested values
6, 12, 18, 24, 30, 36,
42, 48, 54, 60
0.1, 0.2, 0.3, 0.4, 0.5,
0.6, 0.7, 0.8, 0.9
0, 0.1, 0.2, 0.3, 0.4, 0.5,
0.6, 0.7, 0.8, 0.9, 1
0, 0.1, 0.2, 0.3, 0.4, 0.5,
0.6, 0.7, 0.8, 0.9, 1
1, 2, 3, 4, 5, 6, 7, 8, 9,
10
Only
problematic,
problematic + normal
Learned rules, expert
rules
Default and Enhanced
(=27,M ID=0, number of initial rules = 20,
M M F =20)
Default and Enhanced
(=42,M DA=0.1,
loops = 100)
(7)
Pn E f p
Pn Ef p + Pp (1 Eu )
(8)
C. Results
Test 1:
0018-9545 (c) 2015 IEEE. Translations and content mining are permitted for academic research only. Personal use is also permitted, but republication/redistribution requires IEEE
permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI
10.1109/TVT.2015.2414296, IEEE Transactions on Vehicular Technology
8
0.45
4
Diagnosis Error Rate
False Positive Rate
Undetected Rate
0.4
3.5
0.35
0.3
2.5
0.25
2
0.2
1.5
0.15
0.1
0.5
0.05
0
10
20
30
40
50
60
0.7
0.6
0.5
0.4
0.3
0.2
0.1
0
0.1
0.2
0.3
0.4
0.3
0.4
0.5
0.6
0.7
0.8
0.9
0.8
0.2
MID
0.9
0
0.1
0.5
0.6
0.7
0.8
0.9
MID
0018-9545 (c) 2015 IEEE. Translations and content mining are permitted for academic research only. Personal use is also permitted, but republication/redistribution requires IEEE
permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI
10.1109/TVT.2015.2414296, IEEE Transactions on Vehicular Technology
9
0.45
0.8
Diagnosis Error Rate
False Positive Rate
Undetected Rate
0.4
0.35
0.7
0.6
0.3
0.5
0.25
0.4
0.2
0.3
0.15
0.2
0.1
0.1
0.05
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
0
0
0.2
MDA
0.4
0.6
0.8
0.8
Reproduction Ratio
2.5
3.5
2
3
2.5
1.5
2
1
1.5
1
0.5
0.5
0
0.1
0.2
0.3
0.4
0.5
0.6
0.7
0.8
0.9
MDA
0
0
0.2
0.4
0.6
Reproduction Ratio
Fig. 15. Relative execution times for different values of the reproduction
ratio.
0018-9545 (c) 2015 IEEE. Translations and content mining are permitted for academic research only. Personal use is also permitted, but republication/redistribution requires IEEE
permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI
10.1109/TVT.2015.2414296, IEEE Transactions on Vehicular Technology
10
0.45
Diagnosis Error Rate
False Positive Rate
Undetected Rate
0.4
0.35
0.3
0.25
0.2
0.15
0.1
0.05
0
1
10
Fig. 18. Results comparing the presence and lack of normal cases in the
training set.
MMF
3.5
3
2.5
2
1.5
1
0.5
0
1
10
MMF
Ret.
LOW
LOW
LOW
LOW
HOSR
HIGH
HIGH
LOW
RSRP
HIGH
LOW
HIGH
RSRQ
HIGH
HIGH
LOW
Cause
SW Problem
Coverage
Quality
Mobility
0018-9545 (c) 2015 IEEE. Translations and content mining are permitted for academic research only. Personal use is also permitted, but republication/redistribution requires IEEE
permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI
10.1109/TVT.2015.2414296, IEEE Transactions on Vehicular Technology
11
TABLE VI
A DDITIONAL PI MODELS . PDF OF EACH PI CONDITIONED TO AN
EXISTING PROBLEM
PI
Type
SINR
norm
Dist.
norm
Thp.
norm
CPU
negative
binomial
RSSI
norm
avg
avg
avg
n
p
avg
Nor
15
0.8
0.9
0.2
11.4
3.1
0.052
0.01
-116.7
0.5
Parameters/Cause
SW
Cov
Qual
13.6
15
7
1.4
1
1.7
0.9
1.1
1
0.3
0.3
0.2
7.5
7.9
6
4.25
6.1
5.25
2.07
0.052
0.024
6.3e-3
9.6e-4
4.7e-4
-112
-114
-109
10
10
11
Mob
14.2
1
0.9
0.3
9.7
4.25
5.5e-3
2.3e-3
-119
12
Diagnosis
Error Rate
0%
0%
Undetected
Rate
62.41 %
19.19 %
False
Positive Rate
10.08 %
21.67 %
0018-9545 (c) 2015 IEEE. Translations and content mining are permitted for academic research only. Personal use is also permitted, but republication/redistribution requires IEEE
permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI
10.1109/TVT.2015.2414296, IEEE Transactions on Vehicular Technology
12
TABLE VII
R ESULTS OF THE SCALABILITY TESTS
Test
Default parameters
Enhanced parameters
Diagnosis
Error Rate
0.6 %
1.2 %
Undetected
Rate
50.13 %
1.4 %
False
Positive Rate
3.26 %
4.2 %
0018-9545 (c) 2015 IEEE. Translations and content mining are permitted for academic research only. Personal use is also permitted, but republication/redistribution requires IEEE
permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. Citation information: DOI
10.1109/TVT.2015.2414296, IEEE Transactions on Vehicular Technology
13
0018-9545 (c) 2015 IEEE. Translations and content mining are permitted for academic research only. Personal use is also permitted, but republication/redistribution requires IEEE
permission. See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.