0% found this document useful (0 votes)

162 views

Analysis and Detection of Fraud in International Calls Using Decision Tree

This document proposes using a decision tree algorithm to detect SIMbox fraud in international calls based on analyzing call data records. The decision tree model would be built using six features extracted from call data records, with the goal of classifying calls as either legitimate or fraudulent SIMbox calls. The proposed technique was tested on data from a mobile operator in Libya and achieved a 97.95% detection accuracy. Decision trees are a commonly used machine learning method that can represent classification rules understood by humans through a multi-stage binary tree with internal nodes splitting the data and terminal nodes assigning a class.

Uploaded by

Dilip Thummar

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

162 views

Analysis and Detection of Fraud in International Calls Using Decision Tree

Uploaded by

Dilip Thummar

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 10

Analysis and Detection of Fraud in International

Calls Using Decision Tree

Ahmed Aljarray and Abdulla Abouda

Almadar Aljadid R&D Office, Libya-Misrata

Abstract. fraud is one of the most severe threats to revenue and quality
of service in telecommunication networks. The advent of new technologies
has provided fraudsters with new techniques to commit fraud. Subscriber
identity module box (SIMbox) fraud is one of such fraud that is used in
international calls and it has emerged with the use of VOIP technologies.
In this paper, we propose a novel technique for detecting SIMbox fraud
in international calls. The proposed technique is based in using decision
tree algorithm to build a model based on six features extracted from
call data record (CDR). The proposed algorithm is tested using dataset
obtained from a real mobile operator (Almadar Ajadid Co.,) and it has
shown 97.95% detection accuracy.

1 Introduction

Cellular network operators lose about 3% of the their annual revenue due to
fraudulent and illegal services [1]. Juniper Research estimated the total losses
from the underground mobile network industry to be 58 billion in 2011 [1, 2].
The impact of voice traffic termination fraud, commonly known as Subscriber
Identity Module (SIMbox) fraud or bypass fraud, on mobile networks is partic-
ularly severe in some parts of the globe [2]. Recent highly publicized raids on
fraudsters include those in Mauritius, Haiti, and El Salvador [3].
Fraudulent SIMboxes hijack international voice calls and transfer them over the
Internet to a cellular device, which injects them back into the cellular network.
As a result, the calls become local at the destination network [4]. When interna-
tional call is received with the emergence of a local number on the phone screen
that call should be noted as a type of fraud which causes considerable losses for
the telecommunications companies. Cellular operators of the intermediate and
destination networks do not receive payments for call routing and termination.
Fraudulent SIMboxes also hijack domestic traffic in certain areas, e.g. in Alaska
within the United States, where call termination costs are high. In some cases,
the traffic is injected into a cellular network and is forwarded to the terminating
country [5]. This increases the call routing cost for the operator of the injected
traffic. Besides causing the economic loss, SIMboxes degrade the quality of local
service where they operate. Often, cells are overloaded, and voice calls routed
over a SIMbox have poor quality, which results in customer dissatisfaction. Al-
though some vendors provide cellular anti-fraud services, the large amount of
Fig. 1: Example of one-hop SIM-box bypass fraud hijacking of an international
call [7]

daily cellular traffic and the number of connected mobile devices make detecting
call bypassing fraud extremely challenging. Moreover, traffic patterns and char-
acteristics of fraudulent SIMboxes are very similar to those of certain legitimate
devices, such as cellular network probes. So, detecting fraudulent SIMboxes re-
sembles searching for a few needles in a huge haystack full of small objects that
look like needles. While operators of the intermediate and destination networks
have high financial incentives to understand the problem, they do not have the
data to analyse the international calls that are gone. Also, the absence of publicly
available SIMbox related data is a major obstacle for emerging of comprehensive
studies on voice bypassing fraud analysis and detection [6]. By contrast, most
of the SIMbox traffic analysed in this paper is on the originating end of the
communication, giving us insight on SIMbox fraud from a different perspective
than most networks with a bypass problem. This work analyses fraudulent SIM-
box traffic based on communication data from Almadar Aljadid company, one of
the major mobile operators in Libya. It neither collects nor uses any personally
identifiable information. Based on these observations, we propose using decision
tree for detecting fraudulent SIMboxes. The proposed technique shows high de-
tection rate and correctly filters out mobile network probes with traffic patterns
similar to those of SIMboxes.
The rest of this paper is organized into six sections. Section II overviews fraud in
international call and illustrates it with basic example. Section III presents deci-
sion tree algorithm of fraud detection in international calls. Section IV analyses
SIMbox related traffic, compares it to the legitimate traffic, based on the ex-
tracted features. In Section V we describe some experiments we have performed,
and Section VI concludes the paper.

2 Fraud in International Calls (Bypass Fraud)

SIMbox voice fraud occurs when the cost of terminating domestic or interna-
tional calls exceeds the cost of a local mobile-to-mobile call in a particular re-
gion or country. Fraudsters make profit by offering low-cost international and
sometimes domestic voice calls to other operators. To bypass call routing fees,
they buy large amounts of SIM cards, install them into an off-the-shelf hardware
to connect to a cellular network, which essentially becomes a SIMbox. Then the
fraudsters transfer a call via the Internet to a SIMbox in the area of call recipient
to deliver the call as local. As a result, the operators serving the called party do
not receive the corresponding call termination fees. In other cases SIMboxes re-
inject telecom voice traffic into the cellular network masked as mobile customer
calls, and the operator pays for carrying the re-injected calls. Figure 1 shows
example of how SIMbox bypass fraud occurs in international phone calls. For
simplicity, the example assumes that there is only one intermediate hop con-
nection between two countries. The lower path marks a legitimate path for a
phone call, whereas the upper path indicates a fraudulent one when a SIMbox
is in place. Actual SIMbox fraud is often more complicated, involving multiple
intermediate steps. In the legitimate case, once the origin customer dials desti-
nation customer number, the call is routed through the cellular infrastructure of
operator 1 to international switch (Regular Transit). Based on an agreement be-
tween operator 2 and the international switch, the call is routed to cellular core
network of operator 2. The international switch pays operator 2 a fee in order to
have the call terminated. Then the call is routed through cellular infrastructure
of operator 2 and is delivered to destination customer. The fraud occurs when a
fraudulent international switch hijacks call of origin customer and forwards it to
operator 2 over the Internet (e.g. via VoIP). Then in the country of destination
customer a SIMbox transforms the incoming VoIP flow into a local mobile call
to the destination customer, and operator 2 loses the termination fee for the
hijacked calls.

3 Decision Tree Algorithm

Machine learning is a technique which computer learns from a set of data given
to it, and then it becomes able to predict the result of new data similar to
the training data. The machine learning algorithm is meant to identify patterns
based on different characteristics or features and then make predictions on new,
unclassified data based on the patterns learned earlier. The input data is usu-
ally numerous instances of relations between the different variables or features
relevant to the data.
There are various different approaches to machine learning namely decision trees,
random forests, neural networks, clustering, bayesian networks, reinforcement
learning, support vector machines, genetic algorithms, and many more. Decision
tree learning is a method commonly used in data mining. Decision trees are pow-
erful and popular tools for classification and prediction. Decision trees represent
rules, which can be understood by humans and used in knowledge system such
as database. The goal is to create a model that predicts the value of a target fea-
ture based on several input features. Figure 2 shows general criteria in decision
tree. A decision tree represents a multi-stage decision process, where a binary
decision is made at each stage. The tree is made up of nodes and branches, with
Fig. 2: General criteria in decision tree

nodes being designated as an internal or a terminal (leaf) node. Internal nodes

are the ones that split into two children. Each internal node corresponds to one
of the input features, there are edges to children for each of the possible values
of that input feature, while terminal nodes do not have any children. A terminal
node has a class label associated with it, such that observations that fall into
the particular terminal node are assigned to that class. To use a decision tree,
a feature vector is presented to the tree. If the value for a feature is less than a
defined number, then the decision is to move to the left child. If the answer to
that question is no, then we move to the right child. We continue in that manner
until we reach one of the terminal nodes, and the class label that corresponds
to the terminal node is the one that is assigned to the pattern. Decision tree
induction algorithms are function recursively. First, a feature must be selected
as the root node. In order to create the most efficient tree (i.e., smallest tree),
the root node must effectively split the data. Each split attempts to pare down
a set of instances (the actual data) until they all have the same classification.
The best split is the one that provides what is termed the most information
gain [810].
The tree grows by recursively splitting each node using the feature which gives
the best information gain until the leaf is consistent.
Example:
Applying decision tree rules on node A for a tree model is shown in Figure 3
where M is the number of SIMbox training samples, N is the number of legiti-
mate training samples. Four next steps are used to calculate I.G for one feature
with one condition:
1- Calculate entropy at node A:

M M N N
H(S) = log2 log2 (1)
M +N M +N N +M N +M

2- The data set is split into two branches by different feature, the entropy for
each branch is calculated:
Ha = H(m, n)
Fig. 3: Model example

m m n n
Ha = log2 log2 (2)
m+n m+n n+m n+m

Hb = H(M m, N n)

M m M m
Hb = log2
(M m) + (N n) (M m) + (N n)

N n N n
log2 (3)
(N n) + (M m) ((N n) + (M m)

3- The entropy for each branch is added proportionally to get total entropy for
the split:
H(S|A) = Pa Ha + Pb Hb

m+n (M m) + (N n)
H(S|A) = Ha + Hb (4)
M +N (M + N )

where Pa is the number of samples at node (a) per the number of samples at
node (A), Pb is the number of samples at node (b) per the number of samples
at node (A).
4- The resulting entropy is subtracted from the entropy before the split and the
result is the information gain or decrease in entropy:

I.G(S, A) = H(S) H(S|A) (5)

Table 1 summarises the decision tree algorithm specialized to learning boolean-

valued functions. Decision tree is a greedy algorithm that grows the tree top-
down. At each node selecting the features that best classifies the local training
samples. This process continues until the tree perfectly classifies the training
samples, or all features have been used [11].
Table 1: Summary of decision tree algorithm

Decision tree (data samples, target-feature, features-list)

Data samples are the training samples (building samples). Target-feature is the
feature whose value is to be predicted by the tree. Feature-list is a list of other
features that may be tested by the Decision tree.

Create a Root node for the tree

If all samples are SIMbox, Return the single-node tree Root, with label =SIMbox
If all samples are Legitimate, Return the single-node tree Root, with label = Legitimate
If features-list are empty, Return the single-node tree Root, with label = most common
value of Target-feature in samples
Otherwise Begin
A is the feature from features-list with condition that gives best classifies samples
with best(the feature that gives the biggest I.G)
The decision feature for Root is A
For each possible value, vi , of A,
Add a new tree branch below Root, corresponding to the test A = vi
Let samplesvi be the subset of samples that have value vi for A
If samples of vi is empty
Then below this new branch add a leaf node with label = most common
value of Target-feature in samples.
Else below this new branch add the subtree Decision tree(samplesvi , Target-
feature, features-list without A).

End.

Return Root

4 SIMbox Fraud Analysis

4.1 Data feeds

We analyse samples of fully anonymous call data records (CDRs) from a tier-1
cellular operator in Libya (Almadar Aljadid Co.,). Data collected between Oc-
tober 2014 and November 2014. CDRs are logs of all phone calls, text messages,
and data exchanges in the network. If there are two communicating parties (caller
and receiver) belong to the same cellular provider, two records are stored.

4.2 Data sample

The data set contains CDRs of 34 known fraudulent SIMboxes account and of
about 273 legitimate accounts. The legitimate accounts consist of fully anonymized
post-paid family plans, unlikely to be involved in fraudulent activities, corporate
accounts, and mobile network probing devices. It is a common practice that lo-
cal and foreign cellular operators and device manufacturers probe the mobility
network to measure the quality of service in terms of latency, to test upcoming
new cellular devices, etc. [12,13]. Probing devices generate a rather large number
of voice calls, most of which are addressed to different recipients. This contrasts
with the communication pattern of regular users, who make less phone calls to
fewer contacts [14]. The data set split into two parts the first one are used for
building (training) and the second one are used for testing.

4.3 Call traffic feature

CDR fields (collected during five days in 2014) are transformed into 6 features
characterizing voice call communication patterns of legitimate and fraudulent
users. The six features are: The total number of outgoing and incoming calls are
counted based on MO and MT, the total number of SMS originating and SMS
terminating, the total number of hand over and the total number of different
locations (NoDF) number of calls have different between first and last location
at the same call and summing with number of calls that have different between
last location of call and first location of the next call. Customer details are
obtained from the corresponding CDR fields.

4.4 SIMbox data Analysis

This sub-section analysis the traffic characteristics of fraudulent SIMboxes based
on the features described in the previous subsection (Section 4.3).
Figure 4 a plots the number of MO calls versus the number of MT calls. It can
be noticed that most of SIMboxes are clustered around two areas without any
legitimate account and legitimate accounts are clustered around another area.
It can be observed that most of SIMboxes have originating calls more than ter-
minating ones while legitimate accounts have comparable number of originating
and terminating calls. That is because SIMboxes are used mainly to regenerate
the calls received from the VOIP branch and make them GSM calls again. This
feature is very useful to distinguish between SIMboxes and legitimate accounts.
Figure 4 b present the number of MT calls versus the number of different loca-
tions (NoDL). It can be clearly seen that legitimate accounts have large number
of terminating calls and higher mobility than SIMbox accounts. This is due to
the fact that legitimate users are usually not tight to a specific location while
SIMboxes are installed to one location and could be moved from time to time.
This feature is very attractive to utilize in order to detect SIMbox accounts.
Figure 4 c plots number of SMS originating (SMSO) versus NoDL. The number
of locations feature has split most of samples and here it has been used with
SMS originating. We can notice that most of SIMboxes have a small number of
SMSO bounded in a small level but legitimate accounts have number of SMSO
larger than number of SMSO of SIMboxes.
Figure 4 d plots the number of MT calls versus the number of SMSO. It can
be seen that legitimate and SIMboxes accounts have similar behaviour and it is
250 40
SIMbox SIMbox
Legitimate Legitimate
35
200
30
The Number of MT Calls

25
150

NoDL
20

100
15

10
50
5

0 0
0 20 40 60 80 100 120 140 0 10 20 30 40 50 60
Number of MO Calls Number of SMSO

((a)) ((c))

40 60
SIMbox SIMbox
Legitimate Legitimate
35
50

40
25
Number of SMSO
NoDL

20 30

15
20

10
5

0 0
0 20 40 60 80 100 120 140 0 50 100 150 200 250
The Number of MT Calls Number of MT Calls

((b)) ((d))

Fig. 4: Analysis the traffic characteristics of fraudulent SIMboxes based on the

features

hard to distinguish between them based on this feature.

Based on the analysis above we can conclude that from the six explored feature
the number of locations feature can give the highest distinguish rate between
SIMboxes and legitimate accounts. In other words the number of locations fea-
ture results in the highest information gain and therefore, it should be used in
the first stage.

5 Experimental results
The practical performance of the decision tree algorithm described in the pre-
vious section was tested using another data sample (that used for testing) that
consist of 12 samples of SIMboxes and 251 samples of legitimate accounts. Ac-
cording to the information gain measure, the Number of different locations pro-
vides the best prediction of the target feature (kind of account) over the training
samples. Therefore, the number of different locations is selected as the decision
feature for the root node, and branches are created below the root for each of its
Table 2: Information Gain for the features at each node
MO MT SMSO SMST NoDL node
0.052596 0.207557 0.097026 0.088234 0.276763 Root(R)
0.514704 0.245623 0.118183 0.133216 0.181276 R-Left(L)
0.066197 0.136376 0.040580 0.174136 0.072861 R-L-L
0.311689 0.311689 0 0 0.141619 R-L-L-L
0.027740 0.257678 0 0 0.242697 R-L-L-L-L
0.144484 0.078982 0 0 0.144484 R-L-L-L-L-L
0.122556 0.122556 0 0 0.811278 R-L-L-L-L-L-r
0.093531 0.111687 0.138122 0.185579 0.012461 R-L-r
0.970950 0.970950 0.321928 0.170950 0 R-L-r-r
0.013723 0.053982 0.002601 0.006265 0.008751 R-r
0.918295 0.251629 0.251629 0.918295 0.918295 R-r-L

possible values. Table 2 summarises the information gain for the six features at
each node. where R is root node, L is a node on the left and r is a node on the
right. There are two types of testing to determine the accuracy of the algorithm,
true negative rate test and true positive rate test. The true negative rate test
is the proportion of legitimate accounts classified as legitimate (Its inverse of
The false positive rate), whereas true positive rate is the proportion of SIMboxes
classified as SIMbox accounts (Its inverse of the false negative rate).
Figure 5 a shows the prediction accuracy of the proposed algorithm as a func-
tion of number of building samples. It can be clearly seen that as the number of
samples increases the accuracy of the algorithm improves. When the full number
of samples were used the classification accuracy has reached 97.95%. Figure 5 b
shows true negative rate versus the number of legitimate building samples when
using decision tree algorithm to predict status of the SIM-Card. It can be seen
that the prediction accuracy improves with changing the number of samples for
legitimate users. The improvement is due to the fact increasing the number of
legitimate building samples improves the understanding of the of behaviour of
legitimate users.

6 Conclusions

In this paper six features extracted from CDR data are utilized to build decision
tree that can be used to distinguish between legitimate and SIMbox accounts.
The features include the total number of outgoing and incoming calls, the total
number of SMS originating and SMS terminating, the total number of hand over
and the total number of different locations. The proposed decision tree algorithm
has shown accuracy up to 97.95% when it was tested using testing samples data
from Almadar Aljadid company.
98 98

96 96

94
94
92

True negative rate

92
90
Accuracy

90
88
88
86
86
84
84
82

80 82

78 80
0 50 100 150 200 250 300 0 50 100 150 200 250
Number of building samples Number of building Legitimate samples

((a)) ((b))

Fig. 5: Total Accuracy of Algorithm and True Negative rate of Algorithm

References
1. H. Windsor, Mobile Revenue Assurance Fraud Management, Juniper Research,
http://goo.gl/GX7G4.
2. M. Yelland, Fraud in mobile networks, Computer Fraud & Security, vol. 2013, no.
3, pp. 5-9, 2013.
3. Raids on SIM Box/GSM Gateway Fraudsters Save Mobile Operators Millions,
Reuters, http://goo.gl/pHCpK.
4. Fraud in the Mobile World, Revector, http://goo.gl/Uobx6.
5. I. Murynets, M. Zabarankin, R.P. Jover and A. Panagia, Analysis and detection
of SIMbox fraud in mobility networks, INFOCOM, 2014 Proceedings IEEE, pp.
1519-1526, May 2014.
6. A. H. Elmi, S. Ibrahim, and R. Sallehuddin, Detecting sim box fraud using neural
network, in IT Convergence and Security 2012. Springer, 2013, pp. 575-582.
7. N2B Risk Management, http://www.zira.com.ba/products/risk-managemet/n2b-
fraud-management-system/sim-box.
8. G. Kesavaraj, S. Sukumaran, A study on classification techniques in data mining,
International Conference on Computing, Communications and Networking Tech-
nologies (ICCCNT), pp. 1-7, July 2013.
9. Wendy L. Martinez , Angel R. Martinez, Computational Statistics Handbook with
MATLAB,, 2002.
10. T. M. Mitchellz, Machine Learning,,Published by McGraw-Hill, March 1997.
11. Y. Freund, The alternating decision tree learning algorithm, in Machine Learn-
ing: Proceedings of the Sixteenth International Conference, March 1999.
12. I. Murynets and R. Piqueras Jover, Crime scene investigation: SMS spam data
analysis, in Proceedings of the 2012 ACM conference on Internet measurement.
ACM, pp. 441-452, 2012.
13. RCATS - Remote Cellular Active Test System, JDSU, http://goo.gl/VEbMA.
14. A.-L. Barabasi and R. Albert, Emergence of scaling in random networks, science,
vol. 286, no. 5439, pp. 509-512, 1999.

SMS Firewall Solution Description 2019
100% (2)
SMS Firewall Solution Description 2019
30 pages
Vuga SagasOfTheIcelanders
100% (3)
Vuga SagasOfTheIcelanders
180 pages
Unified Communication System Proposal
No ratings yet
Unified Communication System Proposal
11 pages
David and Goliath, A Story of Place. The Narrative-Geographical Shaping of 1 Samuel 17
No ratings yet
David and Goliath, A Story of Place. The Narrative-Geographical Shaping of 1 Samuel 17
11 pages
Telecom Fraud & Management
100% (7)
Telecom Fraud & Management
11 pages
Wifi Survey2015 715 Web
No ratings yet
Wifi Survey2015 715 Web
32 pages
SMSC SMPP Server-Client
No ratings yet
SMSC SMPP Server-Client
8 pages
Entreprise SMS Platform Brochure
No ratings yet
Entreprise SMS Platform Brochure
6 pages
Curriculum First Waldorf School
100% (5)
Curriculum First Waldorf School
78 pages
Classification Detection Prosecution of Fraud
No ratings yet
Classification Detection Prosecution of Fraud
6 pages
TD 101254 Analysis Detection SIMBox
No ratings yet
TD 101254 Analysis Detection SIMBox
8 pages
Preventing Mobile Fraud WP 2010
No ratings yet
Preventing Mobile Fraud WP 2010
7 pages
Detecting SIM Box Fraud Using Neural Network
No ratings yet
Detecting SIM Box Fraud Using Neural Network
9 pages
2012 WHITEPAPER Telecommunication-Fraud-Management Waveroad ConsulT
No ratings yet
2012 WHITEPAPER Telecommunication-Fraud-Management Waveroad ConsulT
24 pages
What Is SIMBOX?
No ratings yet
What Is SIMBOX?
6 pages
Telecom Fraud-Introduction, Types, and Solutions-White Paper PDF
No ratings yet
Telecom Fraud-Introduction, Types, and Solutions-White Paper PDF
15 pages
Guide To Detecting and Preventing Telecom Fraud
No ratings yet
Guide To Detecting and Preventing Telecom Fraud
16 pages
Mobileum - CaseStudy ROAMING CEM
No ratings yet
Mobileum - CaseStudy ROAMING CEM
4 pages
SIM Boxing
No ratings yet
SIM Boxing
2 pages
UC SIM Box Detection1
No ratings yet
UC SIM Box Detection1
47 pages
Telecom Fraud
No ratings yet
Telecom Fraud
29 pages
Traffic Analysis of A Short Message Service Network: January 2010
No ratings yet
Traffic Analysis of A Short Message Service Network: January 2010
5 pages
Infobip Whitepaper SMS FW Fraud Detection
No ratings yet
Infobip Whitepaper SMS FW Fraud Detection
6 pages
UCaaS and CPaaS For Cloud
No ratings yet
UCaaS and CPaaS For Cloud
3 pages
Sms Spam
No ratings yet
Sms Spam
14 pages
Fitsum Tesfaye
100% (1)
Fitsum Tesfaye
59 pages
VoIP Vs PBX - Dialpad
No ratings yet
VoIP Vs PBX - Dialpad
4 pages
A2PSMSC 21270.v.2.0 Web
No ratings yet
A2PSMSC 21270.v.2.0 Web
4 pages
Camel Application Part-By Abhinav Kumar & VAS
No ratings yet
Camel Application Part-By Abhinav Kumar & VAS
20 pages
Telecom Fraud
No ratings yet
Telecom Fraud
30 pages
Auditing Fraud and Revenue Assurance in Telecom Companies Sep12
No ratings yet
Auditing Fraud and Revenue Assurance in Telecom Companies Sep12
4 pages
Cellusys Datasheet SMS Defence v4.6
No ratings yet
Cellusys Datasheet SMS Defence v4.6
4 pages
Stay Compliant With Regulations by Employing Trustworthy MVNO Billing Solutions
No ratings yet
Stay Compliant With Regulations by Employing Trustworthy MVNO Billing Solutions
2 pages
Metaswitch White Paper For NFV & SDN
No ratings yet
Metaswitch White Paper For NFV & SDN
19 pages
Cpaas Proposal: One Touch Technology LTD
No ratings yet
Cpaas Proposal: One Touch Technology LTD
3 pages
USSD Sales Kit: Disclaimer
No ratings yet
USSD Sales Kit: Disclaimer
12 pages
A2P and P2A SMS
No ratings yet
A2P and P2A SMS
2 pages
SMS Spam Fraud Prevention
No ratings yet
SMS Spam Fraud Prevention
6 pages
Rich Communication Service (RCS)
No ratings yet
Rich Communication Service (RCS)
12 pages
Multimedia Applications
No ratings yet
Multimedia Applications
112 pages
A2P Monetization: Monetization Is The Process of Converting or Establishing Something Into Legal Tender (Wikipedia)
No ratings yet
A2P Monetization: Monetization Is The Process of Converting or Establishing Something Into Legal Tender (Wikipedia)
2 pages
Oosd - Cellular Networking Case Study
100% (1)
Oosd - Cellular Networking Case Study
54 pages
SMS Defense White Paper
No ratings yet
SMS Defense White Paper
16 pages
Ovum Omnichannel Whitepaper US Final
No ratings yet
Ovum Omnichannel Whitepaper US Final
25 pages
How SMS Messaging Works
No ratings yet
How SMS Messaging Works
15 pages
Mvno Migration Final
No ratings yet
Mvno Migration Final
4 pages
2 Ways Service Overview
No ratings yet
2 Ways Service Overview
15 pages
Presentation On MVNO (Mobile Virtual Network Operator)
100% (1)
Presentation On MVNO (Mobile Virtual Network Operator)
19 pages
Prepaid MVNO - Telecom Basics and Introduction To BSS
No ratings yet
Prepaid MVNO - Telecom Basics and Introduction To BSS
14 pages
PBX and VOIP Security Vulnerabilities
No ratings yet
PBX and VOIP Security Vulnerabilities
3 pages
Whit E Pap ER: Australian SMS SPAM Compliance
No ratings yet
Whit E Pap ER: Australian SMS SPAM Compliance
9 pages
SIM Box - Predictive Modeling For Fraud Detection
100% (2)
SIM Box - Predictive Modeling For Fraud Detection
140 pages
SKMM Mobile Virtual Network Operators
No ratings yet
SKMM Mobile Virtual Network Operators
44 pages
Online charging system Second Edition
From Everand
Online charging system Second Edition
Gerardus Blokdyk
No ratings yet
Building Telephony Systems with OpenSER
From Everand
Building Telephony Systems with OpenSER
Goncalves Flavio E.
No ratings yet
Introductory Guideline for Using Twilio Programmable Messaging and Programmable Voice Services
From Everand
Introductory Guideline for Using Twilio Programmable Messaging and Programmable Voice Services
Dr. Hidaia Mahmood Alassouli
No ratings yet
Mobile Virtual Network Operator MVNO The Ultimate Step-By-Step Guide
From Everand
Mobile Virtual Network Operator MVNO The Ultimate Step-By-Step Guide
Gerardus Blokdyk
No ratings yet
Overview of Some Voice Over IP Calls and SMS Verifications Services Providers
From Everand
Overview of Some Voice Over IP Calls and SMS Verifications Services Providers
Dr. Hidaia Mahmood Alassouli
No ratings yet
Diameter Protocol A Complete Guide
From Everand
Diameter Protocol A Complete Guide
Gerardus Blokdyk
No ratings yet
Public Cloud for Core Banking A Complete Guide
From Everand
Public Cloud for Core Banking A Complete Guide
Gerardus Blokdyk
No ratings yet
Network performance Third Edition
From Everand
Network performance Third Edition
Gerardus Blokdyk
No ratings yet
Analysis and Detection of Simbox Fraud in Mobility Networks: Proceedings - Ieee Infocom April 2014
No ratings yet
Analysis and Detection of Simbox Fraud in Mobility Networks: Proceedings - Ieee Infocom April 2014
9 pages
Grappling With The Challenges of Interconnect Bypass Fraud: Okumbor N. Anthony, Ateli A. Joy
No ratings yet
Grappling With The Challenges of Interconnect Bypass Fraud: Okumbor N. Anthony, Ateli A. Joy
7 pages
Assembler, Compiler, Interpreter, Linker, Loader
No ratings yet
Assembler, Compiler, Interpreter, Linker, Loader
2 pages
Math24-1 LQ2 2014-2015 4Q
No ratings yet
Math24-1 LQ2 2014-2015 4Q
1 page
Current Recruitment Process Hyundai
75% (8)
Current Recruitment Process Hyundai
77 pages
ETABS-Example-RC Building Seismic Load - Time History
100% (15)
ETABS-Example-RC Building Seismic Load - Time History
59 pages
Compiler Interpreter
No ratings yet
Compiler Interpreter
3 pages
ETABS-Example-RC Building Seismic Load - Time History
100% (15)
ETABS-Example-RC Building Seismic Load - Time History
59 pages
ETABS Examples Manual
90% (31)
ETABS Examples Manual
50 pages
Computer Digest - PDF 69
No ratings yet
Computer Digest - PDF 69
18 pages
Rojalin PDF Cha
No ratings yet
Rojalin PDF Cha
1 page
Motor k100m
No ratings yet
Motor k100m
3 pages
Glossary of Media Literacy Terms
No ratings yet
Glossary of Media Literacy Terms
73 pages
Chatterjee IIMBMR 2013 Vol.25 Iss.1
No ratings yet
Chatterjee IIMBMR 2013 Vol.25 Iss.1
14 pages
WEEK 27 - Unit 8 - Going Away + REVISION LESSON
No ratings yet
WEEK 27 - Unit 8 - Going Away + REVISION LESSON
8 pages
Flex 24v 2022 Catalog Pms Digital
No ratings yet
Flex 24v 2022 Catalog Pms Digital
100 pages
David Cleary Vegan Fat Loss Guide
100% (1)
David Cleary Vegan Fat Loss Guide
48 pages
Present Perfect Continuous
No ratings yet
Present Perfect Continuous
21 pages
MODULE 08 Artificial Intelligence
No ratings yet
MODULE 08 Artificial Intelligence
84 pages
VIP 3 REF Exam
No ratings yet
VIP 3 REF Exam
8 pages
Revisedloadandresistancefactorsforthe AASHTOLRFDBridge Design Specifications
No ratings yet
Revisedloadandresistancefactorsforthe AASHTOLRFDBridge Design Specifications
14 pages
Petteri Huuhka Google Paper
No ratings yet
Petteri Huuhka Google Paper
13 pages
Walmart Case
No ratings yet
Walmart Case
22 pages
Week 3 - Assignment Solutions
83% (6)
Week 3 - Assignment Solutions
4 pages
Installation Guide Roll-By Method
100% (1)
Installation Guide Roll-By Method
24 pages
Css Geography Repeated Questions
No ratings yet
Css Geography Repeated Questions
6 pages
Astm F1960
No ratings yet
Astm F1960
7 pages
Study Paper On 5G Transport Requirement
No ratings yet
Study Paper On 5G Transport Requirement
55 pages
MicroStrategy Mobile Design and Administration Guide 9.3.0
No ratings yet
MicroStrategy Mobile Design and Administration Guide 9.3.0
244 pages
Universal Motor
No ratings yet
Universal Motor
24 pages
UCTMCFSP PG Scholarship AppGuidelines 2023 Intake
No ratings yet
UCTMCFSP PG Scholarship AppGuidelines 2023 Intake
4 pages
Culminating Activity: Creative Nonfiction
No ratings yet
Culminating Activity: Creative Nonfiction
2 pages
Molecular Phylogenetic Analyses Indicate Extensive
No ratings yet
Molecular Phylogenetic Analyses Indicate Extensive
4 pages
Building Item Bank
67% (3)
Building Item Bank
6 pages
Module2 Discussion2.1
No ratings yet
Module2 Discussion2.1
9 pages
Ms 02 331
No ratings yet
Ms 02 331
29 pages
Asi Controls
No ratings yet
Asi Controls
38 pages
Asad Notes
No ratings yet
Asad Notes
15 pages
The Level of Due Dilligence in The Land Processess
No ratings yet
The Level of Due Dilligence in The Land Processess
3 pages
4 - The Nature and Types of Variables and Data
No ratings yet
4 - The Nature and Types of Variables and Data
5 pages

Analysis and Detection of Fraud in International Calls Using Decision Tree

Uploaded by

Analysis and Detection of Fraud in International Calls Using Decision Tree

Uploaded by

Analysis and Detection of Fraud in International

Calls Using Decision Tree

Ahmed Aljarray and Abdulla Abouda

Almadar Aljadid R&D Office, Libya-Misrata

2 Fraud in International Calls (Bypass Fraud)

3 Decision Tree Algorithm

nodes being designated as an internal or a terminal (leaf) node. Internal nodes

I.G(S, A) = H(S) H(S|A) (5)

Table 1 summarises the decision tree algorithm specialized to learning boolean-

Decision tree (data samples, target-feature, features-list)

Create a Root node for the tree

4 SIMbox Fraud Analysis

4.1 Data feeds

4.2 Data sample

4.3 Call traffic feature

4.4 SIMbox data Analysis

Fig. 4: Analysis the traffic characteristics of fraudulent SIMboxes based on the

hard to distinguish between them based on this feature.

True negative rate

Fig. 5: Total Accuracy of Algorithm and True Negative rate of Algorithm

You might also like