SlideShare a Scribd company logo
IOSR Journal of Engineering (IOSRJEN) www.iosrjen.org
ISSN (e): 2250-3021, ISSN (p): 2278-8719
Vol. 05, Issue 08 (August. 2015), ||V4|| PP 05-10
International organization of Scientific Research 5 | P a g e
Application of Meta learning in Banking Sector
Sanjay Kumar Sen, Dr. B.K. Ratha.
Assistant Professor, Dept. of CSE . Orissa Engineering College, Bhubaneswar, Odisha, India
sanjaysen2k@gmail.com
Reader, Dept. of Computer Science and Application, Utkal University, Vani Vihar, Bhubaneswar.
vkramus@yahoo.com
Abstract: - The data mining technology is a process of identifying patterns and information from a huge
quantity of data. In a single repository data base where data is stored in central site, then applying data mining
algorithms on these data base, patterns are extracted which are converted into information. It becomes
important area for financial organization like banking sector, insurance sector, stock market etc. Banking
systems gather huge amount of data like customer information, transaction details, loan details, credit card
details etc. It helps to find hidden patterns in a group and discover unknown relationship in the data. Instead of
quantitative and statistical data characteristics, this technique helps in data interpretations for the banking sector.
Data mining tools helpful for leading banking for credit scoring service and approval predicting payment lapse,
fraud detecting etc. By applying data mining techniques bankers are getting advantage in competitive market. It
is also helpful for the retailers for customer buying tradition and also desires. This proposed work in this paper
is the combination of supervised machine learning algorithms, Classification and Regression Tree (CART),
Adaboost and Logitboost, Bagging and Dagging are proposed for classification of banking data. These resulted
forms help researchers to detect fraud in banking data. The experimental result shows the performance analysis
of different meta-learning algorithms and also compared on the basis of misclassification and correct
classification rate. Smaller misclassification reveals that bagging algorithm performs better classification of
banking data fraud detection technique.. The proposed work in this paper is the combination of five supervised
machine learning algorithms, Classification and Regression Tree (CART), Adaboost, Grading, Bagging are
proposed for classification of banking data. The experimental result shows the performance analysis of different
meta-learning algorithms and also compared on the basis of misclassification and correct classification rate.
Smaller misclassification reveals that bagging algorithm performs better classification of banking data fraud
detection technique.
I. INTRODUCTION
The modern technology has great influence on banking business. All traditional banks compelled to
adopt and implement new technological approach not only to improve financial and customer relationship but
also to reduce time and cost to survive in this competitive market. Some automated software which radically
changed the basic definition of the definition of the banking business.
Data mining is the process of extracting hidden and unknown knowledge by applying statistical and
machine-learning techniques for extracting interesting patterns from huge data base. Data mining is the
application of statistical and machine-learning techniques for extracting interesting patterns from raw data [4]
The strong security systems maintained in banking industries to avoid fraud in internet banking.
Though fraud in banking can not be removed drastically., but to the some extent can be minimized. Programs
for Machine Learning, published in 1993 [1], that many applications of artificial intelligence are based on a
model of knowledge that is usually employed by a human specialist. This paper helps to minimize the bank
fraud by using different meta learning algorithm. Initial work on the forecasting system of banking system is
done by Frankel and Rose using some indicators in relation to the emerging currency markets[2]. The first group
indicates integrated indicators of interest rates of interest rates and output growth[3]. The second group includes
national macroeconomic indicators like sudden changes the rate of production[3]. The third category includes
indicators of price revaluation, the current account deficit and the level of debt. The fourth class of variables
describes the determinants of a debt structure [10].
1.1Application of Data Mining in Banking Sector
Various areas where data mining can be used in financial sectors [5][6][7] like customer segmentation and
profitability, credit analysis, predicting payment default, marketing, fraudulent transactions, ranking
investments, optimizing stock portfolios, cash management and forecasting operations, high risk loan applicants,
most profitable Credit Card Customers and Cross Selling. Certain examples where banking industry has been
utilizing the data mining technology effectively as follows.
Application of Meta learning in Banking Sector
International organization of Scientific Research 6 | P a g e
1.2Fraud Detection Fraud detection
[8][9] [10][11] [12] is the recognition of symptoms of fraud where no priorm suspicion or tendency to fraud
exists. Fraud is defined as „a deception deliberately practiced in order to secure unfair of unlawful gain. Fraud
detection is the detection of criminal activities which occurs in commercial organizations such as banks, credit
card issuing organizations, insurance agencies, mobile companies, stock market. Themmalicious users might be
the actual customers of the organization or might be posing as a customer (also known as identity theft) [11].
Financial Organizations especially banking sectors follows mainly two approaches[5] [12]towards determining
the fraud patterns, online transaction check and Offline transaction Check.
1.3 Risk Management
Data Mining is used to identify the risk factors in each department of banking business [6]. Credit
Approval authorities in the financial organization used data mining techniques to determine the risk factors in
lending decisions[13] by analyzing the data based on nationality, repayment capacity and so on. Retail
marketing department uses data mining methodologies to find the reliability[14] and the behavior of credit card
applicant[8] while selling the credit cards. They uses data mining techniques on existing customers to sell credit
cards or increase customers credits or top up on credit card loans [15]. In commercial lending, data mining plays
a vital role. In commercial lending, risk assessment is usually an attempt to quantify the risk of default or loss to
the lender when making a particular lending decision or approving a credit facility [8].
Data Mining can be used to derive credit behavior [8] of individual borrowers with parameters card
loans, mortgage value, repayment and using characteristics such as history of credit, employmen period and
length of residency. A score is thus produced that allow a lender to evaluate the customer and decide whether
the person is a good candidate for a loan, or if there is a tendency to become high risk of default[14] Customers
who have been with bank for a longer periods of time, remained better with bank and have good credit history
and have higher salaries/wages, are more likely to receive a loan than a new customer who has no credit history
with the bank, or who earns low salaries/wages[17].Bank can reduce the risk factors to maintain a better position
by knowing the chances of a customer to become default[18] [19].
II. ALGORITHM USED
1.Classification and Regression Tree(CART) It was introduce by Breimann 1984.It builds both classification
and regression tree (Gini index measure is used for selecting splitting attribute. Pruning is done on training data
set. It can deal with both numeric and categorical attributes and can also handle missing attributes. [12]. The
CART monograph focus on the Gini rule which is similar to the better know entropy or information gain
criterion [13]. For binary (0/1) target the 'Gini measure of impurity" of a node t is: Classification and regression
free provide automatic construction of new features within each node and for the binary target[11].
Figure 1: Screen shot for C4.5 classifier performance
2. Adaboost It is a machine algorithm, formulated by Yeave Freud and Robert Scapire. It is a meta-learning
algorithm and used in conjunction with many other learning algorithms to improve their performance.
[11]AdaBoost is an algorithm for constructing a”strong” classifier as linear combination. AdaBoost is adaptive
only for this reason that subsequent classifiers built are weaked in favour of those instances misclassified by
previous classifiers. AdaBoost is sensitive to noisy data and outliers. In some problems, however, it can be less
susceptible to the over fitting problem than most learning algorithms. The classifiers it uses can be weak (i.e.,
display a substantial error rate), but as long as their performance is not random (resulting in an error rate of 0.5
for binary classification), they will improve the final model.
Application of Meta learning in Banking Sector
International organization of Scientific Research 7 | P a g e
Figure 2: Screen shot for Adaboost classifier performance
3. Logiboost LogitBoost[11] is a boosting algorithm formulated by Jerome Friedmome, Trevor Hastie, and
Robert Tibshirani. LogitBoost represents an application of established logistic regression to the AdaBoost
method. Rather than minimizing error with respect to y, weak learners are chosen to minimize the (weighted
least-squares) error of with respect to
,
where , and .
That is is the Newton-Raphson approximation of the minimizer of the log-likelihood error at stage , and the
weak learner is chosen as the learner that best approximates by weighted least squares.
Figure 3: Screen shot for Logiboost classifier performance
Application of Meta learning in Banking Sector
International organization of Scientific Research 8 | P a g e
4. Bagging
Figure 4: Screen shot for Grading classifier performance
Algorithm
classification
Correct
Classification Rate Mis- Classification
CART 88.0000 12.0000
ADABOOST 73.0000 27.0000
GRADING 54.3333 45.6667
BAGGING 90.3333 9.6667
Figure-4
Application of Meta learning in Banking Sector
International organization of Scientific Research 9 | P a g e
Figure-5
.Graph for Correct Classification VS. Misclassification rate
Figure-6
III. CONCLUSIONS
This paper represents computational issues of four supervised machine learning algorithms i.e.,
Classification and Regression technique algorithm, Adaboost algorithm, Logitboost algorithm and Bagging
algorithm with dedicating role of detection of weather forecast on the basis of classification rule. Among four
algorithms, Bagging algorithm is the best in performance.. In order to compare the classification performance of
four machine learning algorithms, classifiers are applied on same data and results are compared on the basis of
misclassification and correct classification rate and according to experimental results in table 1, it can be
concluded that Baging is the best as compared to adaboost, logitboost, Classification and Regression Technique.
Application of Meta learning in Banking Sector
International organization of Scientific Research 10 | P a g e
REFERENCES
[1] QUINLAN, J. R. (1988). C4.5: Programs for Machine Learning, Morgan Kaufmann Publishers, San
Mateo, CA.
[2] C4.5 Algorithm - Disponible inhttp://www.cse.unsw.edu.au/~quinlan . Accessed in 25 July 2010.
[3] Bruno Carneiro da Rocha1,2 and Rafael Timóteo de Sousa Júnior2 “IDENTIFYING BANK FRAUDS
USING CRISP-DM AND DECISION TREES” International journal of computer science & information
Technology (IJCSIT) Vol.2, No.5, October 2010.
[4] Mabroukeh, N.R. and C.I. Ezeife, 2010. A taxonomy of sequential pattern mining algorithms. ACM
Comput. Surveys. DOI: 10.1145/1824795.1824798
[5] Ramageri, B.M. and B.L. Desai, 2013 Role of datamining in retail sector. Int. J. Comput. Sci. Eng., 5.
Moradi, M., M. Salehi, M.E. Ghorgani and H.S. Yazdi, 2013. Financial distress prediction of Iranian
companies by using data mining techniques. Organizacija, 46: 20-27.
[6] Ramageri, B.M. and B.L. Desai, 2013 Role of data mining in retail sector. Int. J. Comput. Sci. Eng., 5.
[7] Delamaire, L., A. Hussein and P. John, 2009. Credit card fraud and detection techniques: A review.
Banks Bank Syst., 4: 57-68.
[8] Ravisankar, P., V. Ravi, G.R. Rao and I. Bose, 2011. Detection of financial statement fraud and feature
selection using data mining techniques. Decision Support Syst., 50: 491-500. DOI:
10.1016/j.dss.2010.11.006.
[9] Raj, S.B.E. and A.A. Portia, 2011. Analysis on credit card fraud detection methods. Proceedings of the
International Conference on Computer Communication and Electrical Technology, Mar. 18- 19, IEEE
Xplore Press, Tamilnadu, pp: 152-156.DOI: 10.1109/ICCCET.2011.5762457.
[10] Petry, F.E. and L. Zhao, 2009. Data mining by attribute generalization with fuzzy hierarchies in fuzzy
databases. Fuzzy Sets Syst., 160: 2206-2223. DOI 10.1016/j.fss.2009.02.014.
[11] Changdola, V., A. Banerjee and V. Kumar, 2009. Anomaly detection: A survey. ACM Comput. Surveys,
9: 1-72.
[12] Moin, K.I. and Q.B. Ahmed, 2012. Use of data mining in banking. Int. J. Eng. Res. Applic., 2: 738-742.
[13] Chen, I., L. Chi-Jie, L. Tian-Shyug and L. Chung-Ta, 2009. Behavioral scoring model for bank customers
using data envelopment analysis. Stud. Comput. Intell., 214: 99-104. DOI: 10.1007/978-3-540-92814-
0_16
Yap, B.W., S.H. Ong and N.H.M. Hussain, 2011. Using
[14] Yap, B.W., S.H. Ong and N.H.M. Hussain, 2011. Using data mining to improve assessment of credit
worthiness via credit scoring models. Expert Syst. Appli., 38: 13274-13283. DOI:
10.1016/j.eswa.2011.04.147.
[15] Bhattacharya, S., S. Jha, K. Tharakunnel and J.C. Westland, 2011. Data mining for credit card fraud: A
comparative study. Decision Support Syst., 50: 602- 613. DOI: 10.1016/j.dss.2010.08.008 .
[16] Ravisankar, P., V. Ravi, G.R. Rao and I. Bose, 2011. Detection of financial statement fraud and feature
selection using data mining techniques. Decision Support Syst., 50: 491-500. DOI:
[17] Ravisankar, P., V. Ravi, G.R. Rao and I. Bose, 2011.Detection of financial statement fraud and feature
selection using data mining techniques. Decision Support Syst., 50: 491-500. DOI:
10.1016/j.dss.2010.11.006 .
[18] Dorr, D.M. and M.D. Anne, 2009. Establishing relationships among patterns instock market data. Data
Knowl. Eng., 68: 318-337. DOI 10.1016/j.datak.2008.10.001.
[19] Tsai, H.H., 2012. Global data mining: An empirical study of current trends, future forecasts and
technology diffusions. Expert Syst. Applic., 39: 8172-8181.

More Related Content

PDF
The use of genetic algorithm, clustering and feature selection techniques in ...
IJMIT JOURNAL
 
PDF
Applying Convolutional-GRU for Term Deposit Likelihood Prediction
VandanaSharma356
 
PDF
Applications of machine learning
business Corporate
 
PDF
LABELING CUSTOMERS USING DISCOVERED KNOWLEDGE CASE STUDY: AUTOMOBILE INSURAN...
ijmvsc
 
PDF
THE EFFECTIVENESS OF DATA MINING TECHNIQUES IN BANKING
csijjournal
 
PDF
IRJET- Credit Card Fraud Detection using Random Forest
IRJET Journal
 
PDF
DEVELOPING PREDICTION MODEL OF LOAN RISK IN BANKS USING DATA MINING
mlaij
 
PDF
Default Probability Prediction using Artificial Neural Networks in R Programming
Vineet Ojha
 
The use of genetic algorithm, clustering and feature selection techniques in ...
IJMIT JOURNAL
 
Applying Convolutional-GRU for Term Deposit Likelihood Prediction
VandanaSharma356
 
Applications of machine learning
business Corporate
 
LABELING CUSTOMERS USING DISCOVERED KNOWLEDGE CASE STUDY: AUTOMOBILE INSURAN...
ijmvsc
 
THE EFFECTIVENESS OF DATA MINING TECHNIQUES IN BANKING
csijjournal
 
IRJET- Credit Card Fraud Detection using Random Forest
IRJET Journal
 
DEVELOPING PREDICTION MODEL OF LOAN RISK IN BANKS USING DATA MINING
mlaij
 
Default Probability Prediction using Artificial Neural Networks in R Programming
Vineet Ojha
 

What's hot (20)

PDF
IRJET- Prediction of Credit Risks in Lending Bank Loans
IRJET Journal
 
PDF
Credit iconip
Armando Vieira
 
PDF
An efficient data pre processing frame work for loan credibility prediction s...
eSAT Journals
 
PDF
A Review on Credit Card Default Modelling using Data Science
YogeshIJTSRD
 
PDF
Fuzzy Analytic Hierarchy Based DBMS Selection In Turkish National Identity Ca...
Ferhat Ozgur Catak
 
PDF
An Innovative Approach to Predict Bankruptcy
vivatechijri
 
PDF
Extended pso algorithm for improvement problems k means clustering algorithm
IJMIT JOURNAL
 
PDF
A simulated decision trees algorithm (sdt)
Mona Nasr
 
PDF
Credit Scoring Using CART Algorithm and Binary Particle Swarm Optimization
IJECEIAES
 
PDF
Data mining software comparison
Esteban Alcaide
 
PPTX
Credit defaulter analysis
Nimai Chand Das Adhikari
 
PDF
Welcome to International Journal of Engineering Research and Development (IJERD)
IJERD Editor
 
PDF
Clustering Prediction Techniques in Defining and Predicting Customers Defecti...
IJECEIAES
 
PPTX
Credit card fraud detection
vineeta vineeta
 
PPTX
Mining Credit Card Defults
Krunal Khatri
 
PDF
Instance Selection and Optimization of Neural Networks
ITIIIndustries
 
PDF
IRJET- Analysis on Credit Card Fraud Detection using Capsule Network
IRJET Journal
 
PDF
Predicting Credit Card Defaults using Machine Learning Algorithms
Sagar Tupkar
 
PDF
Proficiency comparison ofladtree
ijcsa
 
PDF
TOURISM DEMAND FORECASTING MODEL USING NEURAL NETWORK
ijcsit
 
IRJET- Prediction of Credit Risks in Lending Bank Loans
IRJET Journal
 
Credit iconip
Armando Vieira
 
An efficient data pre processing frame work for loan credibility prediction s...
eSAT Journals
 
A Review on Credit Card Default Modelling using Data Science
YogeshIJTSRD
 
Fuzzy Analytic Hierarchy Based DBMS Selection In Turkish National Identity Ca...
Ferhat Ozgur Catak
 
An Innovative Approach to Predict Bankruptcy
vivatechijri
 
Extended pso algorithm for improvement problems k means clustering algorithm
IJMIT JOURNAL
 
A simulated decision trees algorithm (sdt)
Mona Nasr
 
Credit Scoring Using CART Algorithm and Binary Particle Swarm Optimization
IJECEIAES
 
Data mining software comparison
Esteban Alcaide
 
Credit defaulter analysis
Nimai Chand Das Adhikari
 
Welcome to International Journal of Engineering Research and Development (IJERD)
IJERD Editor
 
Clustering Prediction Techniques in Defining and Predicting Customers Defecti...
IJECEIAES
 
Credit card fraud detection
vineeta vineeta
 
Mining Credit Card Defults
Krunal Khatri
 
Instance Selection and Optimization of Neural Networks
ITIIIndustries
 
IRJET- Analysis on Credit Card Fraud Detection using Capsule Network
IRJET Journal
 
Predicting Credit Card Defaults using Machine Learning Algorithms
Sagar Tupkar
 
Proficiency comparison ofladtree
ijcsa
 
TOURISM DEMAND FORECASTING MODEL USING NEURAL NETWORK
ijcsit
 
Ad

Viewers also liked (14)

DOC
200182551 syllabus-case-studies-in-corporate-finance
homeworkping4
 
DOCX
Liberty Cjus 420 final exam
Homework Simple
 
PPT
Family Portrait Photography Canada
Chris Bernard
 
PPTX
Persebaran flora dan fauna didunia
Yusron Nur Rosyid
 
DOCX
201642103 bus-org-cases-part-2
homeworkping4
 
PDF
E05842831
IOSR-JEN
 
PDF
Gift pens for men
Treeringpens
 
PPTX
Hostess per discoteche edˇeventi
Milano Model Agency
 
PPTX
Chương trình kì diệu của ĐCT
Cường Lê
 
PDF
Behera, Santosh Kumar
Santosh Behera
 
PPTX
The mission
Marius Iluca
 
PDF
景氣不見底 失業恐惡化 自由時報 20120827
Ray Chou
 
PPTX
竹圍高中授課W2
Daisy Chang
 
PDF
心靈彩繪:直覺式創意曼陀羅,探索自我與靜心紓壓
鼓手 心
 
200182551 syllabus-case-studies-in-corporate-finance
homeworkping4
 
Liberty Cjus 420 final exam
Homework Simple
 
Family Portrait Photography Canada
Chris Bernard
 
Persebaran flora dan fauna didunia
Yusron Nur Rosyid
 
201642103 bus-org-cases-part-2
homeworkping4
 
E05842831
IOSR-JEN
 
Gift pens for men
Treeringpens
 
Hostess per discoteche edˇeventi
Milano Model Agency
 
Chương trình kì diệu của ĐCT
Cường Lê
 
Behera, Santosh Kumar
Santosh Behera
 
The mission
Marius Iluca
 
景氣不見底 失業恐惡化 自由時報 20120827
Ray Chou
 
竹圍高中授課W2
Daisy Chang
 
心靈彩繪:直覺式創意曼陀羅,探索自我與靜心紓壓
鼓手 心
 
Ad

Similar to B05840510 (20)

PDF
Supervised and unsupervised data mining approaches in loan default prediction
IJECEIAES
 
PDF
Dk24717723
IJERA Editor
 
PDF
A Review On Data Mining In Banking Sector
Kim Daniels
 
PDF
Thinking space
Titos Matsakos
 
PPT
Introduction data mining
Rana Chakraborty
 
PDF
A rule-based machine learning model for financial fraud detection
IJECEIAES
 
PPTX
Data mining & Decison Trees
Selman Bozkır
 
PPTX
Machine learning in Banks
Abhishek Upadhyay
 
PPTX
Data mining
sayalipatil528
 
PPTX
Application areas of data mining
priya jain
 
PDF
dataminingppt-170616163835.pdf jejwwkwnwnn
jainutkarsh078
 
PPTX
Data mining
District Administration
 
PDF
Data mining on Financial Data
AmarnathVenkataraman
 
PDF
Credit card fraud detection through machine learning
dataalcott
 
PDF
Lecture 01 Data Mining
Pier Luca Lanzi
 
PPTX
Dw dm-banking
A P
 
PPTX
Data mining
jadhav_priti
 
PDF
Impact of machine learning on banking sector
Abhishek Verma
 
PPT
Data mining
Cloudbells.com
 
Supervised and unsupervised data mining approaches in loan default prediction
IJECEIAES
 
Dk24717723
IJERA Editor
 
A Review On Data Mining In Banking Sector
Kim Daniels
 
Thinking space
Titos Matsakos
 
Introduction data mining
Rana Chakraborty
 
A rule-based machine learning model for financial fraud detection
IJECEIAES
 
Data mining & Decison Trees
Selman Bozkır
 
Machine learning in Banks
Abhishek Upadhyay
 
Data mining
sayalipatil528
 
Application areas of data mining
priya jain
 
dataminingppt-170616163835.pdf jejwwkwnwnn
jainutkarsh078
 
Data mining on Financial Data
AmarnathVenkataraman
 
Credit card fraud detection through machine learning
dataalcott
 
Lecture 01 Data Mining
Pier Luca Lanzi
 
Dw dm-banking
A P
 
Data mining
jadhav_priti
 
Impact of machine learning on banking sector
Abhishek Verma
 
Data mining
Cloudbells.com
 

More from IOSR-JEN (20)

PDF
C05921721
IOSR-JEN
 
PDF
B05921016
IOSR-JEN
 
PDF
A05920109
IOSR-JEN
 
PDF
J05915457
IOSR-JEN
 
PDF
I05914153
IOSR-JEN
 
PDF
H05913540
IOSR-JEN
 
PDF
G05913234
IOSR-JEN
 
PDF
F05912731
IOSR-JEN
 
PDF
E05912226
IOSR-JEN
 
PDF
D05911621
IOSR-JEN
 
PDF
C05911315
IOSR-JEN
 
PDF
B05910712
IOSR-JEN
 
PDF
A05910106
IOSR-JEN
 
PDF
I05844759
IOSR-JEN
 
PDF
H05844346
IOSR-JEN
 
PDF
G05843942
IOSR-JEN
 
PDF
F05843238
IOSR-JEN
 
PDF
D05842227
IOSR-JEN
 
PDF
C05841121
IOSR-JEN
 
PDF
B05840510
IOSR-JEN
 
C05921721
IOSR-JEN
 
B05921016
IOSR-JEN
 
A05920109
IOSR-JEN
 
J05915457
IOSR-JEN
 
I05914153
IOSR-JEN
 
H05913540
IOSR-JEN
 
G05913234
IOSR-JEN
 
F05912731
IOSR-JEN
 
E05912226
IOSR-JEN
 
D05911621
IOSR-JEN
 
C05911315
IOSR-JEN
 
B05910712
IOSR-JEN
 
A05910106
IOSR-JEN
 
I05844759
IOSR-JEN
 
H05844346
IOSR-JEN
 
G05843942
IOSR-JEN
 
F05843238
IOSR-JEN
 
D05842227
IOSR-JEN
 
C05841121
IOSR-JEN
 
B05840510
IOSR-JEN
 

Recently uploaded (20)

PDF
Enable Enterprise-Ready Security on IBM i Systems.pdf
Precisely
 
PDF
Using Anchore and DefectDojo to Stand Up Your DevSecOps Function
Anchore
 
PDF
Software Development Company | KodekX
KodekX
 
PDF
CIFDAQ's Token Spotlight: SKY - A Forgotten Giant's Comeback?
CIFDAQ
 
PDF
Make GenAI investments go further with the Dell AI Factory - Infographic
Principled Technologies
 
PDF
Google’s NotebookLM Unveils Video Overviews
SOFTTECHHUB
 
PPTX
How to Build a Scalable Micro-Investing Platform in 2025 - A Founder’s Guide ...
Third Rock Techkno
 
PPTX
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
AVTRON Technologies LLC
 
PDF
agentic-ai-and-the-future-of-autonomous-systems.pdf
siddharthnetsavvies
 
PDF
A Day in the Life of Location Data - Turning Where into How.pdf
Precisely
 
PDF
Security features in Dell, HP, and Lenovo PC systems: A research-based compar...
Principled Technologies
 
PDF
Oracle AI Vector Search- Getting Started and what's new in 2025- AIOUG Yatra ...
Sandesh Rao
 
PDF
How-Cloud-Computing-Impacts-Businesses-in-2025-and-Beyond.pdf
Artjoker Software Development Company
 
PDF
DevOps & Developer Experience Summer BBQ
AUGNYC
 
PDF
Google I/O Extended 2025 Baku - all ppts
HusseinMalikMammadli
 
PPTX
New ThousandEyes Product Innovations: Cisco Live June 2025
ThousandEyes
 
PDF
NewMind AI Weekly Chronicles - July'25 - Week IV
NewMind AI
 
PDF
Unlocking the Future- AI Agents Meet Oracle Database 23ai - AIOUG Yatra 2025.pdf
Sandesh Rao
 
PPTX
Comunidade Salesforce São Paulo - Desmistificando o Omnistudio (Vlocity)
Francisco Vieira Júnior
 
PDF
REPORT: Heating appliances market in Poland 2024
SPIUG
 
Enable Enterprise-Ready Security on IBM i Systems.pdf
Precisely
 
Using Anchore and DefectDojo to Stand Up Your DevSecOps Function
Anchore
 
Software Development Company | KodekX
KodekX
 
CIFDAQ's Token Spotlight: SKY - A Forgotten Giant's Comeback?
CIFDAQ
 
Make GenAI investments go further with the Dell AI Factory - Infographic
Principled Technologies
 
Google’s NotebookLM Unveils Video Overviews
SOFTTECHHUB
 
How to Build a Scalable Micro-Investing Platform in 2025 - A Founder’s Guide ...
Third Rock Techkno
 
PA Analog/Digital System: The Backbone of Modern Surveillance and Communication
AVTRON Technologies LLC
 
agentic-ai-and-the-future-of-autonomous-systems.pdf
siddharthnetsavvies
 
A Day in the Life of Location Data - Turning Where into How.pdf
Precisely
 
Security features in Dell, HP, and Lenovo PC systems: A research-based compar...
Principled Technologies
 
Oracle AI Vector Search- Getting Started and what's new in 2025- AIOUG Yatra ...
Sandesh Rao
 
How-Cloud-Computing-Impacts-Businesses-in-2025-and-Beyond.pdf
Artjoker Software Development Company
 
DevOps & Developer Experience Summer BBQ
AUGNYC
 
Google I/O Extended 2025 Baku - all ppts
HusseinMalikMammadli
 
New ThousandEyes Product Innovations: Cisco Live June 2025
ThousandEyes
 
NewMind AI Weekly Chronicles - July'25 - Week IV
NewMind AI
 
Unlocking the Future- AI Agents Meet Oracle Database 23ai - AIOUG Yatra 2025.pdf
Sandesh Rao
 
Comunidade Salesforce São Paulo - Desmistificando o Omnistudio (Vlocity)
Francisco Vieira Júnior
 
REPORT: Heating appliances market in Poland 2024
SPIUG
 

B05840510

  • 1. IOSR Journal of Engineering (IOSRJEN) www.iosrjen.org ISSN (e): 2250-3021, ISSN (p): 2278-8719 Vol. 05, Issue 08 (August. 2015), ||V4|| PP 05-10 International organization of Scientific Research 5 | P a g e Application of Meta learning in Banking Sector Sanjay Kumar Sen, Dr. B.K. Ratha. Assistant Professor, Dept. of CSE . Orissa Engineering College, Bhubaneswar, Odisha, India [email protected] Reader, Dept. of Computer Science and Application, Utkal University, Vani Vihar, Bhubaneswar. [email protected] Abstract: - The data mining technology is a process of identifying patterns and information from a huge quantity of data. In a single repository data base where data is stored in central site, then applying data mining algorithms on these data base, patterns are extracted which are converted into information. It becomes important area for financial organization like banking sector, insurance sector, stock market etc. Banking systems gather huge amount of data like customer information, transaction details, loan details, credit card details etc. It helps to find hidden patterns in a group and discover unknown relationship in the data. Instead of quantitative and statistical data characteristics, this technique helps in data interpretations for the banking sector. Data mining tools helpful for leading banking for credit scoring service and approval predicting payment lapse, fraud detecting etc. By applying data mining techniques bankers are getting advantage in competitive market. It is also helpful for the retailers for customer buying tradition and also desires. This proposed work in this paper is the combination of supervised machine learning algorithms, Classification and Regression Tree (CART), Adaboost and Logitboost, Bagging and Dagging are proposed for classification of banking data. These resulted forms help researchers to detect fraud in banking data. The experimental result shows the performance analysis of different meta-learning algorithms and also compared on the basis of misclassification and correct classification rate. Smaller misclassification reveals that bagging algorithm performs better classification of banking data fraud detection technique.. The proposed work in this paper is the combination of five supervised machine learning algorithms, Classification and Regression Tree (CART), Adaboost, Grading, Bagging are proposed for classification of banking data. The experimental result shows the performance analysis of different meta-learning algorithms and also compared on the basis of misclassification and correct classification rate. Smaller misclassification reveals that bagging algorithm performs better classification of banking data fraud detection technique. I. INTRODUCTION The modern technology has great influence on banking business. All traditional banks compelled to adopt and implement new technological approach not only to improve financial and customer relationship but also to reduce time and cost to survive in this competitive market. Some automated software which radically changed the basic definition of the definition of the banking business. Data mining is the process of extracting hidden and unknown knowledge by applying statistical and machine-learning techniques for extracting interesting patterns from huge data base. Data mining is the application of statistical and machine-learning techniques for extracting interesting patterns from raw data [4] The strong security systems maintained in banking industries to avoid fraud in internet banking. Though fraud in banking can not be removed drastically., but to the some extent can be minimized. Programs for Machine Learning, published in 1993 [1], that many applications of artificial intelligence are based on a model of knowledge that is usually employed by a human specialist. This paper helps to minimize the bank fraud by using different meta learning algorithm. Initial work on the forecasting system of banking system is done by Frankel and Rose using some indicators in relation to the emerging currency markets[2]. The first group indicates integrated indicators of interest rates of interest rates and output growth[3]. The second group includes national macroeconomic indicators like sudden changes the rate of production[3]. The third category includes indicators of price revaluation, the current account deficit and the level of debt. The fourth class of variables describes the determinants of a debt structure [10]. 1.1Application of Data Mining in Banking Sector Various areas where data mining can be used in financial sectors [5][6][7] like customer segmentation and profitability, credit analysis, predicting payment default, marketing, fraudulent transactions, ranking investments, optimizing stock portfolios, cash management and forecasting operations, high risk loan applicants, most profitable Credit Card Customers and Cross Selling. Certain examples where banking industry has been utilizing the data mining technology effectively as follows.
  • 2. Application of Meta learning in Banking Sector International organization of Scientific Research 6 | P a g e 1.2Fraud Detection Fraud detection [8][9] [10][11] [12] is the recognition of symptoms of fraud where no priorm suspicion or tendency to fraud exists. Fraud is defined as „a deception deliberately practiced in order to secure unfair of unlawful gain. Fraud detection is the detection of criminal activities which occurs in commercial organizations such as banks, credit card issuing organizations, insurance agencies, mobile companies, stock market. Themmalicious users might be the actual customers of the organization or might be posing as a customer (also known as identity theft) [11]. Financial Organizations especially banking sectors follows mainly two approaches[5] [12]towards determining the fraud patterns, online transaction check and Offline transaction Check. 1.3 Risk Management Data Mining is used to identify the risk factors in each department of banking business [6]. Credit Approval authorities in the financial organization used data mining techniques to determine the risk factors in lending decisions[13] by analyzing the data based on nationality, repayment capacity and so on. Retail marketing department uses data mining methodologies to find the reliability[14] and the behavior of credit card applicant[8] while selling the credit cards. They uses data mining techniques on existing customers to sell credit cards or increase customers credits or top up on credit card loans [15]. In commercial lending, data mining plays a vital role. In commercial lending, risk assessment is usually an attempt to quantify the risk of default or loss to the lender when making a particular lending decision or approving a credit facility [8]. Data Mining can be used to derive credit behavior [8] of individual borrowers with parameters card loans, mortgage value, repayment and using characteristics such as history of credit, employmen period and length of residency. A score is thus produced that allow a lender to evaluate the customer and decide whether the person is a good candidate for a loan, or if there is a tendency to become high risk of default[14] Customers who have been with bank for a longer periods of time, remained better with bank and have good credit history and have higher salaries/wages, are more likely to receive a loan than a new customer who has no credit history with the bank, or who earns low salaries/wages[17].Bank can reduce the risk factors to maintain a better position by knowing the chances of a customer to become default[18] [19]. II. ALGORITHM USED 1.Classification and Regression Tree(CART) It was introduce by Breimann 1984.It builds both classification and regression tree (Gini index measure is used for selecting splitting attribute. Pruning is done on training data set. It can deal with both numeric and categorical attributes and can also handle missing attributes. [12]. The CART monograph focus on the Gini rule which is similar to the better know entropy or information gain criterion [13]. For binary (0/1) target the 'Gini measure of impurity" of a node t is: Classification and regression free provide automatic construction of new features within each node and for the binary target[11]. Figure 1: Screen shot for C4.5 classifier performance 2. Adaboost It is a machine algorithm, formulated by Yeave Freud and Robert Scapire. It is a meta-learning algorithm and used in conjunction with many other learning algorithms to improve their performance. [11]AdaBoost is an algorithm for constructing a”strong” classifier as linear combination. AdaBoost is adaptive only for this reason that subsequent classifiers built are weaked in favour of those instances misclassified by previous classifiers. AdaBoost is sensitive to noisy data and outliers. In some problems, however, it can be less susceptible to the over fitting problem than most learning algorithms. The classifiers it uses can be weak (i.e., display a substantial error rate), but as long as their performance is not random (resulting in an error rate of 0.5 for binary classification), they will improve the final model.
  • 3. Application of Meta learning in Banking Sector International organization of Scientific Research 7 | P a g e Figure 2: Screen shot for Adaboost classifier performance 3. Logiboost LogitBoost[11] is a boosting algorithm formulated by Jerome Friedmome, Trevor Hastie, and Robert Tibshirani. LogitBoost represents an application of established logistic regression to the AdaBoost method. Rather than minimizing error with respect to y, weak learners are chosen to minimize the (weighted least-squares) error of with respect to , where , and . That is is the Newton-Raphson approximation of the minimizer of the log-likelihood error at stage , and the weak learner is chosen as the learner that best approximates by weighted least squares. Figure 3: Screen shot for Logiboost classifier performance
  • 4. Application of Meta learning in Banking Sector International organization of Scientific Research 8 | P a g e 4. Bagging Figure 4: Screen shot for Grading classifier performance Algorithm classification Correct Classification Rate Mis- Classification CART 88.0000 12.0000 ADABOOST 73.0000 27.0000 GRADING 54.3333 45.6667 BAGGING 90.3333 9.6667 Figure-4
  • 5. Application of Meta learning in Banking Sector International organization of Scientific Research 9 | P a g e Figure-5 .Graph for Correct Classification VS. Misclassification rate Figure-6 III. CONCLUSIONS This paper represents computational issues of four supervised machine learning algorithms i.e., Classification and Regression technique algorithm, Adaboost algorithm, Logitboost algorithm and Bagging algorithm with dedicating role of detection of weather forecast on the basis of classification rule. Among four algorithms, Bagging algorithm is the best in performance.. In order to compare the classification performance of four machine learning algorithms, classifiers are applied on same data and results are compared on the basis of misclassification and correct classification rate and according to experimental results in table 1, it can be concluded that Baging is the best as compared to adaboost, logitboost, Classification and Regression Technique.
  • 6. Application of Meta learning in Banking Sector International organization of Scientific Research 10 | P a g e REFERENCES [1] QUINLAN, J. R. (1988). C4.5: Programs for Machine Learning, Morgan Kaufmann Publishers, San Mateo, CA. [2] C4.5 Algorithm - Disponible inhttp://www.cse.unsw.edu.au/~quinlan . Accessed in 25 July 2010. [3] Bruno Carneiro da Rocha1,2 and Rafael Timóteo de Sousa Júnior2 “IDENTIFYING BANK FRAUDS USING CRISP-DM AND DECISION TREES” International journal of computer science & information Technology (IJCSIT) Vol.2, No.5, October 2010. [4] Mabroukeh, N.R. and C.I. Ezeife, 2010. A taxonomy of sequential pattern mining algorithms. ACM Comput. Surveys. DOI: 10.1145/1824795.1824798 [5] Ramageri, B.M. and B.L. Desai, 2013 Role of datamining in retail sector. Int. J. Comput. Sci. Eng., 5. Moradi, M., M. Salehi, M.E. Ghorgani and H.S. Yazdi, 2013. Financial distress prediction of Iranian companies by using data mining techniques. Organizacija, 46: 20-27. [6] Ramageri, B.M. and B.L. Desai, 2013 Role of data mining in retail sector. Int. J. Comput. Sci. Eng., 5. [7] Delamaire, L., A. Hussein and P. John, 2009. Credit card fraud and detection techniques: A review. Banks Bank Syst., 4: 57-68. [8] Ravisankar, P., V. Ravi, G.R. Rao and I. Bose, 2011. Detection of financial statement fraud and feature selection using data mining techniques. Decision Support Syst., 50: 491-500. DOI: 10.1016/j.dss.2010.11.006. [9] Raj, S.B.E. and A.A. Portia, 2011. Analysis on credit card fraud detection methods. Proceedings of the International Conference on Computer Communication and Electrical Technology, Mar. 18- 19, IEEE Xplore Press, Tamilnadu, pp: 152-156.DOI: 10.1109/ICCCET.2011.5762457. [10] Petry, F.E. and L. Zhao, 2009. Data mining by attribute generalization with fuzzy hierarchies in fuzzy databases. Fuzzy Sets Syst., 160: 2206-2223. DOI 10.1016/j.fss.2009.02.014. [11] Changdola, V., A. Banerjee and V. Kumar, 2009. Anomaly detection: A survey. ACM Comput. Surveys, 9: 1-72. [12] Moin, K.I. and Q.B. Ahmed, 2012. Use of data mining in banking. Int. J. Eng. Res. Applic., 2: 738-742. [13] Chen, I., L. Chi-Jie, L. Tian-Shyug and L. Chung-Ta, 2009. Behavioral scoring model for bank customers using data envelopment analysis. Stud. Comput. Intell., 214: 99-104. DOI: 10.1007/978-3-540-92814- 0_16 Yap, B.W., S.H. Ong and N.H.M. Hussain, 2011. Using [14] Yap, B.W., S.H. Ong and N.H.M. Hussain, 2011. Using data mining to improve assessment of credit worthiness via credit scoring models. Expert Syst. Appli., 38: 13274-13283. DOI: 10.1016/j.eswa.2011.04.147. [15] Bhattacharya, S., S. Jha, K. Tharakunnel and J.C. Westland, 2011. Data mining for credit card fraud: A comparative study. Decision Support Syst., 50: 602- 613. DOI: 10.1016/j.dss.2010.08.008 . [16] Ravisankar, P., V. Ravi, G.R. Rao and I. Bose, 2011. Detection of financial statement fraud and feature selection using data mining techniques. Decision Support Syst., 50: 491-500. DOI: [17] Ravisankar, P., V. Ravi, G.R. Rao and I. Bose, 2011.Detection of financial statement fraud and feature selection using data mining techniques. Decision Support Syst., 50: 491-500. DOI: 10.1016/j.dss.2010.11.006 . [18] Dorr, D.M. and M.D. Anne, 2009. Establishing relationships among patterns instock market data. Data Knowl. Eng., 68: 318-337. DOI 10.1016/j.datak.2008.10.001. [19] Tsai, H.H., 2012. Global data mining: An empirical study of current trends, future forecasts and technology diffusions. Expert Syst. Applic., 39: 8172-8181.