Fraud Detection System Micro-Project
Fraud Detection System Micro-Project
Now a day the usage of credit cards has dramatically increased. As credit
card becomes the most popular mode of payment for both online as well
as regular purchase, cases of fraud associated with it are also rising. In
this paper, we model the sequence of operations in credit card transaction
processing using a Decision tree and Deep Neural Network show how it
can be used for the detection of frauds. An both algorithms is initially
trained with the normal behaviour of a cardholder. If an incoming credit
card transaction is not accepted by the trained with sufficiently high
probability, it is considered to be fraudulent. At the same time, we try to
ensure that genuine transactions. We present
Motivation
The prediction Model will describe you whether to invest in the proposal or
not. Here, we choose to minimize the risk for investing, i.e. we aim to
minimize investing in proposals for which the loan will not be paid back.
Issues
Motivation
Issues
Abstract
In our project, mainly focussed on credit card fraud detection for in
real world. Initially I will collect the credit card datasets for trained
dataset. Then will provide the user credit card queries for testing
data set. After classification process of random forest algorithm
using to the already analysing data set and user provide current
dataset. Finally optimizing the accuracy of the result data. Then will
apply the processing of some of the attributes provided can find
affected fraud detection in viewing the graphical model
visualization. The performance of the techniques is evaluated based
on accuracy, sensitivity, and specificity, precision. The results
indicate about the optimal accuracy for Decision tree are 98.6%
respectively.
Existing System
Disadvantage
Proposed System
Advantage
Hardware
RAM-4GB
Software
Python
Anaconda
PROBLEM STATEMENT
METHODOLOGY
To helps us to get awareness about the fraudulent and without loss of any
financially.
MODULES
1. DATA COLLECTION
2. DATA PRE-PROCESSING
3. FEATURE EXTRATION
4. EVALUATION MODEL
DATA COLLECTION
Data used in this paper is a set of product reviews collected from credit
card transactions records. This step is concerned with selecting the subset
of all available data that you will be working with. ML problems start with
data preferably, lots of data (examples or observations) for which you
already know the target answer. Data for which you already know the
target answer is called labelled data.
DATA PRE-PROCESSING
Organize your selected data by formatting, cleaning and sampling from it.
Formatting: The data you have selected may not be in a format that
is suitable for you to work with. The data may be in a relational
database and you would like it in a flat file, or the data may be in a
proprietary file format and you would like it in a relational database
or a text file.
FEATURE EXTRATION
EVALUATION MODEL
UML DIAGRAMS
SEQUENCE DIAGRAM
ACTIVITY DIAGRAM
COLLABRATION DIAGRAM
REQUIREMENTS ANAYLSIS
SOFTWARE REQUIREMENTS
• Python
• Anaconda Navigator
o Numpy
o Pandas
o Matplotlib
o Sklearn
o Seaborm
ANACONDA NAVIGATOR
Jupyter Notebook
QT Console
Spyder
VS Code
Glue viz
Orange 3 App
Rodeo
RStudio
The simplest way is with Spyder. From the Navigator Home tab,
click Spyder, and write and execute your code.
You can also use Jupyter Notebooks the same way. Jupyter
Notebooks are an increasingly popular system that combine your
code, descriptive text, output, images and interactive interfaces
into a single notebook file that is edited, viewed and used in a web
browser.
PYTHON
Python
Features of Python
You can freely use and distribute Python, even for commercial use.
Not only can you use and distribute software’s written in it, you can
even make changes to the Python’s source code. Python has a large
community constantly improving it in each iteration.
Portability
Object-oriented
NUMPY
TESTING
TESTING METHODS
• Functional Testing
Integration Testing
The proposed paper evaluate that the Decision tree and support vector
machine algorithm will perform better with a larger number of training
data comparing to Adaboost classifier, but speed during testing and
application will suffer. Application of more pre- processing techniques
would also help. The SVM algorithm still suffers from the imbalanced
dataset problem and requires more pre-processing to give better results at
the results shown by SVM is great but it could have been better if more
pre-processing have been done on the data.so, in proposed work we
balanced the imbalanced data with up- sampling technique during pre-
processing. We review the existing works on credit card fraud prediction in
three different perspectives: datasets, methods, and metrics. Firstly, we
present the details about the availability of public datasets and what kinds
of details are available in each dataset for predicting credit card fraud.
Secondly, we compare and contrast the various predictive modeling
methods that have been used in the literature for predicting, and then
quantitatively compare their performances in terms of accuracy.
REFERENCES
[5] Z.-H. Zhou and X.-Y. Liu, “Training cost-sensitive neural networks with
Knowledge and Data Engineering, vol. 18, no. 1, pp. 63-77, 2006.