BTAIML10 Major Project Report
BTAIML10 Major Project Report
On
CREDIT CARD APPROVAL PREDICTION
In partial fulfilment of the requirements for the award of the degree of
Bachelor of Technology
In
Artificial Intelligence and Machine Learning
Submitted By
Himanshu Kumar (Roll No. 2101921640017)
Priyanshu Kumar (Roll No. 2101921640036)
Anubhav Singh (Roll No. 2101921640003)
Shreyansh Jaiswal (Roll No. 2101921640053)
Under the Supervision of
Ms. Shabanam Kumari
We hereby declare that the project work presented in this report, entitled
"CREDIT CARD APPROVAL PREDICTION", submitted in partial
fulfillment of the requirements for the award of the degree of Bachelor of
Technology in Artificial Intelligence & Machine Learning, affiliated to Dr.
A.P.J. Abdul Kalam Technical University, Lucknow, is based on our original
work carried out in the Department of Computer Science & Engineering
(AIML) at G.L. Bajaj Institute of Technology & Management, Greater Noida.
We further certify that the work contained in this report is original and has not
been submitted by us for the award of any other degree or diploma.
Date:
Place: Greater Noida
Signature: Signature:
Name: Himanshu Kumar Name: Priyanshu Kumar
Roll No: 2101921640017 Roll No: 2101921640036
Signature: Signature:
Name: Anubhav Singh Name: Shreyansh Jaiswal
Roll No: 2101921640004 Roll No: 2101921640053
ii
Certificate
This is to certify that the project report, entitled "CREDIT CARD APPROVAL
PREDICTION", completed by Himanshu Kumar (2101921640017), Priyanshu
Kumar (2101921640036), Anubhav Singh (2101921640004), and Shreyansh
Jaiswal (2101921640053) of the Artificial Intelligence & Machine Learning
branch, is an original work carried out by them in the Department of Computer
Science & Engineering (AIML) at G.L. Bajaj Institute of Technology &
Management, Greater Noida, under my guidance.
To the best of my knowledge and belief, the content embodied in this project work
has not been submitted earlier for the award of any degree or diploma.
Date:
iii
Acknowledgement
We extend our sincere thanks to our project guide, Ms. Shabanam Kumari, for
her invaluable guidance and support throughout this project. Her extensive
knowledge and insight were instrumental in overcoming the challenges we faced
during the course of this work.
We are profoundly grateful to our Head of Department, Dr. Naresh Kumar, for
his unwavering support and encouragement throughout this project.
We also take this opportunity to thank everyone who has directly or indirectly
contributed to the completion of this project. Their help and support have been a
source of great strength.
We want to thanks our friends who have always encouraged us during this project.
Lastly, we thank all the faculty members of the CSE (AIML) department for their
valuable suggestions and guidance during the course of this project.
iv
Abstract
Upon completion, the system will provide users with the ability to predict whether
their application will be approved or rejected, based on their data inputs, assisting
both applicants and financial institutions in decision-making processes. Further
work includes model optimization, addressing potential biases, and integrating
real-time data to improve prediction accuracy.
v
TABLE OF CONTENT
References
vi
LIST OF FIGURES
vii
Chapter 1
INTRODUCTION
1.1. Overview
Credit card approval prediction is an essential area of research in the finance
sector, particularly in streamlining the credit application process, as illustrated in
Fig. 1.1. Financial institutions are increasingly relying on data-driven models to
determine creditworthiness, and this project aims to build such a model using
machine learning. The focus is on predicting whether a credit card application will
be approved or rejected based on a set of applicant features such as income,
employment status, and credit history.
1.3. Objectives
The main objectives of the project are:
◼ Develop a Predictive Model: The primary objective of this project is to
develop a machine learning model that accurately predicts the approval or
rejection of credit card applications based on historical data.
◼ Preprocess the Dataset: The raw dataset needs to be cleaned and
preprocessed. This involves dealing with missing values, encoding
categorical variables, and scaling numerical features.
2
◼ Model Selection and Evaluation: We aim to test and compare various
machine learning algorithms to identify the one that offers the best
predictive accuracy. Possible models include Logistic Regression, Random
Forest, and Support Vector Machines (SVM).
◼ Optimize the Model: Once the best model is selected, the next step is to
tune the hyperparameters for optimal performance. This can be done using
grid search or random search methods.
◼ Deploy the Model: The final model will be integrated into a web
application or API to allow users to input their data and receive predictions
on whether their credit card application will be approved.
3
Chapter 2
LITERATURE REVIEW
The automation of credit card approval processes through machine learning has
gained significant attention in recent years. This review examines various
approaches and methodologies implemented in this domain.
4
2.3. Comparative Analysis of Classification Algorithms
Several studies have focused on comparing different classification algorithms for
credit card approval prediction. Dalsania et al. [5] conducted a comprehensive
comparison of various classification algorithms, providing insights into their
relative performance and applicability. Building on this, Kibria and Sevkli [6]
extended the comparison to include deep learning techniques, offering valuable
insights into the advantages and limitations of different approaches.
5
Chapter 3
METHODOLOGY / APPROACH
6
iii. Feature Scaling: Some machine learning models, especially those that
rely on distance-based algorithms (e.g., SVM, KNN), benefit from feature
scaling. Standardization or normalization is applied to scale features so
that they have the same range.
iv. Outlier Detection: Outliers are extreme values that may distort the
training process. These are detected and handled appropriately by either
removing or transforming them to avoid skewing the model’s predictions.
v. Feature Engineering: Creating new features based on existing data can
improve model performance. For instance, derived features like "credit
utilization ratio" (credit limit vs. outstanding balance) or "account age"
(calculated from account opening date) may provide additional insights
into an applicant's financial behavior.
7
• Logistic Regression: A simple, yet powerful algorithm for binary
classification. It estimates the probabilities of approval or rejection based
on the input features, as illustrated in Fig. 3.2.
9
• Gradient Boosting Classifier: Gradient Boosting is another ensemble
technique where models are trained sequentially. Each new model corrects
the errors of the previous ones by focusing on the residuals (errors) of the
previous model's predictions. This method is powerful and often yields
high-performance results for classification tasks, as illustrated in Fig. 3.6.
10
Hyperparameter tuning is then performed using techniques like grid search or
random search to optimize the model’s performance.
11
Chapter 4
RESULT & EXPERIMENT
Fig. 4.1. Machine Learning Pipeline for Credit Card Approval Prediction
12
4.1.2. Data Merging
• The account_age_(months) feature was derived by calculating the
minimum balance month for each applicant in the
credit_record.csv dataset and merging it into the main
application_record.csv.
The dataset has been successfully acquired and preprocessed. Missing values have
been handled, categorical variables have been encoded, and feature scaling has
been applied.
Several new features, including "account age" and "credit utilization ratio," have
been successfully created to enhance the predictive power of the models.
13
4.3. Model Preparation
4.3.1. Feature Selection
• Unnecessary features, such as ID and has_mobile, were dropped
using the DropFeatures class, ensuring that only relevant features
were retained for model training.
14
• AdaBoost Classifier: Strong ensemble learning capabilities, good
balance of bias and variance.
15
Chapter 5
CONCLUSION & FUTURE SCOPE
The project presents several promising avenues for future development and
enhancement. The primary focus would be on developing an interactive web
application that enables users to input their personal and financial information and
receive instant predictions about their credit card approval likelihood. This user-
friendly interface would incorporate real-time data validation and provide clear,
16
interpretable results along with basic recommendations based on the prediction
outcome.
17
References
[1] M. P. C. Peiris, “Credit Card Approval Prediction by Using Machine
Learning Techniques,” University of Colombo, 2022. Accessed: Mar. 07,
2023. [Online]. Available: https://dl.ucsc.cmb.ac.lk/jspui/handle/12345
6789/4593
[2] R. Marginingsih and I. Sari, “Nilai Transaksi Non Tunai Terhadap
Pertumbuhan Ekonomi Di Indonesia Periode 2014-2018,” Inovator, vol. 8,
no. 2, p. 24, Sep. 2019, doi: 10.32832/INOVATOR.V8I1.2504.
[3] A. Khan and S. K. Ghosh, “Machine assistance for credit approval?
Random wheel can recommend and explain,” Expert Syst Appl, vol. 215, p.
119231, Apr. 2023, doi: 10.1016/J.ESWA.2022.119231.
[4] Z. Zarnaz, D. Biswas, and K. M. A. Hasan, “Credit Card Approval
Prediction by Nonnegative Tensor Factorization,” International Conference
on Robotics, Electrical and Signal Processing Techniques, pp. 319–323,
2021, doi: 10.1109/ICREST51555.2021.9331172.
[5] N. Dalsania, D. Punatar, and D. Kothari, “Credit Card Approval Prediction
using Classification Algorithms,” vol. 10, 2022, doi:
10.22214/ijraset.2022.47369.
[6] Md. G. Kibria and M. Sevkli, “Application of Deep Learning for Credit
Card Approval: A Comparison with Two Machine Learning Techniques,”
Int J Mach Learn Comput, vol. 11, no. 4, pp. 286–290, Aug. 2021, doi:
10.18178/IJMLC.2021.11.4.1049.
[7] L. Duan, “Performance evaluation and practical use of supervised data
mining algorithms for credit card approval,” Proceedings - 2020
International Conference on Computing and Data Science, CDS 2020, pp.
251–254, Aug. 2020, doi: 10.1109/CDS49703.2020.00057.
[8] S. Hemkiran, G. Sudha Sadasivam, A. Prasanna Rahavendra, and A. K.
Anjhanna, “Design of Automatic Credit Card Approval System Using
Machine Learning,” Lecture Notes in Electrical Engineering, vol. 806, pp.
1–9, 2022, doi: 10.1007/978-981-16- 6448-9_1.
18