Human-Centered Machine Learning Implementation in Banking Case Study in BRILink BRI Branchless Banking Agent Acquisition Upgrade and Activation
Human-Centered Machine Learning Implementation in Banking Case Study in BRILink BRI Branchless Banking Agent Acquisition Upgrade and Activation
Abstract - BRILink is a branchless banking service that agents. Agent acquisition is the process of acquiring
provides quintessential banking transactions to the highly potential customers to become BRILink agents.
unbanked population in Indonesia through human agents. Agent upgrade is the action of rewarding BRILink agents
Due to the important role of agents, the quantity and quality who exceed their target by leveling up their tier. There are
of agents become critical in BRILink operations to achieve three tiers of BRILink agents: pemula agent (first tier),
the expected business performance. Thus, new agent
acquisition as well as agent upgrade and activation are
jawara agent (second tier), and juragan agent (third tier).
essential as endeavors to maintain and improve agent Agent activation is the process of increasing existing
quantity and quality. However, the existing acquisition, agent’s transactions in order to achieve their monthly
upgrade, and activation method is troublesome due its high transaction targets based on the Break Even Point (BEP)
reliance on subjective judgment and non-data-driven formula, where the net income must be at least equal to
approach. Therefore, human-centered machine learning the total expenditure. BRILink Agent Officers (PAB)
solutions are implemented to determine highly qualified and plays a crucial role in acquiring new agents and
potential customers to be acquired, and agents to be developing the existing agents (i.e., activation and
upgraded and activated. Based on evaluation, the upgrade function). Currently, PAB relies predominantly
implemented machine learning solutions significantly can
achieve thousands of agent acquisition and up to 18%
on subjective judgment, intuition, and random selection to
increases of agent upgrade and activation, then increase determine potential new agents to be acquired and
above 13% fee-based income on average. existing agents to be activated and upgraded.
Consequently, many BRILink agents do not perform as
Keywords - branchless banking, machine learning, expected and have relatively low productivity. Currently,
rule-based, software development, change management only 49% of all existing agents reached their BEP target.
To tackle the aforementioned challenges, we develop
an end-to-end Machine Learning (ML) solution to help
I. INTRODUCTION PAB in selecting highly qualified customers to be
acquired and potential existing agents to be upgraded and
BRILink is a branchless banking service, provided by activated. The acquisition function is an ML-based
Bank Rakyat Indonesia (BRI), that serves quintessential solution that utilizes existing BRI customers and BRILink
banking transactions (e.g., transfers, deposits, agents data (e.g., demography, historical transactions and
withdrawals, etc.) to more than 100 million of Indonesia's portfolio). The upgrade and activation function uses a
unbanked population. Customers are served by BRILink rule-based approach based on the business domain
agents, who carry out transactions for customers using knowledge. A web mobile application is developed to
Electronic Data Capture (EDC) or BRILink mobile. On provide a convenient user-interface for PAB to interact
average, BRILink serves more than 100 trillion with the outputs of these functions. In addition, a change
transactions with up to IDR 100 billion revenue per management initiative, in the form of workshops, is held
month. The Financial Services Authority (OJK) has also to provide understanding to PAB regarding the
initiated a branchless banking program that aims to implementation of this new approach, considering that
provide basic savings accounts for all Indonesian citizens this new approach changes their day-to-day workflow.
[1]. As a result, the status of financial inclusion in This solution is expected to improve the existing approach
Indonesia continues to increase, reaching 83.6% in 2021 by utilizing a data-driven approach while reducing the use
from 81.4% in 2020 [2]. of subjective judgment and intuition. Another goal of this
Due to the important role of agents in BRILink daily solution is to increase business profits. In the 6 months
operations and in supporting the growth of financial since the implementation of this solution, the average
inclusion in Indonesia, the quantity and quality of agents fee-based income has increased by up to 13%. The growth
become critical. Thus, new agent acquisition as well as of agent quality to achieve BEP activation to Juragan also
agent upgrade and activation are essential as endeavors to increased by up to 18%.
maintain and improve quantity and quality of BRILink
1348
Authorized licensed use limited to: UNIVERSITAS GADJAH MADA. Downloaded on October 02,2023 at 04:56:27 UTC from IEEE Xplore. Restrictions apply.
Proceedings of The 2022 IEEE International Conference on Industrial Engineering and Engineering Management (IEEM)
II. METHODOLOGY
Each of the CRISP-DM stages is described as
In the banking industry, machine learning (ML) follows.
techniques have been used extensively in a variety of 1. Business understanding. The main business goal is
situations, including product recommendation, credit risk to provide PABs list of potential customers who are
analysis, and customer churn prediction. The financial suitable to be acquired as new BRILink agents.
industry has been the primary impetus for the Data-driven method is proposed as an improvement
development of this ML technology, perhaps as a result of of the existing intuition-based method.
the risk exposure and the significant amount of cash 2. Data understanding. Customers’ demographic,
generated. K-nearest Neighbor (KNN), Support Vector portfolio, transactional, and geospatial data are
Machine (SVM), Tree-Based Models, Ensembles, utilized to analyze highly potential customers.
Boosting Techniques, and Artificial Neural Networks Exploratory data analysis (EDA) is performed to
(ANN) are some of the most pertinent ML techniques. discover and visualize trends and patterns.
However, the Boosting Technique was chosen as the 3. Data preparation. Data preparation is performed to
pre-eminent solution, both for this classification issue and improve the data data quality and to ensure the data is
when used with structured data [3]. ready to be used for training [6]. Some data
There are three main phases in this project. The first preparation that is performed are missing values
one is the development of an ML-based and rule-based imputation, outliers removal, and feature selection.
engine to perform agent acquisition, upgrade, and 4. Modeling. The ML model is developed by using
activation function. The second one is the development of LightGBM. LightGBM is a Gradient Boosting
a web mobile application to provide a convenient Decision Tree (GBDT) method that utilizes Gradient
user-interface for users (i.e., PABs) to interact with the Based One-Side Sampling (GOSS) and Exclusive
outputs of the aforementioned engines. The third one is a Feature Bundling (EFB) [7]. GOSS is a sampling
change management initiative, in the form of workshops, method that keeps data instances with larger
to provide understanding to users regarding the gradients and omits instances with small gradients.
implementation of this new approach. Each phase will be Meanwhile, EFB is an efficient algorithm that
discussed in the following sections. optimizes a bundling problem to a graph coloring
problem and resolves it by greedy algorithm with
A. Models Development constant approximation ratio. In many machine
learning tasks, LightGBM is an efficient GBDT
ML-based solution is used to develop the agent algorithm that delivers greater accuracy in many
acquisition function while rule-based approach is utilized applications, including peer-to-peer lending [8].
to build the agent upgrade and activation function [4]. Additionally, LightGBM significantly enhances
CRISP-DM (Cross Industry Standard Process for Data parallel learning to process large volumes of data in
Mining) methodology [5] is adopted as the process model accordance with the numerous sector data banks [9].
to develop the ML-based solution. CRISP-DM comprises 5. Evaluation. The developed ML models are evaluated
six sequential stages: business understanding, data using precision, recall, and AUC (Area Under the
understanding, data preparation, modeling, evaluation, ROC Curve) metric. Multiple models are assessed
and deployment. The schema of the CRISP-DM before the final model is selected. Recall measures
methodology is shown in Fig. 1. the fraction of all positive instances that are
successfully retrieved while precision describes the
fraction of all positive instances among the retrieved
positive instances. The formula to calculate recall and
precision is shown in (1) and (2).
𝑇𝑃
𝑅𝑒𝑐𝑎𝑙𝑙 = 𝑇𝑃 + 𝐹𝑁
(1)
𝑇𝑃
𝑃𝑟𝑒𝑐𝑖𝑠𝑖𝑜𝑛 = 𝑇𝑃 + 𝐹𝑃
(2)
1349
Authorized licensed use limited to: UNIVERSITAS GADJAH MADA. Downloaded on October 02,2023 at 04:56:27 UTC from IEEE Xplore. Restrictions apply.
Proceedings of The 2022 IEEE International Conference on Industrial Engineering and Engineering Management (IEEM)
To develop the agent activation and upgrade function, ensure the developed UI is in accordance with what is
rule-based solution is preferred over ML-based solution needed and required, usability testing is carried out. If the
because of the existence of a specific business domain UI is deemed to be satisfactory, the project can proceed to
knowledge and rules. Additionally, the rule-based frontend and backend development.
approach offers better explainability than the ML-based The application development needs to consider the
approach and the possibility of making ad-hoc business deployment and integration mechanism of the ML model.
rules if there is a change in the business process [10]. Additionally, the application also provides a feature where
The upgrade rule is simply governed by the users can find the recommended agents/customers easily
transaction frequency and volume. If the agent’s by using turn-by-turn directions. To enable this feature,
transaction frequency and volume surpass the defined we use the O-D (origins to destinations) travel time
threshold, then he/she can be promoted to the next class. matrix estimation using Google Maps API [13].
There are three tiers of BRILink agents: pemula agent
(first tier), jawara agent (second tier), and juragan agent
(third tier).
The Break Even Point (BEP) for any financial
product or financial institution is a condition where the
net income must be at least equal to the total expenditure
[11]. The development of BEP activation
recommendations for existing agents is made based on the
rules that have been made by the business by adopting
BEP calculation analysis.
Extreme Programming (XP) [12] is used as the Fig. 3. Travel time estimation with the Google Maps API
primary software development methodology to develop
the web application development in this project. This Quality assurance (QA) process is embedded in the
methodology is slightly modified to adapt with BRI’s process to ensure the application is working as expected
development environment. The XP methodology being and no defects are found that could deter the application
used is visualized in Fig. 2. to work as per the requirement.
C. Change Management
1350
Authorized licensed use limited to: UNIVERSITAS GADJAH MADA. Downloaded on October 02,2023 at 04:56:27 UTC from IEEE Xplore. Restrictions apply.
Proceedings of The 2022 IEEE International Conference on Industrial Engineering and Engineering Management (IEEM)
regarding the implementation of the new workflow selected as the final model to achieve both robustness and
and the technical know-how to use the new system. efficiency that is further deployed to the application.
4. Stakeholder survey. After the implementation of the
new system, checkpoints are carried out periodically B. Web Application Result
to get feedback from the users. This feedback is then
used as the basis for improving the application. There are several features that have been
Step three and four are carried out by holding implemented in the web application:
presentations/workshops explaining the use of the newly 1. Evaluation feature. The evaluation feature shows
developed application, including introduction of all the target and progress of achieving acquisition,
features embedded in the system. activation, and upgrade function. Thus, PABs can
monitor their own performance conveniently. A
snapshot of the evaluation feature interface is shown
III. RESULTS in Fig. 4.
2. Recommendation feature. The recommendation
A. Models Result feature displays the result of the developed ML
model and rule-based model, which generate highly
Some ML algorithms have been tried, including qualified customers to be acquired and potential
Logistic Regression, Random Forest, Decision Tree, and existing agents to be upgraded and activated. Map
LightGBM. Based on equation (2), precision calculates representation is used to represent the list of
the number of correct positive predictions made. recommendations, so that PABs are aware about their
Furthermore, AUC is an aggregate measure of location relative to the recommendees’ positions.
performance through all possible classification thresholds. Color coding is used to differentiate between the
Below are Logistic Regression, Random Forest, Decision acquisition, activation, and upgrade recommendation
Tree, and LightGBM models that are evaluated through (blue for acquisition, green for upgrade, and yellow
accuracy, precision, recall, and AUC. for activation). A snapshot of the recommendation
feature interface is shown in Fig. 5.
TABLE I 3. Detailed recommendation feature. We can select
PERFORMANCE METRICS OF COMPARATIVE MODELS one of the recommendations and view the detailed
information (e.g. recommendation score, the reason
Model Type Accuracy Precision Recall AUC why the recommendees are selected, and basic
information) of the recommendees. A snapshot of the
Logistic detailed recommendation feature interface is shown
0.636 0.506 0.048 0.545
Regression
in Fig. 6.
Random Forest 0.655 0.555 0.269 0.659 4. Routing feature. The routing feature is provided to
make it easier for PABs to find the most efficient
Decision Tree 0.578 0.423 0.435 0.548 route to reach their target. A snapshot of the routing
feature interface is shown in Fig. 7.
LightGBM 0.625 0.489 0.704 0.694
1351
Authorized licensed use limited to: UNIVERSITAS GADJAH MADA. Downloaded on October 02,2023 at 04:56:27 UTC from IEEE Xplore. Restrictions apply.
Proceedings of The 2022 IEEE International Conference on Industrial Engineering and Engineering Management (IEEM)
1352
Authorized licensed use limited to: UNIVERSITAS GADJAH MADA. Downloaded on October 02,2023 at 04:56:27 UTC from IEEE Xplore. Restrictions apply.
Proceedings of The 2022 IEEE International Conference on Industrial Engineering and Engineering Management (IEEM)
1353
Authorized licensed use limited to: UNIVERSITAS GADJAH MADA. Downloaded on October 02,2023 at 04:56:27 UTC from IEEE Xplore. Restrictions apply.
Proceedings of The 2022 IEEE International Conference on Industrial Engineering and Engineering Management (IEEM)
1354
Authorized licensed use limited to: UNIVERSITAS GADJAH MADA. Downloaded on October 02,2023 at 04:56:27 UTC from IEEE Xplore. Restrictions apply.