0% found this document useful (0 votes)

29 views

Predicting BPLMatch Winners An Empirical Study Using Machine Learning Approach

sports analytics

Uploaded by

Aditya Agrawal

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

29 views

Predicting BPLMatch Winners An Empirical Study Using Machine Learning Approach

sports analytics

Uploaded by

Aditya Agrawal

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 9

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

net/publication/375884996

Predicting BPL Match Winners: An Empirical Study Using Machine Learning

Approach

Conference Paper · July 2023

DOI: 10.1109/ICCCNT56998.2023.10306698

CITATIONS READS

0 35

2 authors:

Bornita Adhikari Md. Sazzadur Ahamed

Daffodil International University Daffodil International University
1 PUBLICATION 0 CITATIONS 24 PUBLICATIONS 16 CITATIONS

SEE PROFILE SEE PROFILE

All content following this page was uploaded by Bornita Adhikari on 07 December 2023.

The user has requested enhancement of the downloaded file.

Predicting BPL Match Winners: An Empirical Study
Using Machine Learning Approach
Bornita Adhikari Md. Sazzadur Ahamed
Department. of Computer Science and Department. of Computer Science and
Engineering Engineering
Daffodil International University Daffodil International University
Dhaka,Bangladesh Dhaka,Bangladesh
[email protected] [email protected]

Abstract— With the evolution of computer science, every Every team want to give their best and for this purpose ML
company is implementing the newest technologies to survive in prediction can play significant role by handling any
market with better decision-making capabilities, better uncertainty and can make an impact in predicting the winner
communication and customer satisfaction. The only means of of matches using present data in several ways. ML models
fulfilling all these criteria’s is to perform data analysis that is can analyze the past data and predict how weather condition
more accurate and pure. In cricket, where no one can guess and a particular pitch or ground may affect the match’s
which team will win until the last ball of the last over, machine outcome. By analyzing the factors like team performance,
learning can help by predicting the results of the games. Match team ranking, head-to-head records and the recent form of
outcome prediction models have a lot of financial incentive
players, probability of winning the game at particular venue,
because cricket is a multi-billion-dollar industry. The goal of
score impact on field/bat first after winning the toss machine
this study is to identify the most accurate machine learning
model that can accurately predict the winner given the data learning models can predict which team is likely to win.
from the Bangladesh Premier League. For this analysis five ML In our analysis, the performance of each model and future
models XGBoost, Gradient Boosting, KNN, Decision Tree, directions are discussed with the goal of predicting the
Random Forest has been tested for the purpose of model outcome of BPL matches. With the growth of T20 leagues
building despite that our proposed model is XGBoost. To get and technological advancements the ask for cricket winner
access to BPL dataset web scrapping has been done, the dataset
prediction models is anticipated to expand in the upcoming
contains 15 columns and 3239 values and 8 team was available
years, as more fans and teams recognize the value of data-
in each season from 2018 to 2023. We use cutting-edge machine
learning techniques based on the use of numerous models,
driven insights and the potential competitive advantages they
feature selection, and data separation techniques. Finally, by can provide. Our experimental conclusion can help to
structuring every line of action, the forecast accuracy is optimize team strategies and increase their chances of
attained. winning. The prediction was convey using five machine
learning classifiers XGBoost, Gradient Boosting(GB), k-
nearest neighbors(KNN), Decision Tree(DT), Random
Forest(RF) where all the models in our research has shown
Keywords— BPL, Cricket, Prediction, XGBoost, outstanding accuracy. According to investigational result,
Visualization, Classification XGBoost indicates better prediction of 93%.We described the
benefits of cricket outcome prediction modeling with brief
introduction of this game. In section 2, some of the related
I. INTRODUCTION works about cricket outcome prediction is shown. The section
3, represent the workflow of this analysis, Section 4 comes
Cricket is a well-liked sport that is played and enjoyed by with the result formulation and conclusion section 5 of this
millions of people around the globe. It is especially popular in work provides a description.
countries such as India, Pakistan, Australia, England, South
Africa, Sri Lanka, Bangladesh, and the West Indies, where it
is considered a national pastime. The reason for the popularity
of cricket is it offers excitement and drama. The sport is II. LITERATURE REVIEW
known for its high-scoring matches, close finishes, and the
individual brilliance of its players. The fast-paced and Cricket has gained a lot of attention as it has progressed
dynamic nature of Twenty20 cricket has made it particularly among sports commentators. Cricket has been the subject of
popular in recent years, as it offers a shorter and more action- an increasing amount of research, but because this dataset is
packed version of the game. Nowadays Cricket is being brand-new and private, it has not yet been utilized in any
benefited from the growth of technology and the media, which research articles. Vistro et al. [1] conducted a study which
has made it more accessible to fans. Live streaming of
aimed to foresee the triumphant team in cricket matches
matches, social media, and mobile apps have made it easier
utilizing machine learning and data analytics methods. The
for fans to stay up-to-date with the latest news and scores.
study incorporated a range of features related to team and
A professional Twenty20 cricket league is called the player performance, venue, and other match-specific
Bangladesh Premier League (BPL) that was launched in 2012 variables to train their models. The study's findings showed
and it operates on franchise-based business model. BPL has that the suggested method able to foresee the winner of
become a t-twenty blast investing big amount of money. There cricket matches with an accuracy of more than 70%. The
are five individual winning team in BPL history, with the study accentuated the potential of machine learning and data
Dhaka Dynamites winning the most titles (3). Teams select analytics in predicting the winner of cricket matches. In the
their players based on draft system that says players should be study of Awan et al. [2] The team scores were predicted using
selected based on their performances in the previous season.

XXX-X-XXXX-XXXX-X/XX/$XX.00 ©20XX IEEE

a machine learning linear regression model, and the accuracy Boosting, outperformed individual models, yielding match
was assessed using the big data framework Spark ML., outcome predictions that were over 60% accurate. In
RMSE, MSE, MAE on ODI dataset and got the outstanding Wickramasinghe's study [13], machine learning techniques
accuracy 95%. To predict the results of IPL matches, Pallavi were used to categorize all-rounders in One Day International
Tekade er al. [3] used several ML models, including Decision (ODI) cricket. The authors chose the fielding, bowling, and
Tree Regression, Random Forest Regression, Naive Bayes, batting statistics-related variables to train the algorithms. The
and Logistic Regression. They briefly described the key suggested method outperformed conventional statistical
factors, such as pitch conditions, temperature, humidity, and methods in categorizing all-rounders with high accuracy. The
precipitation, that directly affect the match scores. To study helps estimate cricket player performance using
illustrate their methods, they used a number of diagrams, and machine learning techniques. Overall, the study shows how
they recorded a 90% accuracy rate. With 70% accuracy, machine learning may be used to categorize and forecast
Vignesh Vapors Sankaranarayanan et al. [4] created a model cricket player performance. To find the ideal cricket match
to forecast the results of 125 ODI cricket matches. They also winning strategy, Srivastava et al. [17] suggested a hybrid
discuss how well the model performed. The difference machine learning-clustering-association rule architecture. In
between projected and actual total home runs, the overall order to train their models, the authors took advantage of a
score error distribution, and an examination of betting market variety of team composition, batting and bowling
sensitivity are all being shown using various data mining performance, and pitch characteristics factors. The best
approaches. Mittal and others [5] Kumar conducted a players and tactics for winning a match could be found using
thorough analysis and comparison of some of the most widely the suggested framework. The study adds to the body of
used machine learning algorithms, Naive Bayes, Logistic knowledge regarding the application of machine learning to
Regression Neural Networks, and Random Forest, among cricket strategy. In order to forecast player performance and
others, which virtually exactly predict the outcome of a team recommendations in the sport of cricket, Biswas et al.
match. Based on player performance and previous form, as [16] conducted a poll. The authors analyzed a number of
well as how many runs each batsman is expected to score and research that employed machine learning strategies to
how many wickets each bowler is expected to take, Kalpdrum forecast cricket player performance, team makeup, and game
Passi and Niravkumar Pandey [7] attempt to forecast the outcomes. The poll brought attention to how machine
outcome of the game. This document provides classification learning could enhance cricket team selection, training, and
reports of algorithms and a detailed computation of batting strategic decision-making.
and bowling attributes. In their investigation, the Random Using machine learning techniques, Ahmed et al.'s [14]
Forest classifier produced the highest accurate results. analysis of the Pakistan cricket team's performance. To
Kamble, R. R. [8] developed a system combining 2 models forecast game results and pinpoint variables that affect a
which can predict the score a team and prediction of the win team's success or failure, the authors examined several
percentage of both teams. This is an effective prediction performance indicators and player information. According to
system for cricket developed with 5 years of ODI data based the study, the performance of the Pakistani team can be
on Linear Regression, Naïve Bayes algorithm in their influenced by a number of variables, including player form,
analysis, Passi, Kalpdrum, and Niravkumar Pandey [9] used pitch conditions, and team composition. The study's
naive bayes, random forest, multiclass SVM, and decision conclusions show the potential of machine learning for
tree to create prediction models that could forecast how many cricket analytics and have practical ramifications for raising
runs a player would score and how many wickets a bowler the performance of the Pakistan side. This study adds to the
would take by analyzing attributes from ODI cricket matches. body of knowledge on cricket performance analysis and
The application of machine learning to forecast the results of strategic decision-making using machine learning. n their
English county twenty over cricket matches is explored in the 2017 study, Mustafa et al. sought to determine whether
study by Kampakis and Thomas (2015).The goal of this work machine learning techniques could accurately forecast cricket
by Lamsal and Choudhary [11] was to use machine learning match results based on user sentiment on social media. To
to predict the results of cricket matches in the Indian Premier find the most effective method for prediction, the authors
League.They put out a revolutionary strategy that blended compared various algorithms, including Random Forests
various machine learning algorithms and included both (RF), K-Nearest Neighbors (KNN), and Support Vector
simple and intricate features concerning team and player Machines (SVM). With a 67% accuracy rate, the data
performance, weather patterns, and other match-specific demonstrated that the RF algorithm performed better than the
aspects. The suggested model outperformed the benchmark others. The study emphasizes the potential of using online
model and had a prediction accuracy for matches of 64.1%. social network crowdsourcing to forecast sporting events.
The authors came to the conclusion that their strategy was an The investigation made by Mustafa et al. [6] (2017) aimed to
effective way to forecast cricket match results in the IPL and investigate the effectiveness of machine learning methods in
that it could also be used for other sports. Overall, this study predicting cricket match outcomes employing social network
adds to the body of knowledge on picking winners and losers user evaluations. The authors compared different algorithms
in cricket matches. such as Random Forests (RF), K-Nearest Neighbors (KNN),
Using machine learning methods, Srikantaiah, K. C. et al. and Support Vector Machines (SVM) to determine the best
[12] sought to forecast the results of Indian Premier League approach for prediction. The findings revealed that the RF
(IPL) matches. Authors use a variety of features to train their algorithm outperformed the others with an accuracy rate of
models, including batting and bowling statistics, location, 67%. The study highlights the potential of using crowd
and team performance. The findings demonstrated that opinions on social networks for predicting sports outcomes.
ensemble approaches, such as Random Forest and Gradient
III. METHODOLOGY A. Data Description
To conduct the analysis, we sourced data from
cricbuzz.com that pertains to matches played in the BPL
Twenty20 cricket league from 2018 to 2023. Each season
comprises 46 matches and the data includes various attributes
such as the city, date, teams, toss winner, toss decision, winner
team, win-by-runs, result, win-by-wickets, and venue. To
ensure high accuracy and better outcomes when applying
prediction models, a significant amount of data was extracted,
resulting in a dataset that contains 15 columns and 3239
entries.
B. Data Pre-Processing
Preparing data for analysis involves several crucial steps,
collectively known as data pre-processing. These include
cleaning, transforming, and organizing raw data to make it
suitable for analysis. This step is crucial because the accuracy
and effectiveness of any analysis or modeling depend heavily
on the quality of the data. The various steps involved in pre-
processing are discussed in detail below.
 Removing Unwanted Columns
We eliminated null values and unnecessary columns like
ID, Season, and Man of the Match from this analysis in order
to focus on the columns that our forecast is founded on and
rely upon.
 Tackle Data Volume
To tackle the low volume of data causing lower
performance of algorithms, the data had to be volume up by
duplicating data rows.
 Label Encoding
Label encoding is the process of giving several levels or
categories of a variable or attribute a numerical value or code.
Fig. 1. Workflow of Methodology The pre-processing step of categorical data representation is
crucial for statistical modeling and data analysis. Data for ML
The workflow of this research are briefly elaborated in models must only be in numerical format. Therefore, label
Figure 1. The prime decision of this analysis was select encoding has been used to transform every entity into a
Bangladesh Premier League as our study work. First and numerical representation. Before label encoding, Table 1
foremost, data extraction was done. Before classification a displays all the properties and values of our dataset. The idea
small group of data preprocessing was needed. After testing of label encoding is displayed in Table 2.
several models, the proposed model was found, that got
highest accuracy.

TABLE I. BEFORE LABEL ENCODING

City date team1 Team2 toss- Toss- result winner Win-by- Win- value
runs by-
winner decision wickets
Chattogram 20/12/2019 Chattogram Comilla Comilla Field Normal 0 Zahur Ahmed
Challengers Warriors Warriors Chattogram Chowdhury
16
Challengers Stadium

Dhaka 11/1/2020 Dhaka Khulna Khulna Field Normal Khulna 0 8 Sher-e-Bangla

Platoon Tighers Tighers Tighers National Cricket
Stadium

Dhaka 11/1/2020 Dhaka Khulna Khulna Field Normal Dhaka 0 4 Sher-e-Bangla

Platoon Tighers Tighers Platoon National Cricket
Stadium

Dhaka 4/2/2018 Chattogram Dhaka Chattogram Bat Normal Dhaka 0 7 Sher-e-Bangla

Challengers Platoon Challengers Platoon National Cricket
Stadium

Dhaka 4/2/2023 Chattogram Comilla Chattogram Bat Normal Comilla 0 6 Sher-e-Bangla

Challengers Warriors Challengers Warriors National Cricket
Stadium
TABLE II. AFTER LABEL ENCODING

City date team1 Team2 toss- Toss- result winner Win- Win- value
by-runs by-
winner decision wickets
0 41 0 2 1 1 1 0 16 0 4

2 8 2 5 4 1 1 6 0 8 1

2 79 2 5 4 1 1 2 0 4 2

2 75 0 3 0 0 1 2 0 7 2

2 76 0 2 0 0 1 1 0 6 2

C. Visualize dataset
 scatter plot matrix  visualize the correlation matrix Heatmap
In machine learning research, data visualization plays a To view the correlation matrix Heatmap, one needs to
crucial role in comprehending the connections between visualize a table that displays the correlation coefficients
variables, recognizing patterns, and gaining insights into between pairs of variables in a dataset. These coefficients are
model performance. One such visual representation is the statistical measures that indicate the degree of correlation
scatter plot matrix of features in xtrain, which displays the between two elements. Correlation coefficient values vary
correlations between pairs of features (or variables) in the from -1 to 1, where -1 denotes a fully negative correlation, 1
training dataset. To predict the winner of BPL matches, this denotes a fully positive correlation, and 0 denotes no
research has selected five columns - city, date, team1, team2, association. The plot that results from this visualization is a
and toss winner - to plot on the two axes, X and Y in a grid of matrix that is color-coded, with red hues representing positive
scatter plots. Each plot within the grid demonstrates the relationships and blue tones representing negative
relationship between two features, while the diagonal of the correlations. This display is shown in Figure3.
grid showcases the distribution of each feature. The
correlations observed in the dataset is shown in Figure 2

Fig. 2. Scatter plot matrix Fig. 3. Correlation matrix Heatmap

D. Classification models training probability or class label. No information has been omitted in
To prepare the models and preprocess the data, Google the paraphrased text.
Colaboratory's cloud GPU, along with essential libraries such E. Making predictions
as pandas, numpy, matplotlib, and seaborn, have been
employed. The data set includes a substantial number of After testing several model / machine learning algorithms,
entities, and the relevant variables that are likely to aid in it is seen that the models are getting trained conveniently. But
predicting the target variable (winner) have been selected the testing capability is not performed yet. To find out how the
based on the data description. Following the selection of algorithms are performing on each test portions of the dataset,
algorithms, the input data with corresponding target values we had to evaluate the model's performance through a step of
was fed into the model training. To increase the volume of prediction performed from the trained models separately and
data, the data was duplicated by a factor of two, and during the testing results indicate XGBoost model to be the highest
model training, the data was randomly divided into testing and performing one.
training sections. The training component used 80% of the
data, and the testing portion used the remaining 20%. Brief
descriptions of the tested models are provided below.
 KNN IV. RESULT AND DISCUSSION
The K-Nearest Neighbors (KNN) is a straightforward The accuracy table is helpful for assessing the
machine learning technique that is utilized for both regression effectiveness of models and pinpointing potential
and classification assignments. The algorithm functions by improvement areas. By emphasizing particular sorts of model
finding the K closest data points to a provided input data point flaws, the presentation of the accuracy table can help reveal
within the training set and then utilizing their values or labels areas that need work. The proportion of correctly classified
to predict an outcome. observations in the dataset after testing several models is
shown in Table 3 and Figure 4 is the proper visualization of
 Decision Tree the result.
The Decision Tree algorithm functions by dividing data TABLE III. MODEL ACCURACY
into smaller, similar subsets based on its defining features. To
make predictions, the input data is moved through the tree Models Accuracy
from the top node to the leaf node. The algorithm determines
XGBoost 93%
which branch to follow at each internal node by evaluating the
input feature's value. This process continues until the leaf node Gradient Boosting 92%
is reached, which provides a forecast for the input data. No
information has been left out in the paraphrased text. Random Forest 86%

 Random Forest Decision Tree 86%

One sort of ensemble learning that uses the Random Forest KNN 79%
method utilizes numerous decision trees to predict outcomes.
In order to make a prediction using this algorithm, the input
data is processed through each decision tree from the root
node to a leaf node. Once the data has reached the leaf node
of each tree, the algorithm produces a prediction based on
either the majority class or the average prediction value of all
the decision trees within the Random Forest.
 Gradient Boosting
Gradient Boosting is a technique that involves combining
several weak learners to form a powerful one. To make
predictions, the input data is first fed into the weak learner,
and the errors in the initial prediction are determined. To
improve the accuracy of the prediction, a new weak learner is
then trained to correct the previous learner. The final
prediction is obtained by adding up the predictions from all
the weak learners.
 XGBoost
XGBoost is a well-known machine learning model that is
widely used for regression, classification, and ranking tasks.
It is an ensemble model that consists of multiple decision trees
and leverages the errors of previous trees to enhance its
predictions. During the prediction process, the input data is
processed through various decision trees, and the scores
generated by each tree are merged to produce a final
prediction. Finally, a non-linear function like the sigmoid or
softmax function is applied to convert the output into a Fig. 4. Accuracy compares
the training data. Predictions were then made on the test data
using the predict method. The evaluation of our model's
performance was done using accuracy as the primary metric.
In machine learning, performance metrics are
used to assess the effectiveness of different models. Accuracy
is a useful metric that measures the proportion of correctly
predicted instances in the dataset. Our XGBoost Classifier
achieved an accuracy of 94.25%, indicating a satisfactory
performance of our model.

C. Classification report of XGBoost

Table 4 presents a report
that summarizes the performance of various classes or
categories in the XGBoost dataset. The report includes key
metrics such as Precision, which represents the proportion of
predicted positives that are true positives, and Recall, which
refers to the proportion of actual positives that are correctly
identified. The F1 score is a combined measure of precision
and recall, while Support indicates the number of observations
in each class.

TABLE IV. AFTER LABEL ENCODING

Class Precision Recall F1-Score Support

Fig. 5. Confusion Matrix

Fortune 0.90 1.00 0.95 18

Barishal

A. Confusion Matrix:
The counts of true positives, true negatives, false positives, Khulna 0.91 1.00 0.95 20
and false negatives for each class in the dataset are shown in Tighers
the confusion matrix. In case the model is producing a lot of
false positives, we can attempt to enhance its performance by
modifying the decision threshold or experimenting with Comilla 1.00 1.00 1.00 7
different features. Figure 5 illustrates the confusion matrix of Warriors
the models that were tested.

Chattogram 1.00 1.00 1.00 2

B. Model Implementation –XGBoost Classifiers
Challengers

Extreme Gradient Boosting

(XGBoost) is a highly effective machine learning algorithm Dhaka 1.00 1.00 1.00 13
known for its remarkable predictive accuracy. It is specially Platoon
designed to be computationally efficient and capable of
handling large datasets. The algorithm incorporates L1 and L2
regularization, as well as dropout regularization, to prevent
overfitting and improve the model's generalization Sylhet 1.00 0.78 0.88 9
Thunder
performance. Additionally, XGBoost generates feature
importance scores, which help in identifying the most
significant features in the data. This feature can be utilized for
feature selection, feature engineering, and to gain insights into Rajshahi 0.83 0.71 0.77 14
various machine learning tasks such as regression, Royals
classification, and ranking.
The flexibility and customization of the
XGBoost algorithm is due to its support for various loss Rangpur 1.00 1.00 1.00 3
functions and evaluation metrics. To implement this Riders
algorithm, we utilized the BPL dataset from scikit-learn and
split it into training and testing sets. Our XGBoost model was
defined using the XGBClassifier class and was trained using
V. CONCLUSION AND FUTURE WORK learning." Computer Science & Information Technology (CS & IT)
(2017).
Predicting the match winner of BPL data from season [10] 10. Kampakis, Stylianos, and William Thomas. "Using machine
2018 to 2023 was the primary goal of this study. We compared learning to predict the outcome of english county twenty over cricket
the performance of 5 different machine learning models to see matches." arXiv preprint arXiv:1511.05837 (2015).
which one best fits the dataset and The fields of data science [11] 11. Lamsal, Rabindra, and Ayesha Choudhary. "Predicting outcome
that have merged include visualizations, pre-processing, of Indian premier league (IPL) matches using machine learning." arXiv
implementing machine learning models, feature selection for preprint arXiv:1809.09813 (2018).
winner prediction and analysis. By Level encoding into [12] 12. Srikantaiah, K. C., Aryan Khetan, Baibhav Kumar, Divy Tolani,
and Harshal Patel. "Prediction of IPL match outcome using machine
numerical format, removing the missing values from the learning techniques." In 3rd International Conference on Integrated
dataset the preprocessing has been done By combining the Intelligent Computing Communication & Security (ICIIC 2021), pp.
target variable with the properties of the data, the best features 399-406. Atlantis Press, 2021.
were chosen. We have found outstanding results by [13] 13. Wickramasinghe, Indika. "Classification of all-rounders in the
performing. models for machine learning with specific game of ODI cricket: Machine learning approach." Athens Journal of
attributes. Our goal was to predict the winner with highest Sports 7, no. 1 (2020): 21-34.
accuracy, we achieved the highest accuracy on XGBoost and [14] 14. Dhonge, Nikhil, Shraddha Dhole, Nikita Wavre, Mandar
Pardakhe, and Amit Nagarale. "IPL cricket score and winning
predicted the winner with 93% accuracy. In this process of prediction using machine learning techniques." Int Res J
predicting the winner KNN gave us the lowest accuracy which Modernization Eng Technol Sci 3 (2021): 1723-1730.
is 79%. [15] 15. Ahmed, Waqar, Mahwish Amjad, K. Junejo, Tariq Mahmood,
and A. Khan. "Is the performance of a cricket team really
Because of different ways of data preprocessing, hyper unpredictable? a case study on pakistan team using machine learning."
parameter tuning, Sensitivity to initial conditions a single Indian Journal of Science and Technology 13, no. 34 (2020): 3586-
algorithm can produce different result. Therefore, we are 3599.
currently attempting to decrease the prediction inaccuracy. [16] 16. Biswas, Milon, Tajim Md Niamat Ullah Akhund, Md Kawsher
Furthermore, we intend to predict fall of wickets, finding Mahbub, Sikder Md Saiful Islam, Sadia Sorna, and M. Shamim Kaiser.
about the reasons behind most preferred decision of BPL after "A survey on predicting player’s performance and team
recommendation in game of cricket using machine learning." In
toss Field or Ball. Eventually, we aim to host a website where Information and Communication Technology for Competitive
all this analysis will be available for audience. Strategies (ICTCS 2020) ICT: Applications and Social Interfaces, pp.
223-230. Springer Singapore, 2022.
[17] 17. Srivastava, Praveen Ranjan, Prajwal Eachempati, Ajay Kumar,
Ashish Kumar Jha, and Lalitha Dhamotharan. "Best strategy to win a
match: an analytical approach using hybrid machine learning-
clustering-association rule framework." Annals of Operations
Research (2022): 1-43.
REFERENCES

[1] Vistro, Daniel Mago, Faizan Rasheed, and Leo Gertrude David. "The
cricket winner prediction with application of machine learning and data
analytics." International Journal of Scientific & Technology Research
8, no. 09 (2019).
[2] 2. Awan, Mazhar Javed, Syed Arbaz Haider Gilani, Hamza
Ramzan, Haitham Nobanee, Awais Yasin, Azlan Mohd Zain, and
Rabia Javed. "Cricket match analytics using the big data approach."
Electronics 10, no. 19 (2021): 2350.
[3] 3. Tekade, Pallavi, Kunal Markad, Aniket Amage, and Bhagwat
Natekar. "Cricket match outcome prediction using machine learning."
International journal 5, no. 7 (2020).
[4] 4. Sankaranarayanan, Vignesh Veppur, Junaed Sattar, and Laks VS
Lakshmanan. "Auto-play: A data mining approach to ODI cricket
simulation and prediction." In Proceedings of the 2014 SIAM
international conference on data mining, pp. 1064-1072. Society for
Industrial and Applied Mathematics, 2014.
[5] 5. Mittal, Harsh, Deepak Rikhari, Jitendra Kumar, and Ashutosh
Kumar Singh. "A study on machine learning approaches for player
performance and match results prediction." arXiv preprint
arXiv:2108.10125 (2021).
[6] 6. Mustafa, Raza Ul, M. Saqib Nawaz, M. Ikram Ullah Lali,
Tehseen Zia, and Waqar Mehmood. "Predicting the cricket match
outcome using crowd opinions on social networks: A comparative
study of machine learning methods." Malaysian Journal of Computer
Science 30, no. 1 (2017): 63-76.
[7] 7. Passi, Kalpdrum, and Niravkumar Pandey. "Increased prediction
accuracy in the game of cricket using machine learning." arXiv preprint
arXiv:1804.04226 (2018).
[8] 8. Kamble, R. R. "Cricket score prediction using machine learning."
Turkish Journal of Computer and Mathematics Education
(TURCOMAT) 12, no. 1S (2021): 23-28.
[9] 9. Passi, Kalpdrum, and Niravkumar Pandey. "Predicting players’
performance in one day international cricket matches using machine
View publication stats

Prediction and Analysis of Franchise Cricket
No ratings yet
Prediction and Analysis of Franchise Cricket
8 pages
Dynamic Cricket Match Outcome Prediction
No ratings yet
Dynamic Cricket Match Outcome Prediction
12 pages
MATLAB SIMSCAPE Manual
100% (1)
MATLAB SIMSCAPE Manual
153 pages
Smooth Board User Guide
No ratings yet
Smooth Board User Guide
84 pages
Background of The Study
No ratings yet
Background of The Study
6 pages
Prediction of IPL Match Outcome Using Machine Lear
No ratings yet
Prediction of IPL Match Outcome Using Machine Lear
8 pages
IPL Score Prediction (Journal) - 4nm18cs142-169-191-215.
No ratings yet
IPL Score Prediction (Journal) - 4nm18cs142-169-191-215.
10 pages
Application of Machine Learning in Cricket and Predictive Analytics of IPL 2020
No ratings yet
Application of Machine Learning in Cricket and Predictive Analytics of IPL 2020
26 pages
Paper 9073
No ratings yet
Paper 9073
11 pages
Predicting Outcome of Indian Premier League IPL Ma
No ratings yet
Predicting Outcome of Indian Premier League IPL Ma
5 pages
Cricket Match Winner Prediction]
No ratings yet
Cricket Match Winner Prediction]
5 pages
research_paper harshit 212-1
No ratings yet
research_paper harshit 212-1
4 pages
Predicting Cricket Match 490021 1 en
No ratings yet
Predicting Cricket Match 490021 1 en
13 pages
Cricket Analysis Using Machine Learning: B V S Sai Praneeth, V Srighan Reddy, P Jayanth, K Jeevan Reddy
No ratings yet
Cricket Analysis Using Machine Learning: B V S Sai Praneeth, V Srighan Reddy, P Jayanth, K Jeevan Reddy
5 pages
Cricket JETIR2005307
No ratings yet
Cricket JETIR2005307
5 pages
Ipl 4
No ratings yet
Ipl 4
8 pages
Fin Irjmets1697356356
No ratings yet
Fin Irjmets1697356356
4 pages
Major Project Report Estimating the Chances of Winning Ipl Using Machine Le 20240531 235827 0000
No ratings yet
Major Project Report Estimating the Chances of Winning Ipl Using Machine Le 20240531 235827 0000
28 pages
Ipl Cricket Score
No ratings yet
Ipl Cricket Score
8 pages
Paper 3
No ratings yet
Paper 3
7 pages
B.E Cse Batchno 185
No ratings yet
B.E Cse Batchno 185
42 pages
Final PDF
No ratings yet
Final PDF
13 pages
Ijirt157821 Paper
No ratings yet
Ijirt157821 Paper
6 pages
Applied Computing and Informatics: Kumash Kapadia, Hussein Abdel-Jaber, Fadi Thabtah, Wael Hadi
No ratings yet
Applied Computing and Informatics: Kumash Kapadia, Hussein Abdel-Jaber, Fadi Thabtah, Wael Hadi
6 pages
Predictiveanalysis of PSL Match Winners Using Machine Learning Techniques
No ratings yet
Predictiveanalysis of PSL Match Winners Using Machine Learning Techniques
12 pages
Comprehensive Data Analysis and Prediction On IPL Using Machine Learning Algorithms Valarmathi B 2113j1
No ratings yet
Comprehensive Data Analysis and Prediction On IPL Using Machine Learning Algorithms Valarmathi B 2113j1
11 pages
Ipl Winner Prediction Using Machine Learning
100% (1)
Ipl Winner Prediction Using Machine Learning
58 pages
Paper 5
No ratings yet
Paper 5
23 pages
Applied Computing and Informatics: Kumash Kapadia, Hussein Abdel-Jaber, Fadi Thabtah, Wael Hadi
No ratings yet
Applied Computing and Informatics: Kumash Kapadia, Hussein Abdel-Jaber, Fadi Thabtah, Wael Hadi
6 pages
Ipl Nan
No ratings yet
Ipl Nan
11 pages
Ipl Prediction Documentation
No ratings yet
Ipl Prediction Documentation
18 pages
Predicting Players' Performance in One Day International Cricket Matches Using Machine Learning
No ratings yet
Predicting Players' Performance in One Day International Cricket Matches Using Machine Learning
17 pages
Predicting Outcome of Indian Premier League (IPL) Matches Using Machine Learning
No ratings yet
Predicting Outcome of Indian Premier League (IPL) Matches Using Machine Learning
12 pages
1809 09813 PDF
No ratings yet
1809 09813 PDF
13 pages
IJSRED-V8I2P177
No ratings yet
IJSRED-V8I2P177
6 pages
Balasundaram 2020
No ratings yet
Balasundaram 2020
5 pages
Super Predictor of Indian Premier League (IPL) Using Various ML Techniques With Help of IBM Cloud
No ratings yet
Super Predictor of Indian Premier League (IPL) Using Various ML Techniques With Help of IBM Cloud
7 pages
17 - Best Strategy To Win A Match An Analytical Approach Using Hybrid Machine
No ratings yet
17 - Best Strategy To Win A Match An Analytical Approach Using Hybrid Machine
43 pages
Cricket Score Prediction Using Machine Learning
No ratings yet
Cricket Score Prediction Using Machine Learning
6 pages
Performance Analysis of A Cricketer by Data Visualization
No ratings yet
Performance Analysis of A Cricketer by Data Visualization
10 pages
source file
No ratings yet
source file
42 pages
dalal-2024-ijca-923744
No ratings yet
dalal-2024-ijca-923744
7 pages
1276
No ratings yet
1276
13 pages
Ipl Prediction
No ratings yet
Ipl Prediction
12 pages
The Cricket Winner Prediction With Applications of ML and Data Analytics
No ratings yet
The Cricket Winner Prediction With Applications of ML and Data Analytics
18 pages
Quantifying and Analyzing The Performance of Cricket Player Using Machine Learning
No ratings yet
Quantifying and Analyzing The Performance of Cricket Player Using Machine Learning
7 pages
SSRN Id3572740
No ratings yet
SSRN Id3572740
5 pages
Madan Gopal Jhanwar
No ratings yet
Madan Gopal Jhanwar
11 pages
Ijst 2023 29441
No ratings yet
Ijst 2023 29441
7 pages
The Cricket Winner Prediction With Application of Machine Learning and Data Analytics
No ratings yet
The Cricket Winner Prediction With Application of Machine Learning and Data Analytics
6 pages
Predictive Analysis of Sports Data Using Google Prediction API
No ratings yet
Predictive Analysis of Sports Data Using Google Prediction API
1 page
IPL REPORT
No ratings yet
IPL REPORT
12 pages
Paper3 TeamselectionusingRandomForestAlgorithm
No ratings yet
Paper3 TeamselectionusingRandomForestAlgorithm
8 pages
Jsa - 2018 - 4 4 - Jsa 4 4 Jsa196 - Jsa 4 Jsa196
No ratings yet
Jsa - 2018 - 4 4 - Jsa 4 4 Jsa196 - Jsa 4 Jsa196
11 pages
Project Report
No ratings yet
Project Report
16 pages
editable
No ratings yet
editable
49 pages
Data Mining and Warehousing: Predicting The Outcome of ODI Matches
No ratings yet
Data Mining and Warehousing: Predicting The Outcome of ODI Matches
7 pages
Ipl 2
No ratings yet
Ipl 2
6 pages
Nadeem Report
No ratings yet
Nadeem Report
19 pages
Blue Futuristic Technology Presentation
No ratings yet
Blue Futuristic Technology Presentation
19 pages
Shetty 2020
No ratings yet
Shetty 2020
6 pages
IPL
No ratings yet
IPL
8 pages
Data and Analytics in Action: Project Ideas and Basic Code Skeleton in Python
From Everand
Data and Analytics in Action: Project Ideas and Basic Code Skeleton in Python
Zemelak Goraga
No ratings yet
r20 Ds Lab Manual
No ratings yet
r20 Ds Lab Manual
42 pages
MetaTrader 4 and MQL 4 Overview
No ratings yet
MetaTrader 4 and MQL 4 Overview
15 pages
Cluster Computing
No ratings yet
Cluster Computing
1 page
Z19 Tnoo 000517 PDF
No ratings yet
Z19 Tnoo 000517 PDF
1 page
Differential Evolution
No ratings yet
Differential Evolution
12 pages
3 28
No ratings yet
3 28
7 pages
StructuredBanking-CaseStudyReference v2.6
No ratings yet
StructuredBanking-CaseStudyReference v2.6
37 pages
Introduction To Coding & Computational Thinking Exercises
No ratings yet
Introduction To Coding & Computational Thinking Exercises
9 pages
Chapter 6
No ratings yet
Chapter 6
172 pages
Bob Statement
No ratings yet
Bob Statement
8 pages
Computer Science Class 11 Project
No ratings yet
Computer Science Class 11 Project
38 pages
KLU Student Feedback Report - Virtual Mock Interviews
No ratings yet
KLU Student Feedback Report - Virtual Mock Interviews
11 pages
Digital Computer Fundamentals and Microprocessor
No ratings yet
Digital Computer Fundamentals and Microprocessor
87 pages
Students - Profile (Requinton, J.)
No ratings yet
Students - Profile (Requinton, J.)
1 page
2010122010124615531USER MANUAL (RCPS For RMS)
No ratings yet
2010122010124615531USER MANUAL (RCPS For RMS)
39 pages
Basics of Internet Architecture: Lesson Plan
No ratings yet
Basics of Internet Architecture: Lesson Plan
1 page
On Chip Variation
100% (1)
On Chip Variation
29 pages
Power Scripts Code Depot
No ratings yet
Power Scripts Code Depot
161 pages
Personalized Doorbell: Ece 445 Senior Design
No ratings yet
Personalized Doorbell: Ece 445 Senior Design
5 pages
C C++ Tcs
No ratings yet
C C++ Tcs
8 pages
Solu Test
No ratings yet
Solu Test
6 pages
Lesson 01 BasicModeling1 Steel Precast
No ratings yet
Lesson 01 BasicModeling1 Steel Precast
54 pages
PERT Exercise2 Q&A
No ratings yet
PERT Exercise2 Q&A
19 pages
IP_Mini_Project_Sports_Shop_Management_System-1-.pdf
No ratings yet
IP_Mini_Project_Sports_Shop_Management_System-1-.pdf
25 pages
PIC18F Naming Convention
No ratings yet
PIC18F Naming Convention
1 page
Petri Nets Manufacturing
100% (1)
Petri Nets Manufacturing
502 pages
MCA II Sem
No ratings yet
MCA II Sem
11 pages

Predicting BPLMatch Winners An Empirical Study Using Machine Learning Approach

Uploaded by

Predicting BPLMatch Winners An Empirical Study Using Machine Learning Approach

Uploaded by

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

Predicting BPL Match Winners: An Empirical Study Using Machine Learning

Conference Paper · July 2023

Bornita Adhikari Md. Sazzadur Ahamed

SEE PROFILE SEE PROFILE

The user has requested enhancement of the downloaded file.

XXX-X-XXXX-XXXX-X/XX/$XX.00 ©20XX IEEE

TABLE I. BEFORE LABEL ENCODING

Dhaka 11/1/2020 Dhaka Khulna Khulna Field Normal Khulna 0 8 Sher-e-Bangla

Dhaka 11/1/2020 Dhaka Khulna Khulna Field Normal Dhaka 0 4 Sher-e-Bangla

Dhaka 4/2/2018 Chattogram Dhaka Chattogram Bat Normal Dhaka 0 7 Sher-e-Bangla

Dhaka 4/2/2023 Chattogram Comilla Chattogram Bat Normal Comilla 0 6 Sher-e-Bangla

Fig. 2. Scatter plot matrix Fig. 3. Correlation matrix Heatmap

 Random Forest Decision Tree 86%

C. Classification report of XGBoost

TABLE IV. AFTER LABEL ENCODING

Class Precision Recall F1-Score Support

Fortune 0.90 1.00 0.95 18

Chattogram 1.00 1.00 1.00 2

Extreme Gradient Boosting

You might also like