Capstone Presentation
Capstone Presentation
By
Anil Ulchala
PGPDSBA Online May_A 2021
Business Problem Understanding
Business Problem:
BCCI has come up with a problem on how to increase the Team India winning Probability. For the same, BCCI had a tie up with Data Analytics
Consultant. The major objective of this tie up is to extract actionable insights from the historical match data and make strategic changes to make India win.
Primary objective is to create Machine Learning models which correctly predicts a win for the Indian Cricket Team. Once a model is developed then you have to
extract actionable insights and recommendation.
Constraints:
• The data set provided has slight imbalance. 83% of the data relates to the matches where India has won. Rest 17% of the data relates to the matches
where India lost. So this may over train the model and bias the output towards Winning
• Also, another constraint is Data pertaining to some formats is not available. One such scenario is as per the problem requirement, it is needed to predict
the India winning strategy against Australia in T20 format. But the dataset has no record of India played T20 with Australia in the past. So this is a
constraint that will not allow to split the data into format wise and do analysis. Also, we cannot able to build three separate models based on format type.
Scope:
The scope of the business problem is to draw actionable insights and recommendations from the data set and build the models to predict the
result of Team India.
objectives:
The main objective is provide the winning strategy for the upcoming matches India will play with their opponents. The strategy should be
different when you play a match with same opponent and same parameters.
Data Set and Dictionary
Subheading
Considering the constraints on the data set only one model is build for the complete dataset and used accordingly based
on the strategy required
One hot encoding is done for the Object variables and on necessary dependent variables
GridSearchCV method is used to Hyper tune the models for the best accuracy
Subheading
• Table 1, gives the comparison of the various models built Table 1: Comparison between the models
• Based on the below parameters, it is identified that Lorem Ipsum is simply
Logistic Regression model is the best model to predict dummy text of the printing
the Match result of Team India against opponents and typesetting industry.
• Accuracy of the model
• Precision
• Recall &
• Performance of the model on Test and Train data
2 T20 match with Australia in India. All the match are Day and Night matches. In India, it will
be winter season at the time to match.
So Team Dynamics should be as follows:
Recommendations (contd.
2 ODI match with Sri Lanka in India. All the match are Day and Night matches. In India, it will
be winter season at the time to match.