Interim Layout
Interim Layout
Paper[7] talks about using artificial neural A boosting machine regressor was used in this
networks for predicting results of and NBA research because:
game. It enhances prediction accuracy by sequentially
Paper[8] use data from historical matches to building decision trees boosting regressors
predict future matches. progressively refine predictions leading to more
accurate forecasts
Paper[9] talks about using multiple player factors
to predict matches and player performance Addresses all the complex dynamics of a
basketball game.
Paper[10] talks about injury rate predictions in
sports such as basketball by using Machine 3.Extra trees regressor:
learning models. This is an ensemble method similar to a random
3.Methadology forest regressor and is used here because of its
ability to reduce overfitting of training data
Dataset- leading to more generalized predictions.
To come up with an accurate prediction we have performance metrics
use a dataset with 464 players with 8 parameters.
X here can be featured as the data frame and has In order to make the prediction model efficient
the following values and accurate multiple model evaluation metrics
were used in it.
X=df[[‘rating’,’team’,’position’,’height’,’weight’,’s
alary’,’country’,’draft year’,’draft round’,’draft A descriptive note on the metrics used in the
peak’,’college’]] prediction model is mentioned ahead.
1. Root mean squared error- principal component analysis and other
Root mean squared error (RMSE) is a dimensionality reduction techniques. It
metric used widely in machine learning to quantifies the amount of variability in the
assess the accuracy of a predictive model. dependent variable that is explained by
It measures the difference between the independent variables or principal
predicted value and actual value. components.
Mathematically RMSE is expressed as: Mathematically explained variance is
√ 1
n expressed as:
RMSE= ∑ ( y i− ^y i)2 Var ( y− ^y )
n i=1 Explained varience=1−
where: Var ( y)
n is the number of observations in the Where:
dataset. Y is the actual values of the dependent
yi is the actual value of the dependent variable.
variable for the observation i. Ŷ is the predicted values of the dependent
ŷi is the predicted value of the dependent variable.
variable for the observation i. Var(y) is the total variance of the
dependent variable.
2. R squared-
R squared is also widely used metric in 4. Mean squared error-
machine learning models. It measures the Mean squared error is a commonly used
proportion of the variance in the metric for evaluation of regression model
dependent variable that is predictable performance. It measures the average of
from the independent variable. the squares of the errors or deviations
R-squared is expresses as: between predicted and actual values in a
SS residual dataset.
2
R =1− MSE is expressed as:
SStotal
n
1
Where: MSE= ∑ ( y i−^y i )
2
References: