Confusion Matrix in Machine Learning fgvbn
Confusion Matrix in Machine Learning fgvbn
○ For the 2 prediction classes of classifiers, the matrix is of 2*2 table, for 3 classes,
it is 3*3 table, and so on.
○ The matrix is divided into two dimensions, that are predicted values and actual
values along with the total number of predictions.
○ Predicted values are those values, which are predicted by the model, and actual
values are the true values for the given observations.
○ True Negative: Model has given prediction No, and the real or actual value was
also No.
○ True Positive: The model has predicted yes, and the actual value was also true.
○ False Negative: The model has predicted no, but the actual value was Yes, it is
also called as Type-II error.
○ False Positive: The model has predicted Yes, but the actual value was No. It is
also called a Type-I error.
Need for Confusion Matrix in Machine learning
○ It not only tells the error made by the classifiers but also the type of errors such
as it is either type-I or type-II error.
○ With the help of the confusion matrix, we can calculate the different parameters
for the model, such as accuracy, precision, etc.
Suppose we are trying to create a model that can predict the result for the disease that
is either a person has that disease or not. So, the confusion matrix for this is given as:
○ The table is given for the two-class classifier, which has two predictions "Yes"
and "NO." Here, Yes defines that patient has the disease, and No defines that
patient does not has that disease.
○ The classifier has made a total of 100 predictions. Out of 100 predictions, 89 are
true predictions, and 11 are incorrect predictions.
○ The model has given prediction "yes" for 32 times, and "No" for 68 times.
Whereas the actual "Yes" was 27, and actual "No" was 73 times.
Calculations using Confusion Matrix:
We can perform various calculations for the model, such as the model's accuracy, using
this matrix. These calculations are given below:
○ Misclassification rate: It is also termed as Error rate, and it defines how often the
model gives the wrong predictions. The value of error rate can be calculated as
the number of incorrect predictions to all number of the predictions made by the
classifier. The formula is given below: