Edit me

Assessing the performance of classification models

A tabulation of the actual and predicted target values is done, to summarise the counts of samples that are correctly and incorrectly predicted. In predicting two values (eg. yes or no), typically the table would have two rows and two columns. The actual labels are read horizontally, while the predictions are read vertically . However, this order can be reversed as well, but this should be mentioned. An example is given in the table below:

Table 1: Contingency table
Actual labels
Predicted labels positive negative
positive True positives False negatives
negative False positives True negatives

By using this table, one can compute various metrics (some of which are listed below), and in the best of cases aim for values converging towards 100% correctness:

  • Sensitivity This tells how well the model predicts the positive cases
  • Specificity This tells how well the model predicts the negative cases
  • Accuracy This quantifies how extent that a model predicts both positives and negatives samples correctly.




Tags: