Other ways of presenting a classification problem
For a binary classifcation matrix, the so-called confusion matrix, is often used. It can also be extended to more catgeories/classes as well.
The following quantities are then used
- positive condition number \( P \), which represents the number of real positive cases in the data (output one/true etc)
- The condition negative number \( N \) which is the number of negative cases (ouput zero/false etc)
- The true positive number \( TP \) which represents whether a positive test result has been correctly classified (the application of our trained model on a test data set)
- The true negative \( TN \) number which represents whether a negative test has been correctly classified
- The false positive \( FP \) number, a so-called type I error which tells us about the fraction of positive test result which are wrongly classified
- A false negative \( FN \) number, a so-called type II error which, should be pretty obvious, indicates if a negative test has been wrongly classified.
It is is easy to think in terms of illness. You could think of the above as
- True positive: Sick people correctly identified as sick
- False positive: Healthy people incorrectly identified as sick
- True negative: Healthy people correctly identified as healthy
- False negative: Sick people incorrectly identified as healthy