recall vs percision
Precision
Precision = TP / (TP + FP)
In Precision, we want straight up, how many did the model get right, how many did the model get wrong - that will give us how precise it is.
Spam filtering: A spam filter with high precision will have few false positives, which means that it will not send many ham emails to the spam folder. So spam filter with high precision may also have some false negatives, which means that some spam emails will not be caught.
Recall
Recall measures how good the model is at avoiding false negatives. For example, if a spam filter has a recall of 0.8, then it means that 80% of the spam emails are actually predicted as spam.
Recall = TP / (TP + FN)
In Recall, we want to know this model predicted correctly and what is predicted badly when the result is actually true - that needed a review / revisit / recall.
We don't want recall for medical system
Medical diagnosis: A medical diagnosis system with high recall will catch most of the sick patients, but it may also incorrectly classify some healthy patients as sick. A medical diagnosis system with high precision will correctly classify most of the healthy patients, but it may also miss some sick patients.
Kappa and ROC
Comments