International Journal of Statistics in Medical Research, 2012, 1, 79-81 79
E-ISSN: 1929-6029/12 © 2012 Lifescience Global
Performance Measures in Binary Classification
Matthias Kohl*
Department of Mechanical and Process Engineering, Furtwangen University, Jakob-Kienzle-Str. 17, D-78054
VS-Schwenningen, Germany
Abstract: We give a brief overview over common performance measures for binary classification. We cover sensitivity,
specificity, positive and negative predictive value, positive and negative likelihood ratio as well as ROC curve and AUC.
Keywords: Sensitivity, specificity, positive predictive value, negative predictive value, positive likelihood ratio,
negative likelihood ratio, prevalence, ROC curve, AUC, informative diagnostic test.
PERFORMANCE MEASURES
In many cases diagnostic tests are performed to
distinguish between two groups reflecting presence or
absence of a relevant medical condition. In this setup
let us assume a group of N patients with true status y
1
,
..., y
N
where y
i
= 1 represents presence and y
i
= 0
absence of the medical condition. A diagnostic test T
yields results t
1
, ..., t
N
where t
i
= 1 represents a positive
and t
i
= 0 a negative test.
The simplest approach to measure the performance
of test T is to use the probability of misclassification
(PMC)
PMC =
cardinality of {i = 1,…, Ny
i
t
i
}
N
respectively, the accuracy (ACC) = 1 - PMC
However, such a single performance measure may
be misleading, as there are two possibilities for a
correct respectively, wrong decision of the diagnostic
test that are the correct respectively, wrong prediction
of the presence or absence of the medical condition [1].
Thus, a pair of criteria should be used to obtain an
exact description of the performance. In general, the
results of a test can be summarized by the so called
confusion matrix.
The confusion matrix whose structure is presented
in Table 1 includes the information on the prevalence
(Pr) for the considered group.
Pr =
TP + FN
TP + FN + TN + FP
*Address corresponding to this author at the Department of Mechanical and
Process Engineering, Furtwangen University, Jakob-Kienzle-Str. 17, D-78054
VS-Schwenningen, Germany; Tel: +49 (0) 7720 307-4746; Fax: +49 (0) 7720
307-4727; E-mail: Matthias.Kohl@hs-furtwangen.de
Table 1: Confusion Matrix for Binary Classification
Test result
0 1
0 True negative (TN) False positive (FP)
True
situation
1 False negative (FN) True positive (TP)
In addition, it is the basis for the definition of various
performance measures. The percentage of correct
positive tests for patients having the medical condition
is called sensitivity (Se), whereas the percentage of
correct negative tests for patients not having the
medical condition is called specificity (Sp).
Se =
TP
TP + FN
SP =
TN
TN + FP
The accuracy can also be expressed as a weighted
sum of sensitivity and specificity
ACC = Pr * Se + (1 Pr) * SP
The positive predictive value (PPV) is the probability
that a patient with a positive test has the medical
condition and the negative predictive value (NPV) is the
probability that a patient with a negative test does not
have the medical condition.
PPV =
TP
TP + FP
NPV =
TN
TN + FN
The positive likelihood ratio (PLR) tells how likely
patients with the medical condition are to have a
positive test compared to patients without the medical
condition. The negative likelihood ratio (NLR) tells how
likely patients with the medical condition are to have a
negative result compared to patients without the
medical condition.