17.10.2 Algorithms (ROC Curve)ROCCurve-Algorithm
In this part, Following notation will be used.
: Test result score for case
: Number of true positive decisions
: Number of false negative decisions
: Number of true negative decisions
: Number of false positive decisions
: Number of cases with negative actual state
: Number of cases with positive actual state
: Number of true negative cases with test results equal to
: : Number of true positive cases with test results greater than
: : Number of true positive cases with test results equal to
: : Number of true negative cases with test results less than
ROC Values
1- Specificity (X):
Sensitivity (Y):
The area under the ROC curve
Let be the scale of the test result variable. Denote by the values for cases with negative actual states and the values for cases with positive actual states. Then, the nonparametric approximation of the ”true” area under the ROC curve, ,is
where is the sample size of +, is the sample size of -, and
Note that is the observed area under the ROC curve, which connects successive points by a straight line, i.e., by the trapezoidal rule.
An alternative way to compute is as follows:
The SE of the area under the ROC curve statistic
The standard deviation of is estimated by:
where
and
The asymptotic confidence interval of the area under the ROC curve
A 2-sided asymptotic confidence interval for the true area under the ROC curve is
The asymptotic P-value under the null hypothesis that vs. the alternative hypothesis that
Since is asymptotically normal under the null hypothesis that , we can calculate the asymptotic P-value under the null hypothesis that vs. the alternative hypothesis that :
In the nonparametric case,
Optimal Cut-Point Value
The cut-point value is defined by the equality maximization of these two quantities (SpEqualSe), which is min( abs(1-x-y) ) for ROC curve.
|