TY - JOUR
T1 - Using the area under an estimated ROC curve to test the adequacy of binary predictors*
AU - Lieli, Robert P.
AU - Hsu, Yu Chin
N1 - Publisher Copyright:
© 2018, © 2018 The Author(s). Published by Informa UK Limited, trading as Taylor & Francis Group.
PY - 2019/1/2
Y1 - 2019/1/2
N2 - We consider using the area under an empirical receiver operating characteristic curve to test the hypothesis that a predictive index combined with a range of cutoffs performs no better than pure chance in forecasting a binary outcome. This corresponds to the null hypothesis that the area in question, denoted as AUC, is 1/2. We show that if the predictive index comes from a first-stage regression model estimated over the same data set, then testing the null based on the standard asymptotic normality results leads to severe size distortion in general settings. We then analytically derive the proper asymptotic null distribution of the empirical AUC in a special case; namely, when the first-stage regressors are Bernoulli random variables. This distribution can be utilised to construct a fully in-sample test of H0 : AUC = 1/2 with correct size and more power than out-of-sample tests based on sample splitting, though practical application becomes cumbersome with more than two regressors.
AB - We consider using the area under an empirical receiver operating characteristic curve to test the hypothesis that a predictive index combined with a range of cutoffs performs no better than pure chance in forecasting a binary outcome. This corresponds to the null hypothesis that the area in question, denoted as AUC, is 1/2. We show that if the predictive index comes from a first-stage regression model estimated over the same data set, then testing the null based on the standard asymptotic normality results leads to severe size distortion in general settings. We then analytically derive the proper asymptotic null distribution of the empirical AUC in a special case; namely, when the first-stage regressors are Bernoulli random variables. This distribution can be utilised to construct a fully in-sample test of H0 : AUC = 1/2 with correct size and more power than out-of-sample tests based on sample splitting, though practical application becomes cumbersome with more than two regressors.
KW - Area under the ROC curve
KW - binary classification
KW - in-sample hypothesis testing
KW - model evaluation
KW - overfitting
UR - http://www.scopus.com/inward/record.url?scp=85055701815&partnerID=8YFLogxK
U2 - 10.1080/10485252.2018.1537440
DO - 10.1080/10485252.2018.1537440
M3 - Article
AN - SCOPUS:85055701815
SN - 1048-5252
VL - 31
SP - 100
EP - 130
JO - Journal of Nonparametric Statistics
JF - Journal of Nonparametric Statistics
IS - 1
ER -