Start Over

A relationship between the incremental values of area under the ROC curve and of area under the precision-recall curve

Authors :: Qian M. Zhou
Lu Zhe
Russell J. Brooke
Melissa M. Hudson
Yan Yuan
Source :: Diagnostic and Prognostic Research, Vol 5, Iss 1, Pp 1-15 (2021)
Publication Year :: 2021
Publisher :: BMC, 2021.
Abstract: Abstract Background Incremental value (IncV) evaluates the performance change between an existing risk model and a new model. Different IncV metrics do not always agree with each other. For example, compared with a prescribed-dose model, an ovarian-dose model for predicting acute ovarian failure has a slightly lower area under the receiver operating characteristic curve (AUC) but increases the area under the precision-recall curve (AP) by 48%. This phenomenon of disagreement is not uncommon, and can create confusion when assessing whether the added information improves the model prediction accuracy. Methods In this article, we examine the analytical connections and differences between the AUC IncV (ΔAUC) and AP IncV (ΔAP). We also compare the true values of these two IncV metrics in a numerical study. Additionally, as both are semi-proper scoring rules, we compare them with a strictly proper scoring rule: the IncV of the scaled Brier score (ΔsBrS) in the numerical study. Results We demonstrate that ΔAUC and ΔAP are both weighted averages of the changes (from the existing model to the new one) in separating the risk score distributions between events and non-events. However, ΔAP assigns heavier weights to the changes in higher-risk regions, whereas ΔAUC weights the changes equally. Due to this difference, the two IncV metrics can disagree, and the numerical study shows that their disagreement becomes more pronounced as the event rate decreases. In the numerical study, we also find that ΔAP has a wide range, from negative to positive, but the range of ΔAUC is much smaller. In addition, ΔAP and ΔsBrS are highly consistent, but ΔAUC is negatively correlated with ΔsBrS and ΔAP when the event rate is low. Conclusions ΔAUC treats the wins and losses of a new risk model equally across different risk regions. When neither the existing or new model is the true model, this equality could attenuate a superior performance of the new model for a sub-region. In contrast, ΔAP accentuates the change in the prediction accuracy for higher-risk regions.

Subjects :: Prediction performance
AUC
Area under precision-recall curve
Brier score
Proper scoring rules
Rare outcome
Medicine (General)
R5-920

Details

Language :: English
ISSN :: 23977523
Volume :: 5
Issue :: 1
Database :: Directory of Open Access Journals
Journal :: Diagnostic and Prognostic Research
Publication Type :: Academic Journal
Accession number :: edsdoj.25fbabd3553d4d829d25055e4dfe5ca8
Document Type :: article
Full Text :: https://doi.org/10.1186/s41512-021-00102-w

Full Text Access

View/download PDF

Tools

Email
Cite

Printer

Authors Abstract Subjects Details

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

A relationship between the incremental values of area under the ROC curve and of area under the precision-recall curve

Abstract

Subjects

Details

Tools

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

A relationship between the incremental values of area under the ROC curve and of area under the precision-recall curve

Abstract

Subjects

Details

Tools

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources