External Validations of Cardiovascular Clinical Prediction Models: A Large-Scale Review of the Literature

Benjamin S Wessler; Jason Nelson; Jinny G Park; Hannah McGinnes; Gaurav Gulati; Riley Brazil; Ben Van Calster; David van Klaveren; Esmee Venema; Ewout Steyerberg; Jessica K Paulus; David M Kent

doi:10.1161/CIRCOUTCOMES.121.007858

External Validations of Cardiovascular Clinical Prediction Models: A Large-Scale Review of the Literature

Circ Cardiovasc Qual Outcomes. 2021 Aug;14(8):e007858. doi: 10.1161/CIRCOUTCOMES.121.007858. Epub 2021 Aug 3.

Authors

Benjamin S Wessler^{1

2}, Jason Nelson¹, Jinny G Park¹, Hannah McGinnes¹, Gaurav Gulati^{1

2}, Riley Brazil¹, Ben Van Calster³, David van Klaveren^{1

4}, Esmee Venema^{5

6}, Ewout Steyerberg^{7

5}, Jessica K Paulus¹, David M Kent¹

Affiliations

¹ Predictive Analytics and Comparative Effectiveness (PACE) (B.S.W., J.N., J.G.P., H.G., G.G., R.B., D.v.K., J.K.P., D.M.K.), Tufts Medical Center, Boston, MA.
² Division of Cardiology (B.S.W., G.G.), Tufts Medical Center, Boston, MA.
³ KU Leuven, Department of Development and Regeneration, Belgium (B.V.C.).
⁴ Department of Biomedical Data Sciences (D.v.K.), Leiden University Medical Centre, Netherlands.
⁵ Department of Public Health (E.V., E.S.), Erasmus MC University Medical Center, Rotterdam, the Netherlands.
⁶ Department of Neurology (E.V.), Erasmus MC University Medical Center, Rotterdam, the Netherlands.
⁷ Department of Biomedical Data Sciences (E.S.), Leiden University Medical Centre, Netherlands.

Abstract

Background: There are many clinical prediction models (CPMs) available to inform treatment decisions for patients with cardiovascular disease. However, the extent to which they have been externally tested, and how well they generally perform has not been broadly evaluated.

Methods: A SCOPUS citation search was run on March 22, 2017 to identify external validations of cardiovascular CPMs in the Tufts Predictive Analytics and Comparative Effectiveness CPM Registry. We assessed the extent of external validation, performance heterogeneity across databases, and explored factors associated with model performance, including a global assessment of the clinical relatedness between the derivation and validation data.

Results: We identified 2030 external validations of 1382 CPMs. Eight hundred seven (58%) of the CPMs in the Registry have never been externally validated. On average, there were 1.5 validations per CPM (range, 0-94). The median external validation area under the receiver operating characteristic curve was 0.73 (25th-75th percentile [interquartile range (IQR)], 0.66-0.79), representing a median percent decrease in discrimination of -11.1% (IQR, -32.4% to +2.7%) compared with performance on derivation data. 81% (n=1333) of validations reporting area under the receiver operating characteristic curve showed discrimination below that reported in the derivation dataset. 53% (n=983) of the validations report some measure of CPM calibration. For CPMs evaluated more than once, there was typically a large range of performance. Of 1702 validations classified by relatedness, the percent change in discrimination was -3.7% (IQR, -13.2 to 3.1) for closely related validations (n=123), -9.0 (IQR, -27.6 to 3.9) for related validations (n=862), and -17.2% (IQR, -42.3 to 0) for distantly related validations (n=717; P<0.001).

Conclusions: Many published cardiovascular CPMs have never been externally validated, and for those that have, apparent performance during development is often overly optimistic. A single external validation appears insufficient to broadly understand the performance heterogeneity across different settings.

Keywords: calibration; cardiovascular disease; decision making; literature review.

Publication types

Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't
Review

MeSH terms

Cardiovascular Diseases* / diagnosis
Cardiovascular Diseases* / epidemiology
Cardiovascular Diseases* / therapy
Humans
ROC Curve

Abstract

Publication types

MeSH terms

Grants and funding