TY - JOUR
T1 - Can predicting COVID-19 mortality in a European cohort using only demographic and comorbidity data surpass age-based prediction
T2 - An externally validated study
AU - Chatterjee, Avishek
AU - Wu, Guangyao
AU - Primakov, Sergey
AU - Oberije, Cary
AU - Woodruff, Henry
AU - Kubben, Pieter
AU - Henry, Ronald
AU - Aries, Marcel J. H.
AU - Beudel, Martijn
AU - Noordzij, Peter G.
AU - Dormans, Tom
AU - van den Oever, Niels C. Gritters
AU - van den Bergh, Joop P.
AU - Wyers, Caroline E.
AU - Simsek, Suat
AU - Douma, Renee
AU - Reidinga, Auke C.
AU - de Kruif, Martijn D.
AU - Guiot, Julien
AU - Frix, Anne-Noelle
AU - Louis, Renaud
AU - Moutschen, Michel
AU - Lovinfosse, Pierre
AU - Lambin, Philippe
N1 - Publisher Copyright:
© 2021 Chatterjee et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
PY - 2021/4/15
Y1 - 2021/4/15
N2 - ObjectiveTo establish whether one can build a mortality prediction model for COVID-19 patients based solely on demographics and comorbidity data that outperforms age alone. Such a model could be a precursor to implementing smart lockdowns and vaccine distribution strategies.MethodsThe training cohort comprised 2337 COVID-19 inpatients from nine hospitals in The Netherlands. The clinical outcome was death within 21 days of being discharged. The features were derived from electronic health records collected during admission. Three feature selection methods were used: LASSO, univariate using a novel metric, and pairwise (age being half of each pair). 478 patients from Belgium were used to test the model. All modeling attempts were compared against an age-only model.ResultsIn the training cohort, the mortality group's median age was 77 years (interquartile range = 70-83), higher than the non-mortality group (median = 65, IQR = 55-75). The incidence of former/active smokers, male gender, hypertension, diabetes, dementia, cancer, chronic obstructive pulmonary disease, chronic cardiac disease, chronic neurological disease, and chronic kidney disease was higher in the mortality group. All stated differences were statistically significant after Bonferroni correction. LASSO selected eight features, novel univariate chose five, and pairwise chose none. No model was able to surpass an age-only model in the external validation set, where age had an AUC of 0.85 and a balanced accuracy of 0.77.ConclusionWhen applied to an external validation set, we found that an age-only mortality model outperformed all modeling attempts (curated on ) using three feature selection methods on 22 demographic and comorbid features.
AB - ObjectiveTo establish whether one can build a mortality prediction model for COVID-19 patients based solely on demographics and comorbidity data that outperforms age alone. Such a model could be a precursor to implementing smart lockdowns and vaccine distribution strategies.MethodsThe training cohort comprised 2337 COVID-19 inpatients from nine hospitals in The Netherlands. The clinical outcome was death within 21 days of being discharged. The features were derived from electronic health records collected during admission. Three feature selection methods were used: LASSO, univariate using a novel metric, and pairwise (age being half of each pair). 478 patients from Belgium were used to test the model. All modeling attempts were compared against an age-only model.ResultsIn the training cohort, the mortality group's median age was 77 years (interquartile range = 70-83), higher than the non-mortality group (median = 65, IQR = 55-75). The incidence of former/active smokers, male gender, hypertension, diabetes, dementia, cancer, chronic obstructive pulmonary disease, chronic cardiac disease, chronic neurological disease, and chronic kidney disease was higher in the mortality group. All stated differences were statistically significant after Bonferroni correction. LASSO selected eight features, novel univariate chose five, and pairwise chose none. No model was able to surpass an age-only model in the external validation set, where age had an AUC of 0.85 and a balanced accuracy of 0.77.ConclusionWhen applied to an external validation set, we found that an age-only mortality model outperformed all modeling attempts (curated on ) using three feature selection methods on 22 demographic and comorbid features.
KW - RISK
KW - OBESITY
U2 - 10.1371/journal.pone.0249920
DO - 10.1371/journal.pone.0249920
M3 - Article
C2 - 33857224
SN - 1932-6203
VL - 16
JO - PLOS ONE
JF - PLOS ONE
IS - 4
M1 - e0249920
ER -