Can predicting COVID-19 mortality in a European cohort using only demographic and comorbidity data surpass age-based prediction: An externally validated study

Avishek Chatterjee; Guangyao Wu; Sergey Primakov; Cary Oberije; Henry Woodruff; Pieter Kubben; Ronald Henry; Marcel J. H. Aries; Martijn Beudel; Peter G. Noordzij; Tom Dormans; Niels C. Gritters van den Oever; Joop P. van den Bergh; Caroline E. Wyers; Suat Simsek; Renee Douma; Auke C. Reidinga; Martijn D. de Kruif; Julien Guiot; Anne-Noelle Frix; Renaud Louis; Michel Moutschen; Pierre Lovinfosse; Philippe Lambin

doi:10.1371/journal.pone.0249920

Can predicting COVID-19 mortality in a European cohort using only demographic and comorbidity data surpass age-based prediction: An externally validated study

Avishek Chatterjee^*, Guangyao Wu, Sergey Primakov, Cary Oberije, Henry Woodruff, Pieter Kubben, Ronald Henry, Marcel J. H. Aries, Martijn Beudel, Peter G. Noordzij, Tom Dormans, Niels C. Gritters van den Oever, Joop P. van den Bergh, Caroline E. Wyers, Suat Simsek, Renee Douma, Auke C. Reidinga, Martijn D. de Kruif, Julien Guiot, Anne-Noelle FrixRenaud Louis, Michel Moutschen, Pierre Lovinfosse, Philippe Lambin

^*Corresponding author for this work

Research output: Contribution to journal › Article › Academic › peer-review

Abstract

Objective

To establish whether one can build a mortality prediction model for COVID-19 patients based solely on demographics and comorbidity data that outperforms age alone. Such a model could be a precursor to implementing smart lockdowns and vaccine distribution strategies.

Methods

The training cohort comprised 2337 COVID-19 inpatients from nine hospitals in The Netherlands. The clinical outcome was death within 21 days of being discharged. The features were derived from electronic health records collected during admission. Three feature selection methods were used: LASSO, univariate using a novel metric, and pairwise (age being half of each pair). 478 patients from Belgium were used to test the model. All modeling attempts were compared against an age-only model.

Results

In the training cohort, the mortality group's median age was 77 years (interquartile range = 70-83), higher than the non-mortality group (median = 65, IQR = 55-75). The incidence of former/active smokers, male gender, hypertension, diabetes, dementia, cancer, chronic obstructive pulmonary disease, chronic cardiac disease, chronic neurological disease, and chronic kidney disease was higher in the mortality group. All stated differences were statistically significant after Bonferroni correction. LASSO selected eight features, novel univariate chose five, and pairwise chose none. No model was able to surpass an age-only model in the external validation set, where age had an AUC of 0.85 and a balanced accuracy of 0.77.

Conclusion

When applied to an external validation set, we found that an age-only mortality model outperformed all modeling attempts (curated on ) using three feature selection methods on 22 demographic and comorbid features.

Original language	English
Article number	e0249920
Number of pages	15
Journal	PLOS ONE
Volume	16
Issue number	4
DOIs	https://doi.org/10.1371/journal.pone.0249920
Publication status	Published - 15 Apr 2021

Keywords

RISK
OBESITY

Access to Document

10.1371/journal.pone.0249920Licence: CC BY

Cite this

Chatterjee, A., Wu, G., Primakov, S., Oberije, C., Woodruff, H., Kubben, P., Henry, R., Aries, M. J. H., Beudel, M., Noordzij, P. G., Dormans, T., van den Oever, N. C. G., van den Bergh, J. P., Wyers, C. E., Simsek, S., Douma, R., Reidinga, A. C., de Kruif, M. D., Guiot, J., ... Lambin, P. (2021). Can predicting COVID-19 mortality in a European cohort using only demographic and comorbidity data surpass age-based prediction: An externally validated study. PLOS ONE, 16(4), Article e0249920. https://doi.org/10.1371/journal.pone.0249920

@article{3f81ef7f43d4405cb0821e0afc170e51,

title = "Can predicting COVID-19 mortality in a European cohort using only demographic and comorbidity data surpass age-based prediction: An externally validated study",

abstract = "ObjectiveTo establish whether one can build a mortality prediction model for COVID-19 patients based solely on demographics and comorbidity data that outperforms age alone. Such a model could be a precursor to implementing smart lockdowns and vaccine distribution strategies.MethodsThe training cohort comprised 2337 COVID-19 inpatients from nine hospitals in The Netherlands. The clinical outcome was death within 21 days of being discharged. The features were derived from electronic health records collected during admission. Three feature selection methods were used: LASSO, univariate using a novel metric, and pairwise (age being half of each pair). 478 patients from Belgium were used to test the model. All modeling attempts were compared against an age-only model.ResultsIn the training cohort, the mortality group's median age was 77 years (interquartile range = 70-83), higher than the non-mortality group (median = 65, IQR = 55-75). The incidence of former/active smokers, male gender, hypertension, diabetes, dementia, cancer, chronic obstructive pulmonary disease, chronic cardiac disease, chronic neurological disease, and chronic kidney disease was higher in the mortality group. All stated differences were statistically significant after Bonferroni correction. LASSO selected eight features, novel univariate chose five, and pairwise chose none. No model was able to surpass an age-only model in the external validation set, where age had an AUC of 0.85 and a balanced accuracy of 0.77.ConclusionWhen applied to an external validation set, we found that an age-only mortality model outperformed all modeling attempts (curated on ) using three feature selection methods on 22 demographic and comorbid features.",

keywords = "RISK, OBESITY",

author = "Avishek Chatterjee and Guangyao Wu and Sergey Primakov and Cary Oberije and Henry Woodruff and Pieter Kubben and Ronald Henry and Aries, {Marcel J. H.} and Martijn Beudel and Noordzij, {Peter G.} and Tom Dormans and {van den Oever}, {Niels C. Gritters} and {van den Bergh}, {Joop P.} and Wyers, {Caroline E.} and Suat Simsek and Renee Douma and Reidinga, {Auke C.} and {de Kruif}, {Martijn D.} and Julien Guiot and Anne-Noelle Frix and Renaud Louis and Michel Moutschen and Pierre Lovinfosse and Philippe Lambin",

note = "Publisher Copyright: {\textcopyright} 2021 Chatterjee et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.",

year = "2021",

month = apr,

day = "15",

doi = "10.1371/journal.pone.0249920",

language = "English",

volume = "16",

journal = "PLOS ONE",

issn = "1932-6203",

publisher = "Public Library of Science",

number = "4",

}

Chatterjee, A, Wu, G, Primakov, S, Oberije, C, Woodruff, H , Kubben, P , Henry, R , Aries, MJH, Beudel, M, Noordzij, PG, Dormans, T, van den Oever, NCG, van den Bergh, JP , Wyers, CE, Simsek, S, Douma, R, Reidinga, AC, de Kruif, MD, Guiot, J, Frix, A-N, Louis, R, Moutschen, M, Lovinfosse, P & Lambin, P 2021, 'Can predicting COVID-19 mortality in a European cohort using only demographic and comorbidity data surpass age-based prediction: An externally validated study', PLOS ONE, vol. 16, no. 4, e0249920. https://doi.org/10.1371/journal.pone.0249920

TY - JOUR

T1 - Can predicting COVID-19 mortality in a European cohort using only demographic and comorbidity data surpass age-based prediction

T2 - An externally validated study

AU - Chatterjee, Avishek

AU - Wu, Guangyao

AU - Primakov, Sergey

AU - Oberije, Cary

AU - Woodruff, Henry

AU - Kubben, Pieter

AU - Henry, Ronald

AU - Aries, Marcel J. H.

AU - Beudel, Martijn

AU - Noordzij, Peter G.

AU - Dormans, Tom

AU - van den Oever, Niels C. Gritters

AU - van den Bergh, Joop P.

AU - Wyers, Caroline E.

AU - Simsek, Suat

AU - Douma, Renee

AU - Reidinga, Auke C.

AU - de Kruif, Martijn D.

AU - Guiot, Julien

AU - Frix, Anne-Noelle

AU - Louis, Renaud

AU - Moutschen, Michel

AU - Lovinfosse, Pierre

AU - Lambin, Philippe

N1 - Publisher Copyright: © 2021 Chatterjee et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

PY - 2021/4/15

Y1 - 2021/4/15

N2 - ObjectiveTo establish whether one can build a mortality prediction model for COVID-19 patients based solely on demographics and comorbidity data that outperforms age alone. Such a model could be a precursor to implementing smart lockdowns and vaccine distribution strategies.MethodsThe training cohort comprised 2337 COVID-19 inpatients from nine hospitals in The Netherlands. The clinical outcome was death within 21 days of being discharged. The features were derived from electronic health records collected during admission. Three feature selection methods were used: LASSO, univariate using a novel metric, and pairwise (age being half of each pair). 478 patients from Belgium were used to test the model. All modeling attempts were compared against an age-only model.ResultsIn the training cohort, the mortality group's median age was 77 years (interquartile range = 70-83), higher than the non-mortality group (median = 65, IQR = 55-75). The incidence of former/active smokers, male gender, hypertension, diabetes, dementia, cancer, chronic obstructive pulmonary disease, chronic cardiac disease, chronic neurological disease, and chronic kidney disease was higher in the mortality group. All stated differences were statistically significant after Bonferroni correction. LASSO selected eight features, novel univariate chose five, and pairwise chose none. No model was able to surpass an age-only model in the external validation set, where age had an AUC of 0.85 and a balanced accuracy of 0.77.ConclusionWhen applied to an external validation set, we found that an age-only mortality model outperformed all modeling attempts (curated on ) using three feature selection methods on 22 demographic and comorbid features.

AB - ObjectiveTo establish whether one can build a mortality prediction model for COVID-19 patients based solely on demographics and comorbidity data that outperforms age alone. Such a model could be a precursor to implementing smart lockdowns and vaccine distribution strategies.MethodsThe training cohort comprised 2337 COVID-19 inpatients from nine hospitals in The Netherlands. The clinical outcome was death within 21 days of being discharged. The features were derived from electronic health records collected during admission. Three feature selection methods were used: LASSO, univariate using a novel metric, and pairwise (age being half of each pair). 478 patients from Belgium were used to test the model. All modeling attempts were compared against an age-only model.ResultsIn the training cohort, the mortality group's median age was 77 years (interquartile range = 70-83), higher than the non-mortality group (median = 65, IQR = 55-75). The incidence of former/active smokers, male gender, hypertension, diabetes, dementia, cancer, chronic obstructive pulmonary disease, chronic cardiac disease, chronic neurological disease, and chronic kidney disease was higher in the mortality group. All stated differences were statistically significant after Bonferroni correction. LASSO selected eight features, novel univariate chose five, and pairwise chose none. No model was able to surpass an age-only model in the external validation set, where age had an AUC of 0.85 and a balanced accuracy of 0.77.ConclusionWhen applied to an external validation set, we found that an age-only mortality model outperformed all modeling attempts (curated on ) using three feature selection methods on 22 demographic and comorbid features.

KW - RISK

KW - OBESITY

U2 - 10.1371/journal.pone.0249920

DO - 10.1371/journal.pone.0249920

M3 - Article

C2 - 33857224

SN - 1932-6203

VL - 16

JO - PLOS ONE

JF - PLOS ONE

IS - 4

M1 - e0249920

ER -