TY - JOUR
T1 - At-admission prediction of mortality and pulmonary embolism in an international cohort of hospitalised patients with COVID-19 using statistical and machine learning methods
AU - Mesinovic, Munib
AU - Wong, Xin Ci
AU - Rajahram, Giri Shan
AU - Citarella, Barbara Wanjiru
AU - Peariasamy, Kalaiarasu M.
AU - van Someren Greve, Frank
AU - Olliaro, Piero
AU - Merson, Laura
AU - Clifton, Lei
AU - Kartsonaki, Christiana
AU - Abdukahil, Sheryl Ann
AU - Abdulkadir, Nurul Najmee
AU - Abe, Ryuzo
AU - Abel, Laurent
AU - Abrous, Amal
AU - Absil, Lara
AU - Acker, Andrew
AU - Adachi, Shingo
AU - Adam, Elisabeth
AU - Adriano, Enrico
AU - Adrião, Diana
AU - Ageel, Saleh Al
AU - Ahmed, Shakeel
AU - Aiello, Marina
AU - Ainscough, Kate
AU - Airlangga, Eka
AU - Aisa, Tharwat
AU - Hssain, Ali Ait
AU - Tamlihat, Younes Ait
AU - Akimoto, Takako
AU - Akmal, Ernita
AU - Qasim, Eman Al
AU - Alalqam, Razi
AU - Alberti, Angela
AU - Al-dabbous, Tala
AU - Alegesan, Senthilkumar
AU - Alegre, Cynthia
AU - Alessi, Marta
AU - Alex, Beatrice
AU - Alexandre, Kévin
AU - Al-Fares, Abdulrahman
AU - Alfoudri, Huda
AU - Ali, Adam
AU - Ali, Imran
AU - Alidjnou, Kazali Enagnon
AU - Aliudin, Jeffrey
AU - Alkhafajee, Qabas
AU - Allavena, Clotilde
AU - Allou, Nathalie
AU - Alves, João
AU - ISARIC Characterisation Group
AU - Mazankowski Heart Institute
AU - de Piero, Maria E.
N1 - Funding Information:
This work was made possible by the UK Foreign, Commonwealth and Development Office and Wellcome [215091/Z/18/Z, 222410/Z/21/Z, 225288/Z/22/Z and 220757/Z/20/Z]; the Bill & Melinda Gates Foundation [OPP1209135]; the philanthropic support of the donors to the University of Oxford\u2019s COVID-19 Research Response Fund (0009109); CIHR Coronavirus Rapid Research Funding Opportunity OV2170359 and the coordination in Canada by Sunnybrook Research Institute; endorsement of the Irish Critical Care-Clinical Trials Group, co-ordination in Ireland by the Irish Critical Care-Clinical Trials Network at University College Dublin and funding by the Health Research Board of Ireland [CTN-2014-12]; the Rapid European COVID-19 Emergency Response research (RECOVER) [H2020 project 101003589] and European Clinical Research Alliance on Infectious Diseases (ECRAID) [965313]; the COVID clinical management team, AIIMS, Rishikesh, India; the COVID-19 Clinical Management team, Manipal Hospital Whitefield, Bengaluru, India; Cambridge NIHR Biomedical Research Centre; the dedication and hard work of the Groote Schuur Hospital Covid ICU Team and supported by the Groote Schuur nursing and University of Cape Town registrar bodies coordinated by the Division of Critical Care at the University of Cape Town; the Liverpool School of Tropical Medicine and the University of Oxford; the dedication and hard work of the Norwegian SARS-CoV-2 study team and the Research Council of Norway grant no 312780, and a philanthropic donation from Vivaldi Invest A/S owned by Jon Stephenson von Tetzchner; Imperial NIHR Biomedical Research Centre; the Comprehensive Local Research Networks (CLRNs) of which PJMO is an NIHR Senior Investigator (NIHR201385); Innovative Medicines Initiative Joint Undertaking under Grant Agreement No. 115523 COMBACTE, resources of which are composed of financial contribution from the European Union\u2019s Seventh Framework Programme (FP7/2007- 2013) and EFPIA companies, in-kind contribution; Stiftungsfonds zur F\u00F6rderung der Bek\u00E4mpfung der Tuberkulose und anderer Lungenkrankheiten of the City of Vienna, Project Number: APCOV22BGM; Italian Ministry of Health \u201CFondi Ricerca corrente-L1P6\u201D to IRCCS Ospedale Sacro Cuore-Don Calabria; Australian Department of Health grant (3273191); Gender Equity Strategic Fund at University of Queensland, Artificial Intelligence for Pandemics (A14PAN) at University of Queensland, the Australian Research Council Centre of Excellence for Engineered Quantum Systems (EQUS, CE170100009), the Prince Charles Hospital Foundation, Australia; grants from Instituto de Salud Carlos III, Ministerio de Ciencia, Spain; Brazil, National Council for Scientific and Technological Development Scholarship number 303953/2018-7; the Firland Foundation, Shoreline, Washington, USA; the French COVID cohort (NCT04262921) is sponsored by INSERM and is funded by the REACTing (REsearch & ACtion emergING infectious diseases) consortium and by a grant of the French Ministry of Health (PHRC n20-0424); a grant from foundation Bevordering Onderzoek Franciscus; the South Eastern Norway Health Authority and the Research Council of Norway; Institute for Clinical Research (ICR), National Institutes of Health (NIH) supported by the Ministry of Health Malaysia; preparedness work conducted by the Short Period Incidence Study of Severe Acute Respiratory Infection; the U.S. DoD Armed Forces Health Surveillance Division, Global Emerging Infectious Diseases Branch to the U.S Naval Medical Research Unit No. TWO (NAMRU-2) (Work Unit #: P0153_21_N2). These authors would like to thank Vysnova Partners, Inc. for the management of this research project. The Lao-Oxford-Mahosot Hospital-Wellcome Trust Research Unit is funded by the Wellcome Trust.
Funding Information:
M. Mesinovic appreciates the support of the EPSRC Center for Doctoral Training in Health Data Science (EP/S02428X/1) and the Rhodes Trust.
Funding Information:
This work uses data provided by patients and collected by the NHS as part of their care and support #DataSavesLives. The data used for this research were obtained from ISARIC4C. We are extremely grateful to the 2648 frontline NHS clinical and research staff and volunteer medical students who collected these data in challenging circumstances; and the generosity of the patients and their families for their individual contributions in these difficult times. The COVID-19 Clinical Information Network (CO-CIN) data was collated by ISARIC4C Investigators. Data and Material provision was supported by grants from: the National Institute for Health Research (NIHR; award CO-CIN-01), the Medical Research Council (MRC; grant MC_PC_19059), and by the NIHR Health Protection Research Unit (HPRU) in Emerging and Zoonotic Infections at University of Liverpool in partnership with Public Health England (PHE), (award 200907), NIHR HPRU in Respiratory Infections at Imperial College London with PHE (award 200927), Liverpool Experimental Cancer Medicine Centre (grant C18616/A25153), NIHR Biomedical Research Centre at Imperial College London (award ISBRC-1215-20013), and NIHR Clinical Research Network providing infrastructure support. We also acknowledge the support of Jeremy J Farrar and Nahoko Shindo.
Funding Information:
GR declares receiving a grant from United States National Institute of Health, R01 Grant: Emerging Zoonotic Malaria in Malaysia: Strenghtening Surveillance and Evaluating Population Genetics Structure to Improve Regional Risk Prediction Tool and travel support from the European Society of Clinical Microbiology and Infectious Disease (ESCMID) for observership at European Centre for Disease Prevention and Control (ECDC). All authors declare no competing interests.
Publisher Copyright:
© The Author(s) 2024.
PY - 2024/12/1
Y1 - 2024/12/1
N2 - By September 2022, more than 600 million cases of SARS-CoV-2 infection have been reported globally, resulting in over 6.5 million deaths. COVID-19 mortality risk estimators are often, however, developed with small unrepresentative samples and with methodological limitations. It is highly important to develop predictive tools for pulmonary embolism (PE) in COVID-19 patients as one of the most severe preventable complications of COVID-19. Early recognition can help provide life-saving targeted anti-coagulation therapy right at admission. Using a dataset of more than 800,000 COVID-19 patients from an international cohort, we propose a cost-sensitive gradient-boosted machine learning model that predicts occurrence of PE and death at admission. Logistic regression, Cox proportional hazards models, and Shapley values were used to identify key predictors for PE and death. Our prediction model had a test AUROC of 75.9% and 74.2%, and sensitivities of 67.5% and 72.7% for PE and all-cause mortality respectively on a highly diverse and held-out test set. The PE prediction model was also evaluated on patients in UK and Spain separately with test results of 74.5% AUROC, 63.5% sensitivity and 78.9% AUROC, 95.7% sensitivity. Age, sex, region of admission, comorbidities (chronic cardiac and pulmonary disease, dementia, diabetes, hypertension, cancer, obesity, smoking), and symptoms (any, confusion, chest pain, fatigue, headache, fever, muscle or joint pain, shortness of breath) were the most important clinical predictors at admission. Age, overall presence of symptoms, shortness of breath, and hypertension were found to be key predictors for PE using our extreme gradient boosted model. This analysis based on the, until now, largest global dataset for this set of problems can inform hospital prioritisation policy and guide long term clinical research and decision-making for COVID-19 patients globally. Our machine learning model developed from an international cohort can serve to better regulate hospital risk prioritisation of at-risk patients.
AB - By September 2022, more than 600 million cases of SARS-CoV-2 infection have been reported globally, resulting in over 6.5 million deaths. COVID-19 mortality risk estimators are often, however, developed with small unrepresentative samples and with methodological limitations. It is highly important to develop predictive tools for pulmonary embolism (PE) in COVID-19 patients as one of the most severe preventable complications of COVID-19. Early recognition can help provide life-saving targeted anti-coagulation therapy right at admission. Using a dataset of more than 800,000 COVID-19 patients from an international cohort, we propose a cost-sensitive gradient-boosted machine learning model that predicts occurrence of PE and death at admission. Logistic regression, Cox proportional hazards models, and Shapley values were used to identify key predictors for PE and death. Our prediction model had a test AUROC of 75.9% and 74.2%, and sensitivities of 67.5% and 72.7% for PE and all-cause mortality respectively on a highly diverse and held-out test set. The PE prediction model was also evaluated on patients in UK and Spain separately with test results of 74.5% AUROC, 63.5% sensitivity and 78.9% AUROC, 95.7% sensitivity. Age, sex, region of admission, comorbidities (chronic cardiac and pulmonary disease, dementia, diabetes, hypertension, cancer, obesity, smoking), and symptoms (any, confusion, chest pain, fatigue, headache, fever, muscle or joint pain, shortness of breath) were the most important clinical predictors at admission. Age, overall presence of symptoms, shortness of breath, and hypertension were found to be key predictors for PE using our extreme gradient boosted model. This analysis based on the, until now, largest global dataset for this set of problems can inform hospital prioritisation policy and guide long term clinical research and decision-making for COVID-19 patients globally. Our machine learning model developed from an international cohort can serve to better regulate hospital risk prioritisation of at-risk patients.
U2 - 10.1038/s41598-024-63212-7
DO - 10.1038/s41598-024-63212-7
M3 - Article
SN - 2045-2322
VL - 14
JO - Scientific Reports
JF - Scientific Reports
IS - 1
M1 - 16387
ER -