TY - JOUR
T1 - A comparison of modeling approaches for static and dynamic prediction of central-line bloodstream infections using electronic health records (part 1)
T2 - regression models
AU - Gao, Shan
AU - Albu, Elena
AU - Putter, Hein
AU - Stijnen, Pieter
AU - Rademakers, Frank E.
AU - Cossey, Veerle
AU - Debaveye, Yves
AU - Janssens, Christel
AU - Van Calster, Ben
AU - Wynants, Laure
PY - 2025/7/21
Y1 - 2025/7/21
N2 - BackgroundHospitals register information in the electronic health records (EHRs) continuously until discharge or death. As such, there is no censoring for in-hospital outcomes. We aimed to compare different static and dynamic regression modeling approaches to predict central line-associated bloodstream infections (CLABSIs) in EHR while accounting for competing events precluding CLABSI.MethodsWe analyzed data from 30,862 catheter episodes at University Hospitals Leuven from 2012 and 2013 to predict 7-day risk of CLABSI. Competing events are discharge and death. Static models using information at catheter onset included logistic, multinomial logistic, Cox, cause-specific hazard, and Fine-Gray regression. Dynamic models updated predictions daily up to 30 days after catheter onset (i.e., landmarks 0 to 30 days) and included landmark supermodel extensions of the static models, separate Fine-Gray models per landmark time, and regularized multi-task learning (RMTL). Model performance was assessed using 100 random 2:1 train-test splits.ResultsThe Cox model performed worst of all static models in terms of area under the receiver operating characteristic curve (AUROC) and calibration. Dynamic landmark supermodels reached peak AUROCs between 0.741 and 0.747 at landmark 5. The Cox landmark supermodel had the worst AUROCs (<= 0.731) and calibration up to landmark 7. Separate Fine-Gray models per landmark performed worst for later landmarks, when the number of patients at risk was low.ConclusionsCategorical and time-to-event approaches had similar performance in the static and dynamic settings, except Cox models. Ignoring competing risks caused problems for risk prediction in the time-to-event framework (Cox), but not in the categorical framework (logistic regression).
AB - BackgroundHospitals register information in the electronic health records (EHRs) continuously until discharge or death. As such, there is no censoring for in-hospital outcomes. We aimed to compare different static and dynamic regression modeling approaches to predict central line-associated bloodstream infections (CLABSIs) in EHR while accounting for competing events precluding CLABSI.MethodsWe analyzed data from 30,862 catheter episodes at University Hospitals Leuven from 2012 and 2013 to predict 7-day risk of CLABSI. Competing events are discharge and death. Static models using information at catheter onset included logistic, multinomial logistic, Cox, cause-specific hazard, and Fine-Gray regression. Dynamic models updated predictions daily up to 30 days after catheter onset (i.e., landmarks 0 to 30 days) and included landmark supermodel extensions of the static models, separate Fine-Gray models per landmark time, and regularized multi-task learning (RMTL). Model performance was assessed using 100 random 2:1 train-test splits.ResultsThe Cox model performed worst of all static models in terms of area under the receiver operating characteristic curve (AUROC) and calibration. Dynamic landmark supermodels reached peak AUROCs between 0.741 and 0.747 at landmark 5. The Cox landmark supermodel had the worst AUROCs (<= 0.731) and calibration up to landmark 7. Separate Fine-Gray models per landmark performed worst for later landmarks, when the number of patients at risk was low.ConclusionsCategorical and time-to-event approaches had similar performance in the static and dynamic settings, except Cox models. Ignoring competing risks caused problems for risk prediction in the time-to-event framework (Cox), but not in the categorical framework (logistic regression).
KW - Risk prediction
KW - Central line-associated bloodstream infection
KW - Dynamic model
KW - Logistic regression
KW - Survival analysis
KW - CARE-ASSOCIATED INFECTIONS
KW - COX REGRESSION
KW - RISK
KW - LANDMARKING
KW - IMPACT
U2 - 10.1186/s41512-025-00199-3
DO - 10.1186/s41512-025-00199-3
M3 - Article
SN - 2397-7523
VL - 9
JO - Diagnostic and prognostic research
JF - Diagnostic and prognostic research
IS - 1
M1 - 20
ER -