Machine learning algorithms performed no better than regression models for prognostication in traumatic brain injury

Benjamin Y. Gravesteijn; Daan Nieboer; Ari Ercole; Hester F. Lingsma; David Nelson; Ben van Calster; Ewout W. Steyerberg; CENTER-TBI Collaborators; Caroline M. van Heugten

doi:10.1016/j.jclinepi.2020.03.005

Machine learning algorithms performed no better than regression models for prognostication in traumatic brain injury

Benjamin Y. Gravesteijn^*, Daan Nieboer, Ari Ercole, Hester F. Lingsma, David Nelson, Ben van Calster, Ewout W. Steyerberg, CENTER-TBI Collaborators, Caroline M. van Heugten

^*Corresponding author for this work

Research output: Contribution to journal › Article › Academic › peer-review

Abstract

Objective: We aimed to explore the added value of common machine learning (ML) algorithms for prediction of outcome for moderate and severe traumatic brain injury.

Study Design and Setting: We performed logistic regression (LR), lasso regression, and ridge regression with key baseline predictors in the IMPACT-II database (15 studies, n = 11,022). ML algorithms included support vector machines, random forests, gradient boosting machines, and artificial neural networks and were trained using the same predictors. To assess generalizability of predictions, we performed internal, internal-external, and external validation on the recent CENTER-TBI study (patients with Glasgow Coma Scale

Results: In the IMPACT-II database, 3,332/11,022 (30%) died and 5,233(48%) had unfavorable outcome (Glasgow Outcome Scale less than 4). In the CENTER-TBI study, 348/1,554(29%) died and 651(54%) had unfavorable outcome. Discrimination and calibration varied widely between the studies and less so between the studied algorithms. The mean area under the curve was 0.82 for mortality and 0.77 for unfavorable outcomes in the CENTER-TBI study.

Conclusion: ML algorithms may not outperform traditional regression approaches in a low-dimensional setting for outcome prediction after moderate or severe traumatic brain injury. Similar to regression-based prediction models, ML algorithms should be rigorously validated to ensure applicability to new populations. (C) 2020 The Authors. Published by Elsevier Inc.

Original language	English
Pages (from-to)	95-107
Number of pages	13
Journal	Journal of Clinical Epidemiology
Volume	122
DOIs	https://doi.org/10.1016/j.jclinepi.2020.03.005
Publication status	Published - Jun 2020

Keywords

Machine learning
Prognosis
Traumatic brain injury
Prediction
Data science
Cohort study
BIG DATA
VALIDATION
PROGNOSIS
TUTORIAL
CARE

Access to Document

10.1016/j.jclinepi.2020.03.005Licence: CC BY

Cite this

Gravesteijn, B. Y., Nieboer, D., Ercole, A., Lingsma, H. F., Nelson, D., van Calster, B., Steyerberg, E. W., CENTER-TBI Collaborators, & van Heugten, C. M. (2020). Machine learning algorithms performed no better than regression models for prognostication in traumatic brain injury. Journal of Clinical Epidemiology, 122, 95-107. https://doi.org/10.1016/j.jclinepi.2020.03.005

@article{40becb8d7dbd4277a139d9cb75ed53b6,

title = "Machine learning algorithms performed no better than regression models for prognostication in traumatic brain injury",

abstract = "Objective: We aimed to explore the added value of common machine learning (ML) algorithms for prediction of outcome for moderate and severe traumatic brain injury.Study Design and Setting: We performed logistic regression (LR), lasso regression, and ridge regression with key baseline predictors in the IMPACT-II database (15 studies, n = 11,022). ML algorithms included support vector machines, random forests, gradient boosting machines, and artificial neural networks and were trained using the same predictors. To assess generalizability of predictions, we performed internal, internal-external, and external validation on the recent CENTER-TBI study (patients with Glasgow Coma ScaleResults: In the IMPACT-II database, 3,332/11,022 (30%) died and 5,233(48%) had unfavorable outcome (Glasgow Outcome Scale less than 4). In the CENTER-TBI study, 348/1,554(29%) died and 651(54%) had unfavorable outcome. Discrimination and calibration varied widely between the studies and less so between the studied algorithms. The mean area under the curve was 0.82 for mortality and 0.77 for unfavorable outcomes in the CENTER-TBI study.Conclusion: ML algorithms may not outperform traditional regression approaches in a low-dimensional setting for outcome prediction after moderate or severe traumatic brain injury. Similar to regression-based prediction models, ML algorithms should be rigorously validated to ensure applicability to new populations. (C) 2020 The Authors. Published by Elsevier Inc.",

keywords = "Machine learning, Prognosis, Traumatic brain injury, Prediction, Data science, Cohort study, BIG DATA, VALIDATION, PROGNOSIS, TUTORIAL, CARE",

author = "Gravesteijn, {Benjamin Y.} and Daan Nieboer and Ari Ercole and Lingsma, {Hester F.} and David Nelson and {van Calster}, Ben and Steyerberg, {Ewout W.} and {CENTER-TBI Collaborators} and {van Heugten}, {Caroline M.}",

note = "Publisher Copyright: {\textcopyright} 2020 The Authors",

year = "2020",

month = jun,

doi = "10.1016/j.jclinepi.2020.03.005",

language = "English",

volume = "122",

pages = "95--107",

journal = "Journal of Clinical Epidemiology",

issn = "0895-4356",

publisher = "Elsevier Science",

}

TY - JOUR

T1 - Machine learning algorithms performed no better than regression models for prognostication in traumatic brain injury

AU - Gravesteijn, Benjamin Y.

AU - Nieboer, Daan

AU - Ercole, Ari

AU - Lingsma, Hester F.

AU - Nelson, David

AU - van Calster, Ben

AU - Steyerberg, Ewout W.

AU - CENTER-TBI Collaborators

AU - van Heugten, Caroline M.

PY - 2020/6

Y1 - 2020/6

N2 - Objective: We aimed to explore the added value of common machine learning (ML) algorithms for prediction of outcome for moderate and severe traumatic brain injury.Study Design and Setting: We performed logistic regression (LR), lasso regression, and ridge regression with key baseline predictors in the IMPACT-II database (15 studies, n = 11,022). ML algorithms included support vector machines, random forests, gradient boosting machines, and artificial neural networks and were trained using the same predictors. To assess generalizability of predictions, we performed internal, internal-external, and external validation on the recent CENTER-TBI study (patients with Glasgow Coma ScaleResults: In the IMPACT-II database, 3,332/11,022 (30%) died and 5,233(48%) had unfavorable outcome (Glasgow Outcome Scale less than 4). In the CENTER-TBI study, 348/1,554(29%) died and 651(54%) had unfavorable outcome. Discrimination and calibration varied widely between the studies and less so between the studied algorithms. The mean area under the curve was 0.82 for mortality and 0.77 for unfavorable outcomes in the CENTER-TBI study.Conclusion: ML algorithms may not outperform traditional regression approaches in a low-dimensional setting for outcome prediction after moderate or severe traumatic brain injury. Similar to regression-based prediction models, ML algorithms should be rigorously validated to ensure applicability to new populations. (C) 2020 The Authors. Published by Elsevier Inc.

AB - Objective: We aimed to explore the added value of common machine learning (ML) algorithms for prediction of outcome for moderate and severe traumatic brain injury.Study Design and Setting: We performed logistic regression (LR), lasso regression, and ridge regression with key baseline predictors in the IMPACT-II database (15 studies, n = 11,022). ML algorithms included support vector machines, random forests, gradient boosting machines, and artificial neural networks and were trained using the same predictors. To assess generalizability of predictions, we performed internal, internal-external, and external validation on the recent CENTER-TBI study (patients with Glasgow Coma ScaleResults: In the IMPACT-II database, 3,332/11,022 (30%) died and 5,233(48%) had unfavorable outcome (Glasgow Outcome Scale less than 4). In the CENTER-TBI study, 348/1,554(29%) died and 651(54%) had unfavorable outcome. Discrimination and calibration varied widely between the studies and less so between the studied algorithms. The mean area under the curve was 0.82 for mortality and 0.77 for unfavorable outcomes in the CENTER-TBI study.Conclusion: ML algorithms may not outperform traditional regression approaches in a low-dimensional setting for outcome prediction after moderate or severe traumatic brain injury. Similar to regression-based prediction models, ML algorithms should be rigorously validated to ensure applicability to new populations. (C) 2020 The Authors. Published by Elsevier Inc.

KW - Machine learning

KW - Prognosis

KW - Traumatic brain injury

KW - Prediction

KW - Data science

KW - Cohort study

KW - BIG DATA

KW - VALIDATION

KW - PROGNOSIS

KW - TUTORIAL

KW - CARE

U2 - 10.1016/j.jclinepi.2020.03.005

DO - 10.1016/j.jclinepi.2020.03.005

M3 - Article

SN - 0895-4356

VL - 122

SP - 95

EP - 107

JO - Journal of Clinical Epidemiology

JF - Journal of Clinical Epidemiology

ER -