A comparison of machine learning models for predicting urinary incontinence in men with localized prostate cancer

H. Hasannejadasl; B. Osong; I. Bermejo; H. van der Poel; B. Vanneste; J. van Roermund; K. Aben; Z. Zhang; L. Kiemeney; I. Van Oort; R. Verwey; L. Hochstenbach; E. Bloemen; A. Dekker; R.R.R. Fijten

doi:10.3389/fonc.2023.1168219

A comparison of machine learning models for predicting urinary incontinence in men with localized prostate cancer

H. Hasannejadasl, B. Osong, I. Bermejo, H. van der Poel, B. Vanneste, J. van Roermund, K. Aben, Z. Zhang, L. Kiemeney, I. Van Oort, R. Verwey, L. Hochstenbach, E. Bloemen, A. Dekker, R.R.R. Fijten^*

^*Corresponding author for this work

Research output: Contribution to journal › Article › Academic › peer-review

Abstract

IntroductionUrinary incontinence (UI) is a common side effect of prostate cancer treatment, but in clinical practice, it is difficult to predict. Machine learning (ML) models have shown promising results in predicting outcomes, yet the lack of transparency in complex models known as "black-box" has made clinicians wary of relying on them in sensitive decisions. Therefore, finding a balance between accuracy and explainability is crucial for the implementation of ML models. The aim of this study was to employ three different ML classifiers to predict the probability of experiencing UI in men with localized prostate cancer 1-year and 2-year after treatment and compare their accuracy and explainability. MethodsWe used the ProZIB dataset from the Netherlands Comprehensive Cancer Organization (Integraal Kankercentrum Nederland; IKNL) which contained clinical, demographic, and PROM data of 964 patients from 65 Dutch hospitals. Logistic Regression (LR), Random Forest (RF), and Support Vector Machine (SVM) algorithms were applied to predict (in)continence after prostate cancer treatment. ResultsAll models have been externally validated according to the TRIPOD Type 3 guidelines and their performance was assessed by accuracy, sensitivity, specificity, and AUC. While all three models demonstrated similar performance, LR showed slightly better accuracy than RF and SVM in predicting the risk of UI one year after prostate cancer treatment, achieving an accuracy of 0.75, a sensitivity of 0.82, and an AUC of 0.79. All models for the 2-year outcome performed poorly in the validation set, with an accuracy of 0.6 for LR, 0.65 for RF, and 0.54 for SVM. ConclusionThe outcomes of our study demonstrate the promise of using non-black box models, such as LR, to assist clinicians in recognizing high-risk patients and making informed treatment choices. The coefficients of the LR model show the importance of each feature in predicting results, and the generated nomogram provides an accessible illustration of how each feature impacts the predicted outcome. Additionally, the model's simplicity and interpretability make it a more appropriate option in scenarios where comprehending the model's predictions is essential.

Original language	English
Article number	1168219
Number of pages	9
Journal	Frontiers in Oncology
Volume	13
Issue number	1
DOIs	https://doi.org/10.3389/fonc.2023.1168219
Publication status	Published - 12 Apr 2023

Keywords

prostate cancer
personalized medicine
machine learning (ML)
PROMs = patient-reported outcome measures
urinary in continence
prediction modeling
shared decision making
QUALITY-OF-LIFE
RADIATION
OUTCOMES

Access to Document

10.3389/fonc.2023.1168219Licence: CC BY

Cite this

Hasannejadasl, H., Osong, B., Bermejo, I., van der Poel, H., Vanneste, B., van Roermund, J., Aben, K., Zhang, Z., Kiemeney, L., Van Oort, I., Verwey, R., Hochstenbach, L., Bloemen, E., Dekker, A., & Fijten, R. R. R. (2023). A comparison of machine learning models for predicting urinary incontinence in men with localized prostate cancer. Frontiers in Oncology, 13(1), Article 1168219. https://doi.org/10.3389/fonc.2023.1168219

@article{3e12db5a35ea4d98b18e06fd7ccea1f6,

title = "A comparison of machine learning models for predicting urinary incontinence in men with localized prostate cancer",

abstract = "IntroductionUrinary incontinence (UI) is a common side effect of prostate cancer treatment, but in clinical practice, it is difficult to predict. Machine learning (ML) models have shown promising results in predicting outcomes, yet the lack of transparency in complex models known as {"}black-box{"} has made clinicians wary of relying on them in sensitive decisions. Therefore, finding a balance between accuracy and explainability is crucial for the implementation of ML models. The aim of this study was to employ three different ML classifiers to predict the probability of experiencing UI in men with localized prostate cancer 1-year and 2-year after treatment and compare their accuracy and explainability. MethodsWe used the ProZIB dataset from the Netherlands Comprehensive Cancer Organization (Integraal Kankercentrum Nederland; IKNL) which contained clinical, demographic, and PROM data of 964 patients from 65 Dutch hospitals. Logistic Regression (LR), Random Forest (RF), and Support Vector Machine (SVM) algorithms were applied to predict (in)continence after prostate cancer treatment. ResultsAll models have been externally validated according to the TRIPOD Type 3 guidelines and their performance was assessed by accuracy, sensitivity, specificity, and AUC. While all three models demonstrated similar performance, LR showed slightly better accuracy than RF and SVM in predicting the risk of UI one year after prostate cancer treatment, achieving an accuracy of 0.75, a sensitivity of 0.82, and an AUC of 0.79. All models for the 2-year outcome performed poorly in the validation set, with an accuracy of 0.6 for LR, 0.65 for RF, and 0.54 for SVM. ConclusionThe outcomes of our study demonstrate the promise of using non-black box models, such as LR, to assist clinicians in recognizing high-risk patients and making informed treatment choices. The coefficients of the LR model show the importance of each feature in predicting results, and the generated nomogram provides an accessible illustration of how each feature impacts the predicted outcome. Additionally, the model's simplicity and interpretability make it a more appropriate option in scenarios where comprehending the model's predictions is essential.",

keywords = "prostate cancer, personalized medicine, machine learning (ML), PROMs = patient-reported outcome measures, urinary in continence, prediction modeling, shared decision making, QUALITY-OF-LIFE, RADIATION, OUTCOMES",

author = "H. Hasannejadasl and B. Osong and I. Bermejo and {van der Poel}, H. and B. Vanneste and {van Roermund}, J. and K. Aben and Z. Zhang and L. Kiemeney and {Van Oort}, I. and R. Verwey and L. Hochstenbach and E. Bloemen and A. Dekker and R.R.R. Fijten",

note = "Copyright {\textcopyright} 2023 Hasannejadasl, Osong, Bermejo, van der Poel, Vanneste, van Roermund, Aben, Zhang, Kiemeney, Van Oort, Verwey, Hochstenbach, Bloemen, Dekker and Fijten.",

year = "2023",

month = apr,

day = "12",

doi = "10.3389/fonc.2023.1168219",

language = "English",

volume = "13",

journal = "Frontiers in Oncology",

issn = "2234-943X",

publisher = "Frontiers Media S.A.",

number = "1",

}

Hasannejadasl, H , Osong, B , Bermejo, I, van der Poel, H, Vanneste, B, van Roermund, J, Aben, K, Zhang, Z, Kiemeney, L, Van Oort, I, Verwey, R, Hochstenbach, L, Bloemen, E, Dekker, A & Fijten, RRR 2023, 'A comparison of machine learning models for predicting urinary incontinence in men with localized prostate cancer', Frontiers in Oncology, vol. 13, no. 1, 1168219. https://doi.org/10.3389/fonc.2023.1168219

TY - JOUR

T1 - A comparison of machine learning models for predicting urinary incontinence in men with localized prostate cancer

AU - Hasannejadasl, H.

AU - Osong, B.

AU - Bermejo, I.

AU - van der Poel, H.

AU - Vanneste, B.

AU - van Roermund, J.

AU - Aben, K.

AU - Zhang, Z.

AU - Kiemeney, L.

AU - Van Oort, I.

AU - Verwey, R.

AU - Hochstenbach, L.

AU - Bloemen, E.

AU - Dekker, A.

AU - Fijten, R.R.R.

PY - 2023/4/12

Y1 - 2023/4/12

N2 - IntroductionUrinary incontinence (UI) is a common side effect of prostate cancer treatment, but in clinical practice, it is difficult to predict. Machine learning (ML) models have shown promising results in predicting outcomes, yet the lack of transparency in complex models known as "black-box" has made clinicians wary of relying on them in sensitive decisions. Therefore, finding a balance between accuracy and explainability is crucial for the implementation of ML models. The aim of this study was to employ three different ML classifiers to predict the probability of experiencing UI in men with localized prostate cancer 1-year and 2-year after treatment and compare their accuracy and explainability. MethodsWe used the ProZIB dataset from the Netherlands Comprehensive Cancer Organization (Integraal Kankercentrum Nederland; IKNL) which contained clinical, demographic, and PROM data of 964 patients from 65 Dutch hospitals. Logistic Regression (LR), Random Forest (RF), and Support Vector Machine (SVM) algorithms were applied to predict (in)continence after prostate cancer treatment. ResultsAll models have been externally validated according to the TRIPOD Type 3 guidelines and their performance was assessed by accuracy, sensitivity, specificity, and AUC. While all three models demonstrated similar performance, LR showed slightly better accuracy than RF and SVM in predicting the risk of UI one year after prostate cancer treatment, achieving an accuracy of 0.75, a sensitivity of 0.82, and an AUC of 0.79. All models for the 2-year outcome performed poorly in the validation set, with an accuracy of 0.6 for LR, 0.65 for RF, and 0.54 for SVM. ConclusionThe outcomes of our study demonstrate the promise of using non-black box models, such as LR, to assist clinicians in recognizing high-risk patients and making informed treatment choices. The coefficients of the LR model show the importance of each feature in predicting results, and the generated nomogram provides an accessible illustration of how each feature impacts the predicted outcome. Additionally, the model's simplicity and interpretability make it a more appropriate option in scenarios where comprehending the model's predictions is essential.

AB - IntroductionUrinary incontinence (UI) is a common side effect of prostate cancer treatment, but in clinical practice, it is difficult to predict. Machine learning (ML) models have shown promising results in predicting outcomes, yet the lack of transparency in complex models known as "black-box" has made clinicians wary of relying on them in sensitive decisions. Therefore, finding a balance between accuracy and explainability is crucial for the implementation of ML models. The aim of this study was to employ three different ML classifiers to predict the probability of experiencing UI in men with localized prostate cancer 1-year and 2-year after treatment and compare their accuracy and explainability. MethodsWe used the ProZIB dataset from the Netherlands Comprehensive Cancer Organization (Integraal Kankercentrum Nederland; IKNL) which contained clinical, demographic, and PROM data of 964 patients from 65 Dutch hospitals. Logistic Regression (LR), Random Forest (RF), and Support Vector Machine (SVM) algorithms were applied to predict (in)continence after prostate cancer treatment. ResultsAll models have been externally validated according to the TRIPOD Type 3 guidelines and their performance was assessed by accuracy, sensitivity, specificity, and AUC. While all three models demonstrated similar performance, LR showed slightly better accuracy than RF and SVM in predicting the risk of UI one year after prostate cancer treatment, achieving an accuracy of 0.75, a sensitivity of 0.82, and an AUC of 0.79. All models for the 2-year outcome performed poorly in the validation set, with an accuracy of 0.6 for LR, 0.65 for RF, and 0.54 for SVM. ConclusionThe outcomes of our study demonstrate the promise of using non-black box models, such as LR, to assist clinicians in recognizing high-risk patients and making informed treatment choices. The coefficients of the LR model show the importance of each feature in predicting results, and the generated nomogram provides an accessible illustration of how each feature impacts the predicted outcome. Additionally, the model's simplicity and interpretability make it a more appropriate option in scenarios where comprehending the model's predictions is essential.

KW - prostate cancer

KW - personalized medicine

KW - machine learning (ML)

KW - PROMs = patient-reported outcome measures

KW - urinary in continence

KW - prediction modeling

KW - shared decision making

KW - QUALITY-OF-LIFE

KW - RADIATION

KW - OUTCOMES

U2 - 10.3389/fonc.2023.1168219

DO - 10.3389/fonc.2023.1168219

M3 - Article

C2 - 37124522

SN - 2234-943X

VL - 13

JO - Frontiers in Oncology

JF - Frontiers in Oncology

IS - 1

M1 - 1168219

ER -