TY - JOUR
T1 - Post-processing steps improve generalisability and robustness of an MRI-based radiogenomic model for human papillomavirus status prediction in oropharyngeal cancer
AU - Ahmadian, Milad
AU - Bodalal, Zuhir
AU - Bos, Paula
AU - Martens, Roland M.
AU - Agrotis, Georgios
AU - van der Hulst, Hedda J.
AU - Vens, Conchita
AU - Karssemakers, Luc
AU - Al-Mamgani, Abrahim
AU - de Graaf, Pim
AU - Jasperse, Bas
AU - Brakenhoff, Ruud H.
AU - Leemans, C. Rene
AU - Beets-Tan, Regina G. H.
AU - Castelijns, Jonas A.
AU - van den Brekel, Michiel W. M.
PY - 2025
Y1 - 2025
N2 - Objectives To assess the impact of image post-processing steps on the generalisability of MRI-based radiogenomic models. Using a human papillomavirus (HPV) status in oropharyngeal squamous cell carcinoma (OPSCC) prediction model, this study examines the potential of different post-processing strategies to increase its generalisability across data from different centres and image acquisition protocols. Materials and methods Contrast-enhanced T1-weighted MR images of OPSCC patients of two cohorts from different centres, with confirmed HPV status, were manually segmented. After radiomic feature extraction, the HPV prediction model trained on a training set with 91 patients was subsequently tested on two independent cohorts: a test set with 62 patients and an externally derived cohort of 157 patients. The data processing options included: data harmonisation, a process to ensure consistency in data from different centres; exclusion of unstable features across different segmentations and scan protocols; and removal of highly correlated features to reduce redundancy. Results The predictive model, trained without post-processing, showed high performance on the test set, with an AUC of 0.79 (95% CI: 0.66-0.90, p < 0.001). However, when tested on the external data, the model performed less well, resulting in an AUC of 0.52 (95% CI: 0.45-0.58, p = 0.334). The model's generalisability substantially improved after performing post-processing steps. The AUC for the test set reached 0.76 (95% CI: 0.63-0.87, p < 0.001), while for the external cohort, the predictive model achieved an AUC of 0.73 (95% CI: 0.64-0.81, p < 0.001). Conclusions When applied before model development, post-processing steps can enhance the robustness and generalisability of predictive radiogenomics models.
AB - Objectives To assess the impact of image post-processing steps on the generalisability of MRI-based radiogenomic models. Using a human papillomavirus (HPV) status in oropharyngeal squamous cell carcinoma (OPSCC) prediction model, this study examines the potential of different post-processing strategies to increase its generalisability across data from different centres and image acquisition protocols. Materials and methods Contrast-enhanced T1-weighted MR images of OPSCC patients of two cohorts from different centres, with confirmed HPV status, were manually segmented. After radiomic feature extraction, the HPV prediction model trained on a training set with 91 patients was subsequently tested on two independent cohorts: a test set with 62 patients and an externally derived cohort of 157 patients. The data processing options included: data harmonisation, a process to ensure consistency in data from different centres; exclusion of unstable features across different segmentations and scan protocols; and removal of highly correlated features to reduce redundancy. Results The predictive model, trained without post-processing, showed high performance on the test set, with an AUC of 0.79 (95% CI: 0.66-0.90, p < 0.001). However, when tested on the external data, the model performed less well, resulting in an AUC of 0.52 (95% CI: 0.45-0.58, p = 0.334). The model's generalisability substantially improved after performing post-processing steps. The AUC for the test set reached 0.76 (95% CI: 0.63-0.87, p < 0.001), while for the external cohort, the predictive model achieved an AUC of 0.73 (95% CI: 0.64-0.81, p < 0.001). Conclusions When applied before model development, post-processing steps can enhance the robustness and generalisability of predictive radiogenomics models.
KW - Radiomics
KW - Imaging genomics
KW - Machine learning
KW - Magnetic resonance imaging
KW - Human papillomavirus
KW - CORTICAL THICKNESS
KW - RADIOMIC FEATURES
KW - REPRODUCIBILITY
KW - RELIABILITY
KW - CARCINOMA
KW - STABILITY
U2 - 10.1007/s00330-025-11709-8
DO - 10.1007/s00330-025-11709-8
M3 - Article
SN - 0938-7994
JO - European Radiology
JF - European Radiology
ER -