Using 3D deep features from CT scans for cancer prognosis based on a video classification model: A multi-dataset feasibility study

Junhua Chen; Leonard Wee; Andre Dekker; Inigo Bermejo

doi:10.1002/mp.16430

Using 3D deep features from CT scans for cancer prognosis based on a video classification model: A multi-dataset feasibility study

Junhua Chen^*, Leonard Wee, Andre Dekker, Inigo Bermejo

^*Corresponding author for this work

Research output: Contribution to journal › Article › Academic › peer-review

Abstract

Background: Cancer prognosis before and after treatment is key for patient management and decision making. Handcrafted imaging biomarkers—radiomics—have shown potential in predicting prognosis. Purpose: However, given the recent progress in deep learning, it is timely and relevant to pose the question: could deep learning based 3D imaging features be used as imaging biomarkers and outperform radiomics?. Methods: Effectiveness, reproducibility in test/retest, across modalities, and correlation of deep features with clinical features such as tumor volume and TNM staging were tested in this study. Radiomics was introduced as the reference image biomarker. For deep feature extraction, we transformed the CT scans into videos, and we adopted the pre-trained Inflated 3D ConvNet (I3D) video classification network as the architecture. We used four datasets—LUNG 1 (n = 422), LUNG 4 (n = 106), OPC (n = 605), and H&N 1 (n = 89)—with 1270 samples from different centers and cancer types—lung and head and neck cancer—to test deep features’ predictiveness and two additional datasets to assess the reproducibility of deep features. Results: Support Vector Machine–Recursive Feature Elimination (SVM–RFE) selected top 100 deep features achieved a concordance index (CI) of 0.67 in survival prediction in LUNG 1, 0.87 in LUNG 4, 0.76 in OPC, and 0.87 in H&N 1, while SVM-RFE selected top 100 radiomics achieved CIs of 0.64, 0.77, 0.73, and 0.74, respectively, all statistically significant differences (p < 0.01, Wilcoxon's test). Most selected deep features are not correlated with tumor volume and TNM staging. However, full radiomics features show higher reproducibility than full deep features in a test/retest setting (0.89 vs. 0.62, concordance correlation coefficient). Conclusion: The results show that deep features can outperform radiomics while providing different views for tumor prognosis compared to tumor volume and TNM staging. However, deep features suffer from lower reproducibility than radiomic features and lack the interpretability of the latter.

Original language	English
Pages (from-to)	4220-4233
Number of pages	14
Journal	Medical Physics
Volume	50
Issue number	7
Early online date	1 Apr 2023
DOIs	https://doi.org/10.1002/mp.16430
Publication status	Published - Jul 2023

Keywords

3D deep neural network
cancer prognosis
deep features
radiomics
transfer learning
LUNG-CANCER
RADIOMICS
IMPACT
EXPRESSION
PREDICTION
SIZE

Access to Document

10.1002/mp.16430Licence: CC BY

Cite this

@article{e15f56c3b4b24eb59f42d4c35e0542f6,

title = "Using 3D deep features from CT scans for cancer prognosis based on a video classification model: A multi-dataset feasibility study",

abstract = "Background: Cancer prognosis before and after treatment is key for patient management and decision making. Handcrafted imaging biomarkers—radiomics—have shown potential in predicting prognosis. Purpose: However, given the recent progress in deep learning, it is timely and relevant to pose the question: could deep learning based 3D imaging features be used as imaging biomarkers and outperform radiomics?. Methods: Effectiveness, reproducibility in test/retest, across modalities, and correlation of deep features with clinical features such as tumor volume and TNM staging were tested in this study. Radiomics was introduced as the reference image biomarker. For deep feature extraction, we transformed the CT scans into videos, and we adopted the pre-trained Inflated 3D ConvNet (I3D) video classification network as the architecture. We used four datasets—LUNG 1 (n = 422), LUNG 4 (n = 106), OPC (n = 605), and H&N 1 (n = 89)—with 1270 samples from different centers and cancer types—lung and head and neck cancer—to test deep features{\textquoteright} predictiveness and two additional datasets to assess the reproducibility of deep features. Results: Support Vector Machine–Recursive Feature Elimination (SVM–RFE) selected top 100 deep features achieved a concordance index (CI) of 0.67 in survival prediction in LUNG 1, 0.87 in LUNG 4, 0.76 in OPC, and 0.87 in H&N 1, while SVM-RFE selected top 100 radiomics achieved CIs of 0.64, 0.77, 0.73, and 0.74, respectively, all statistically significant differences (p < 0.01, Wilcoxon's test). Most selected deep features are not correlated with tumor volume and TNM staging. However, full radiomics features show higher reproducibility than full deep features in a test/retest setting (0.89 vs. 0.62, concordance correlation coefficient). Conclusion: The results show that deep features can outperform radiomics while providing different views for tumor prognosis compared to tumor volume and TNM staging. However, deep features suffer from lower reproducibility than radiomic features and lack the interpretability of the latter.",

keywords = "3D deep neural network, cancer prognosis, deep features, radiomics, transfer learning, LUNG-CANCER, RADIOMICS, IMPACT, EXPRESSION, PREDICTION, SIZE",

author = "Junhua Chen and Leonard Wee and Andre Dekker and Inigo Bermejo",

year = "2023",

month = jul,

doi = "10.1002/mp.16430",

language = "English",

volume = "50",

pages = "4220--4233",

journal = "Medical Physics",

issn = "0094-2405",

publisher = "Wiley",

number = "7",

}

TY - JOUR

T1 - Using 3D deep features from CT scans for cancer prognosis based on a video classification model: A multi-dataset feasibility study

AU - Chen, Junhua

AU - Wee, Leonard

AU - Dekker, Andre

AU - Bermejo, Inigo

PY - 2023/7

Y1 - 2023/7

N2 - Background: Cancer prognosis before and after treatment is key for patient management and decision making. Handcrafted imaging biomarkers—radiomics—have shown potential in predicting prognosis. Purpose: However, given the recent progress in deep learning, it is timely and relevant to pose the question: could deep learning based 3D imaging features be used as imaging biomarkers and outperform radiomics?. Methods: Effectiveness, reproducibility in test/retest, across modalities, and correlation of deep features with clinical features such as tumor volume and TNM staging were tested in this study. Radiomics was introduced as the reference image biomarker. For deep feature extraction, we transformed the CT scans into videos, and we adopted the pre-trained Inflated 3D ConvNet (I3D) video classification network as the architecture. We used four datasets—LUNG 1 (n = 422), LUNG 4 (n = 106), OPC (n = 605), and H&N 1 (n = 89)—with 1270 samples from different centers and cancer types—lung and head and neck cancer—to test deep features’ predictiveness and two additional datasets to assess the reproducibility of deep features. Results: Support Vector Machine–Recursive Feature Elimination (SVM–RFE) selected top 100 deep features achieved a concordance index (CI) of 0.67 in survival prediction in LUNG 1, 0.87 in LUNG 4, 0.76 in OPC, and 0.87 in H&N 1, while SVM-RFE selected top 100 radiomics achieved CIs of 0.64, 0.77, 0.73, and 0.74, respectively, all statistically significant differences (p < 0.01, Wilcoxon's test). Most selected deep features are not correlated with tumor volume and TNM staging. However, full radiomics features show higher reproducibility than full deep features in a test/retest setting (0.89 vs. 0.62, concordance correlation coefficient). Conclusion: The results show that deep features can outperform radiomics while providing different views for tumor prognosis compared to tumor volume and TNM staging. However, deep features suffer from lower reproducibility than radiomic features and lack the interpretability of the latter.

AB - Background: Cancer prognosis before and after treatment is key for patient management and decision making. Handcrafted imaging biomarkers—radiomics—have shown potential in predicting prognosis. Purpose: However, given the recent progress in deep learning, it is timely and relevant to pose the question: could deep learning based 3D imaging features be used as imaging biomarkers and outperform radiomics?. Methods: Effectiveness, reproducibility in test/retest, across modalities, and correlation of deep features with clinical features such as tumor volume and TNM staging were tested in this study. Radiomics was introduced as the reference image biomarker. For deep feature extraction, we transformed the CT scans into videos, and we adopted the pre-trained Inflated 3D ConvNet (I3D) video classification network as the architecture. We used four datasets—LUNG 1 (n = 422), LUNG 4 (n = 106), OPC (n = 605), and H&N 1 (n = 89)—with 1270 samples from different centers and cancer types—lung and head and neck cancer—to test deep features’ predictiveness and two additional datasets to assess the reproducibility of deep features. Results: Support Vector Machine–Recursive Feature Elimination (SVM–RFE) selected top 100 deep features achieved a concordance index (CI) of 0.67 in survival prediction in LUNG 1, 0.87 in LUNG 4, 0.76 in OPC, and 0.87 in H&N 1, while SVM-RFE selected top 100 radiomics achieved CIs of 0.64, 0.77, 0.73, and 0.74, respectively, all statistically significant differences (p < 0.01, Wilcoxon's test). Most selected deep features are not correlated with tumor volume and TNM staging. However, full radiomics features show higher reproducibility than full deep features in a test/retest setting (0.89 vs. 0.62, concordance correlation coefficient). Conclusion: The results show that deep features can outperform radiomics while providing different views for tumor prognosis compared to tumor volume and TNM staging. However, deep features suffer from lower reproducibility than radiomic features and lack the interpretability of the latter.

KW - 3D deep neural network

KW - cancer prognosis

KW - deep features

KW - radiomics

KW - transfer learning

KW - LUNG-CANCER

KW - RADIOMICS

KW - IMPACT

KW - EXPRESSION

KW - PREDICTION

KW - SIZE

U2 - 10.1002/mp.16430

DO - 10.1002/mp.16430

M3 - Article

C2 - 37102270

SN - 0094-2405

VL - 50

SP - 4220

EP - 4233

JO - Medical Physics

JF - Medical Physics

IS - 7

ER -