Noninvasive molecular subtyping of pediatric low-grade glioma with self-supervised transfer learning

Divyanshu Tak; Zezhong Ye; Anna Zapaishchykova; Yining Zha; Aidan Boyd; Sridhar Vajapeyam; Rishi Chopra; Hasaan Hayat; Sanjay Prabhu; Kevin X Liu; Hesham Elhalawani; Ali Nabavidazeh; Ariana Familiar; Adam Resnick; Sabine Mueller; Hugo J W L Aerts; Pratiti Bandopadhayay; Keith Ligon; Daphne Haas-Kogan; Tina Poussaint; Benjamin H Kann

doi:10.1101/2023.08.04.23293673

Noninvasive molecular subtyping of pediatric low-grade glioma with self-supervised transfer learning

Divyanshu Tak, Zezhong Ye, Anna Zapaishchykova, Yining Zha, Aidan Boyd, Sridhar Vajapeyam, Rishi Chopra, Hasaan Hayat, Sanjay Prabhu, Kevin X Liu, Hesham Elhalawani, Ali Nabavidazeh, Ariana Familiar, Adam Resnick, Sabine Mueller, Hugo J W L Aerts, Pratiti Bandopadhayay, Keith Ligon, Daphne Haas-Kogan, Tina PoussaintBenjamin H Kann

Research output: Working paper / Preprint › Preprint

Abstract

PURPOSE: To develop and externally validate a scan-to-prediction deep-learning pipeline for noninvasive, MRI-based BRAF mutational status classification for pLGG. MATERIALS AND METHODS: We conducted a retrospective study of two pLGG datasets with linked genomic and diagnostic T2-weighted MRI of patients: BCH (development dataset, n=214 [60 (28%) BRAF fusion, 50 (23%) BRAF V600E, 104 (49%) wild-type), and Child Brain Tumor Network (CBTN) (external validation, n=112 [60 (53%) BRAF-Fusion, 17 (15%) BRAF-V600E, 35 (32%) wild-type]). We developed a deep learning pipeline to classify BRAF mutational status (V600E vs. fusion vs. wild-type) via a two-stage process: 1) 3D tumor segmentation and extraction of axial tumor images, and 2) slice-wise, deep learning-based classification of mutational status. We investigated knowledge-transfer and self-supervised approaches to prevent model overfitting with a primary endpoint of the area under the receiver operating characteristic curve (AUC). To enhance model interpretability, we developed a novel metric, COMDist, that quantifies the accuracy of model attention around the tumor. RESULTS: A combination of transfer learning from a pretrained medical imaging-specific network and self-supervised label cross-training (TransferX) coupled with consensus logic yielded the highest macro-average AUC (0.82 [95% CI: 0.70-0.90]) and accuracy (77%) on internal validation, with an AUC improvement of +17.7% and a COMDist improvement of +6.4% versus training from scratch. On external validation, the TransferX model yielded AUC (0.73 [95% CI 0.68-0.88]) and accuracy (75%). CONCLUSION: Transfer learning and self-supervised cross-training improved classification performance and generalizability for noninvasive pLGG mutational status prediction in a limited data scenario.

Original language	English
Publisher	MedRxiv
DOIs	https://doi.org/10.1101/2023.08.04.23293673
Publication status	Published - 18 Sept 2023

Access to Document

10.1101/2023.08.04.23293673Licence: CC BY-NC

Cite this

Tak, D., Ye, Z., Zapaishchykova, A., Zha, Y., Boyd, A., Vajapeyam, S., Chopra, R., Hayat, H., Prabhu, S., Liu, K. X., Elhalawani, H., Nabavidazeh, A., Familiar, A., Resnick, A., Mueller, S., Aerts, H. J. W. L., Bandopadhayay, P., Ligon, K., Haas-Kogan, D., ... Kann, B. H. (2023). Noninvasive molecular subtyping of pediatric low-grade glioma with self-supervised transfer learning. MedRxiv . https://doi.org/10.1101/2023.08.04.23293673

@techreport{d30bf01d0c6a4a72a1bbc8437506a0ac,

title = "Noninvasive molecular subtyping of pediatric low-grade glioma with self-supervised transfer learning",

abstract = "PURPOSE: To develop and externally validate a scan-to-prediction deep-learning pipeline for noninvasive, MRI-based BRAF mutational status classification for pLGG. MATERIALS AND METHODS: We conducted a retrospective study of two pLGG datasets with linked genomic and diagnostic T2-weighted MRI of patients: BCH (development dataset, n=214 [60 (28%) BRAF fusion, 50 (23%) BRAF V600E, 104 (49%) wild-type), and Child Brain Tumor Network (CBTN) (external validation, n=112 [60 (53%) BRAF-Fusion, 17 (15%) BRAF-V600E, 35 (32%) wild-type]). We developed a deep learning pipeline to classify BRAF mutational status (V600E vs. fusion vs. wild-type) via a two-stage process: 1) 3D tumor segmentation and extraction of axial tumor images, and 2) slice-wise, deep learning-based classification of mutational status. We investigated knowledge-transfer and self-supervised approaches to prevent model overfitting with a primary endpoint of the area under the receiver operating characteristic curve (AUC). To enhance model interpretability, we developed a novel metric, COMDist, that quantifies the accuracy of model attention around the tumor. RESULTS: A combination of transfer learning from a pretrained medical imaging-specific network and self-supervised label cross-training (TransferX) coupled with consensus logic yielded the highest macro-average AUC (0.82 [95% CI: 0.70-0.90]) and accuracy (77%) on internal validation, with an AUC improvement of +17.7% and a COMDist improvement of +6.4% versus training from scratch. On external validation, the TransferX model yielded AUC (0.73 [95% CI 0.68-0.88]) and accuracy (75%). CONCLUSION: Transfer learning and self-supervised cross-training improved classification performance and generalizability for noninvasive pLGG mutational status prediction in a limited data scenario.",

author = "Divyanshu Tak and Zezhong Ye and Anna Zapaishchykova and Yining Zha and Aidan Boyd and Sridhar Vajapeyam and Rishi Chopra and Hasaan Hayat and Sanjay Prabhu and Liu, {Kevin X} and Hesham Elhalawani and Ali Nabavidazeh and Ariana Familiar and Adam Resnick and Sabine Mueller and Aerts, {Hugo J W L} and Pratiti Bandopadhayay and Keith Ligon and Daphne Haas-Kogan and Tina Poussaint and Kann, {Benjamin H}",

year = "2023",

month = sep,

day = "18",

doi = "10.1101/2023.08.04.23293673",

language = "English",

publisher = "MedRxiv ",

address = "United States",

type = "WorkingPaper",

institution = "MedRxiv ",

}

Tak, D, Ye, Z, Zapaishchykova, A, Zha, Y, Boyd, A, Vajapeyam, S, Chopra, R, Hayat, H, Prabhu, S, Liu, KX, Elhalawani, H, Nabavidazeh, A, Familiar, A, Resnick, A, Mueller, S, Aerts, HJWL, Bandopadhayay, P, Ligon, K, Haas-Kogan, D, Poussaint, T & Kann, BH 2023 'Noninvasive molecular subtyping of pediatric low-grade glioma with self-supervised transfer learning' MedRxiv . https://doi.org/10.1101/2023.08.04.23293673

TY - UNPB

T1 - Noninvasive molecular subtyping of pediatric low-grade glioma with self-supervised transfer learning

AU - Tak, Divyanshu

AU - Ye, Zezhong

AU - Zapaishchykova, Anna

AU - Zha, Yining

AU - Boyd, Aidan

AU - Vajapeyam, Sridhar

AU - Chopra, Rishi

AU - Hayat, Hasaan

AU - Prabhu, Sanjay

AU - Liu, Kevin X

AU - Elhalawani, Hesham

AU - Nabavidazeh, Ali

AU - Familiar, Ariana

AU - Resnick, Adam

AU - Mueller, Sabine

AU - Aerts, Hugo J W L

AU - Bandopadhayay, Pratiti

AU - Ligon, Keith

AU - Haas-Kogan, Daphne

AU - Poussaint, Tina

AU - Kann, Benjamin H

PY - 2023/9/18

Y1 - 2023/9/18

N2 - PURPOSE: To develop and externally validate a scan-to-prediction deep-learning pipeline for noninvasive, MRI-based BRAF mutational status classification for pLGG. MATERIALS AND METHODS: We conducted a retrospective study of two pLGG datasets with linked genomic and diagnostic T2-weighted MRI of patients: BCH (development dataset, n=214 [60 (28%) BRAF fusion, 50 (23%) BRAF V600E, 104 (49%) wild-type), and Child Brain Tumor Network (CBTN) (external validation, n=112 [60 (53%) BRAF-Fusion, 17 (15%) BRAF-V600E, 35 (32%) wild-type]). We developed a deep learning pipeline to classify BRAF mutational status (V600E vs. fusion vs. wild-type) via a two-stage process: 1) 3D tumor segmentation and extraction of axial tumor images, and 2) slice-wise, deep learning-based classification of mutational status. We investigated knowledge-transfer and self-supervised approaches to prevent model overfitting with a primary endpoint of the area under the receiver operating characteristic curve (AUC). To enhance model interpretability, we developed a novel metric, COMDist, that quantifies the accuracy of model attention around the tumor. RESULTS: A combination of transfer learning from a pretrained medical imaging-specific network and self-supervised label cross-training (TransferX) coupled with consensus logic yielded the highest macro-average AUC (0.82 [95% CI: 0.70-0.90]) and accuracy (77%) on internal validation, with an AUC improvement of +17.7% and a COMDist improvement of +6.4% versus training from scratch. On external validation, the TransferX model yielded AUC (0.73 [95% CI 0.68-0.88]) and accuracy (75%). CONCLUSION: Transfer learning and self-supervised cross-training improved classification performance and generalizability for noninvasive pLGG mutational status prediction in a limited data scenario.

AB - PURPOSE: To develop and externally validate a scan-to-prediction deep-learning pipeline for noninvasive, MRI-based BRAF mutational status classification for pLGG. MATERIALS AND METHODS: We conducted a retrospective study of two pLGG datasets with linked genomic and diagnostic T2-weighted MRI of patients: BCH (development dataset, n=214 [60 (28%) BRAF fusion, 50 (23%) BRAF V600E, 104 (49%) wild-type), and Child Brain Tumor Network (CBTN) (external validation, n=112 [60 (53%) BRAF-Fusion, 17 (15%) BRAF-V600E, 35 (32%) wild-type]). We developed a deep learning pipeline to classify BRAF mutational status (V600E vs. fusion vs. wild-type) via a two-stage process: 1) 3D tumor segmentation and extraction of axial tumor images, and 2) slice-wise, deep learning-based classification of mutational status. We investigated knowledge-transfer and self-supervised approaches to prevent model overfitting with a primary endpoint of the area under the receiver operating characteristic curve (AUC). To enhance model interpretability, we developed a novel metric, COMDist, that quantifies the accuracy of model attention around the tumor. RESULTS: A combination of transfer learning from a pretrained medical imaging-specific network and self-supervised label cross-training (TransferX) coupled with consensus logic yielded the highest macro-average AUC (0.82 [95% CI: 0.70-0.90]) and accuracy (77%) on internal validation, with an AUC improvement of +17.7% and a COMDist improvement of +6.4% versus training from scratch. On external validation, the TransferX model yielded AUC (0.73 [95% CI 0.68-0.88]) and accuracy (75%). CONCLUSION: Transfer learning and self-supervised cross-training improved classification performance and generalizability for noninvasive pLGG mutational status prediction in a limited data scenario.

U2 - 10.1101/2023.08.04.23293673

DO - 10.1101/2023.08.04.23293673

M3 - Preprint

BT - Noninvasive molecular subtyping of pediatric low-grade glioma with self-supervised transfer learning

PB - MedRxiv

ER -