Cross-cohort generalizability of deep and conventional machine learning for MRI-based diagnosis and prediction of Alzheimer's disease

E.E. Bron; S. Klein; J.M. Papma; L.C. Jiskoot; V. Venkatraghavan; J. Linders; P. Aalten; P.P. De Deyn; G.J. Biessels; J.A.H.R. Claassen; H.A.M. Middelkoop; M. Smits; W.J. Niessen; J.C. van Swieten; W.M. van der Flier; I.H.G.B. Ramakers; A. van der Lugt

doi:10.1016/j.nicl.2021.102712

Cross-cohort generalizability of deep and conventional machine learning for MRI-based diagnosis and prediction of Alzheimer's disease

E.E. Bron^*, S. Klein, J.M. Papma, L.C. Jiskoot, V. Venkatraghavan, J. Linders, P. Aalten, P.P. De Deyn, G.J. Biessels, J.A.H.R. Claassen, H.A.M. Middelkoop, M. Smits, W.J. Niessen, J.C. van Swieten, W.M. van der Flier, I.H.G.B. Ramakers, A. van der Lugt

^*Corresponding author for this work

Research output: Contribution to journal › Article › Academic › peer-review

Abstract

This work validates the generalizability of MRI-based classification of Alzheimer's disease (AD) patients and controls (CN) to an external data set and to the task of prediction of conversion to AD in individuals with mild cognitive impairment (MCI). We used a conventional support vector machine (SVM) and a deep convolutional neural network (CNN) approach based on structural MRI scans that underwent either minimal pre-processing or more extensive preprocessing into modulated gray matter (GM) maps. Classifiers were optimized and evaluated using cross validation in the Alzheimer's Disease Neuroimaging Initiative (ADNI; 334 AD, 520 CN). Trained classifiers were subsequently applied to predict conversion to AD in ADNI MCI patients (231 converters, 628 non converters) and in the independent Health-RI Parelsnoer Neurodegenerative Diseases Biobank data set. From this multi-center study representing a tertiary memory clinic population, we included 199 AD patients, 139 participants with subjective cognitive decline, 48 MCI patients converting to dementia, and 91 MCI patients who did not convert to dementia. AD-CN classification based on modulated GM maps resulted in a similar area-under-the-curve (AUC) for SVM (0.940; 95%CI: 0.924-0.955) and CNN (0.933; 95%CI: 0.918-0.948). Application to conversion prediction in MCI yielded significantly higher performance for SVM (AUC = 0.756; 95%CI: 0.720-0.788) than for CNN (AUC = 0.742; 95%CI: 0.709-0.776) (p < 0.01 for McNemar's test). In external validation, performance was slightly decreased. For AD-CN, it again gave similar AUCs for SVM (0.896; 95%CI: 0.855-0.932) and CNN (0.876; 95%CI: 0.836-0.913). For prediction in MCI, performances decreased for both SVM (AUC = 0.665; 95%CI: 0.576-0.760) and CNN (AUC = 0.702; 95%CI: 0.624-0.786). Both with SVM and CNN, classification based on modulated GM maps significantly outperformed classification based on minimally processed images (p = 0.01). Deep and conventional classifiers performed equally well for AD classification and their performance decreased only slightly when applied to the external cohort. We expect that this work on external validation contributes towards translation of machine learning to clinical practice.

Original language	English
Article number	102712
Number of pages	9
Journal	NeuroImage: Clinical
Volume	31
DOIs	https://doi.org/10.1016/j.nicl.2021.102712
Publication status	Published - 2021

Keywords

Alzheimer's disease
Support vector machine
Convolutional Neural Network
External validation
MILD COGNITIVE IMPAIRMENT
NEUROIMAGING INITIATIVE ADNI
STRUCTURAL MRI
CLASSIFICATION
DEMENTIA
MODELS
IMAGES

Access to Document

10.1016/j.nicl.2021.102712Licence: CC BY

Cite this

Bron, E. E., Klein, S., Papma, J. M., Jiskoot, L. C., Venkatraghavan, V., Linders, J., Aalten, P., De Deyn, P. P., Biessels, G. J., Claassen, J. A. H. R., Middelkoop, H. A. M., Smits, M., Niessen, W. J., van Swieten, J. C., van der Flier, W. M., Ramakers, I. H. G. B., & van der Lugt, A. (2021). Cross-cohort generalizability of deep and conventional machine learning for MRI-based diagnosis and prediction of Alzheimer's disease. NeuroImage: Clinical, 31, Article 102712. https://doi.org/10.1016/j.nicl.2021.102712

@article{051fe641587b45d78283a67f99807f3c,

title = "Cross-cohort generalizability of deep and conventional machine learning for MRI-based diagnosis and prediction of Alzheimer's disease",

abstract = "This work validates the generalizability of MRI-based classification of Alzheimer's disease (AD) patients and controls (CN) to an external data set and to the task of prediction of conversion to AD in individuals with mild cognitive impairment (MCI). We used a conventional support vector machine (SVM) and a deep convolutional neural network (CNN) approach based on structural MRI scans that underwent either minimal pre-processing or more extensive preprocessing into modulated gray matter (GM) maps. Classifiers were optimized and evaluated using cross validation in the Alzheimer's Disease Neuroimaging Initiative (ADNI; 334 AD, 520 CN). Trained classifiers were subsequently applied to predict conversion to AD in ADNI MCI patients (231 converters, 628 non converters) and in the independent Health-RI Parelsnoer Neurodegenerative Diseases Biobank data set. From this multi-center study representing a tertiary memory clinic population, we included 199 AD patients, 139 participants with subjective cognitive decline, 48 MCI patients converting to dementia, and 91 MCI patients who did not convert to dementia. AD-CN classification based on modulated GM maps resulted in a similar area-under-the-curve (AUC) for SVM (0.940; 95%CI: 0.924-0.955) and CNN (0.933; 95%CI: 0.918-0.948). Application to conversion prediction in MCI yielded significantly higher performance for SVM (AUC = 0.756; 95%CI: 0.720-0.788) than for CNN (AUC = 0.742; 95%CI: 0.709-0.776) (p < 0.01 for McNemar's test). In external validation, performance was slightly decreased. For AD-CN, it again gave similar AUCs for SVM (0.896; 95%CI: 0.855-0.932) and CNN (0.876; 95%CI: 0.836-0.913). For prediction in MCI, performances decreased for both SVM (AUC = 0.665; 95%CI: 0.576-0.760) and CNN (AUC = 0.702; 95%CI: 0.624-0.786). Both with SVM and CNN, classification based on modulated GM maps significantly outperformed classification based on minimally processed images (p = 0.01). Deep and conventional classifiers performed equally well for AD classification and their performance decreased only slightly when applied to the external cohort. We expect that this work on external validation contributes towards translation of machine learning to clinical practice.",

keywords = "Alzheimer's disease, Support vector machine, Convolutional Neural Network, External validation, MILD COGNITIVE IMPAIRMENT, NEUROIMAGING INITIATIVE ADNI, STRUCTURAL MRI, CLASSIFICATION, DEMENTIA, MODELS, IMAGES",

author = "E.E. Bron and S. Klein and J.M. Papma and L.C. Jiskoot and V. Venkatraghavan and J. Linders and P. Aalten and {De Deyn}, P.P. and G.J. Biessels and J.A.H.R. Claassen and H.A.M. Middelkoop and M. Smits and W.J. Niessen and {van Swieten}, J.C. and {van der Flier}, W.M. and I.H.G.B. Ramakers and {van der Lugt}, A.",

year = "2021",

doi = "10.1016/j.nicl.2021.102712",

language = "English",

volume = "31",

journal = "NeuroImage: Clinical",

issn = "2213-1582",

publisher = "ELSEVIER SCI LTD",

}

Bron, EE, Klein, S, Papma, JM, Jiskoot, LC, Venkatraghavan, V, Linders, J, Aalten, P, De Deyn, PP, Biessels, GJ, Claassen, JAHR, Middelkoop, HAM, Smits, M, Niessen, WJ, van Swieten, JC, van der Flier, WM, Ramakers, IHGB & van der Lugt, A 2021, 'Cross-cohort generalizability of deep and conventional machine learning for MRI-based diagnosis and prediction of Alzheimer's disease', NeuroImage: Clinical, vol. 31, 102712. https://doi.org/10.1016/j.nicl.2021.102712

TY - JOUR

T1 - Cross-cohort generalizability of deep and conventional machine learning for MRI-based diagnosis and prediction of Alzheimer's disease

AU - Bron, E.E.

AU - Klein, S.

AU - Papma, J.M.

AU - Jiskoot, L.C.

AU - Venkatraghavan, V.

AU - Linders, J.

AU - Aalten, P.

AU - De Deyn, P.P.

AU - Biessels, G.J.

AU - Claassen, J.A.H.R.

AU - Middelkoop, H.A.M.

AU - Smits, M.

AU - Niessen, W.J.

AU - van Swieten, J.C.

AU - van der Flier, W.M.

AU - Ramakers, I.H.G.B.

AU - van der Lugt, A.

PY - 2021

Y1 - 2021

N2 - This work validates the generalizability of MRI-based classification of Alzheimer's disease (AD) patients and controls (CN) to an external data set and to the task of prediction of conversion to AD in individuals with mild cognitive impairment (MCI). We used a conventional support vector machine (SVM) and a deep convolutional neural network (CNN) approach based on structural MRI scans that underwent either minimal pre-processing or more extensive preprocessing into modulated gray matter (GM) maps. Classifiers were optimized and evaluated using cross validation in the Alzheimer's Disease Neuroimaging Initiative (ADNI; 334 AD, 520 CN). Trained classifiers were subsequently applied to predict conversion to AD in ADNI MCI patients (231 converters, 628 non converters) and in the independent Health-RI Parelsnoer Neurodegenerative Diseases Biobank data set. From this multi-center study representing a tertiary memory clinic population, we included 199 AD patients, 139 participants with subjective cognitive decline, 48 MCI patients converting to dementia, and 91 MCI patients who did not convert to dementia. AD-CN classification based on modulated GM maps resulted in a similar area-under-the-curve (AUC) for SVM (0.940; 95%CI: 0.924-0.955) and CNN (0.933; 95%CI: 0.918-0.948). Application to conversion prediction in MCI yielded significantly higher performance for SVM (AUC = 0.756; 95%CI: 0.720-0.788) than for CNN (AUC = 0.742; 95%CI: 0.709-0.776) (p < 0.01 for McNemar's test). In external validation, performance was slightly decreased. For AD-CN, it again gave similar AUCs for SVM (0.896; 95%CI: 0.855-0.932) and CNN (0.876; 95%CI: 0.836-0.913). For prediction in MCI, performances decreased for both SVM (AUC = 0.665; 95%CI: 0.576-0.760) and CNN (AUC = 0.702; 95%CI: 0.624-0.786). Both with SVM and CNN, classification based on modulated GM maps significantly outperformed classification based on minimally processed images (p = 0.01). Deep and conventional classifiers performed equally well for AD classification and their performance decreased only slightly when applied to the external cohort. We expect that this work on external validation contributes towards translation of machine learning to clinical practice.

AB - This work validates the generalizability of MRI-based classification of Alzheimer's disease (AD) patients and controls (CN) to an external data set and to the task of prediction of conversion to AD in individuals with mild cognitive impairment (MCI). We used a conventional support vector machine (SVM) and a deep convolutional neural network (CNN) approach based on structural MRI scans that underwent either minimal pre-processing or more extensive preprocessing into modulated gray matter (GM) maps. Classifiers were optimized and evaluated using cross validation in the Alzheimer's Disease Neuroimaging Initiative (ADNI; 334 AD, 520 CN). Trained classifiers were subsequently applied to predict conversion to AD in ADNI MCI patients (231 converters, 628 non converters) and in the independent Health-RI Parelsnoer Neurodegenerative Diseases Biobank data set. From this multi-center study representing a tertiary memory clinic population, we included 199 AD patients, 139 participants with subjective cognitive decline, 48 MCI patients converting to dementia, and 91 MCI patients who did not convert to dementia. AD-CN classification based on modulated GM maps resulted in a similar area-under-the-curve (AUC) for SVM (0.940; 95%CI: 0.924-0.955) and CNN (0.933; 95%CI: 0.918-0.948). Application to conversion prediction in MCI yielded significantly higher performance for SVM (AUC = 0.756; 95%CI: 0.720-0.788) than for CNN (AUC = 0.742; 95%CI: 0.709-0.776) (p < 0.01 for McNemar's test). In external validation, performance was slightly decreased. For AD-CN, it again gave similar AUCs for SVM (0.896; 95%CI: 0.855-0.932) and CNN (0.876; 95%CI: 0.836-0.913). For prediction in MCI, performances decreased for both SVM (AUC = 0.665; 95%CI: 0.576-0.760) and CNN (AUC = 0.702; 95%CI: 0.624-0.786). Both with SVM and CNN, classification based on modulated GM maps significantly outperformed classification based on minimally processed images (p = 0.01). Deep and conventional classifiers performed equally well for AD classification and their performance decreased only slightly when applied to the external cohort. We expect that this work on external validation contributes towards translation of machine learning to clinical practice.

KW - Alzheimer's disease

KW - Support vector machine

KW - Convolutional Neural Network

KW - External validation

KW - MILD COGNITIVE IMPAIRMENT

KW - NEUROIMAGING INITIATIVE ADNI

KW - STRUCTURAL MRI

KW - CLASSIFICATION

KW - DEMENTIA

KW - MODELS

KW - IMAGES

U2 - 10.1016/j.nicl.2021.102712

DO - 10.1016/j.nicl.2021.102712

M3 - Article

C2 - 34118592

SN - 2213-1582

VL - 31

JO - NeuroImage: Clinical

JF - NeuroImage: Clinical

M1 - 102712

ER -