TY - JOUR
T1 - Challenges in multi-task learning for fMRI-based diagnosis
T2 - Benefits for psychiatric conditions and CNVs would likely require thousands of patients
AU - Harvey, Annabelle
AU - Moreau, Clara A.
AU - Kumar, Kuldeep
AU - Huguet, Guillaume
AU - Urchs, Sebastian G.W.
AU - Sharmarke, Hanad
AU - Jizi, Khadije
AU - Martin, Charles Olivier
AU - Younis, Nadine
AU - Tamer, Petra
AU - Martineau, Jean Louis
AU - Orban, Pierre
AU - Silva, Ana Isabel
AU - Hall, Jeremy
AU - van den Bree, Marianne B.M.
AU - Owen, Michael J.
AU - Linden, David E.J.
AU - Lippé, Sarah
AU - Bearden, Carrie E.
AU - Dumas, Guillaume
AU - Jacquemont, Sébastien
AU - Bellec, Pierre
N1 - Funding Information:
Data from the UCLA cohort provided by CEB (participants with 22q11.2 deletions or duplications and control subjects) were supported through grants from the National Institutes of Health (NIH) (Grant No. U54EB020403), the National Institute of Mental Health (Grant Nos. R01MH100900, R01MH085953, U01MH119736, and R21MH116473), and the Simons Foundation (SFARI Explorer Award).
Funding Information:
This research was supported by a donation from the Courtois foundation (to P.B.), grants from the Canadian Institutes of Health Research (Canadian Consortium on Neurodegeneration in Aging, to P.B., Grant No. CIHR 400528 to S.J.), the Institute of Data Valorization (IVADO PRF3, to P.B. and S.J.), a grant from Compute Canada (scq-952 to S.J. and gsf-624 to P.B.), the Brain Canada Multi-Investigator Research Initiative (MIRI, to S.J.), Canada First Research Excellence Fund (to S.J.), and Healthy Brain, Healthy Lives (to S.J.). P.B. is a fellow (Chercheur boursier Junior 2) of the Fonds de recherche du Qu\u00E9bec-Sant\u00E9. S.J. is a recipient of a Canada Research Chair in neurodevelopmental disorders, and a chair from the Jeanne et Jean Louis Levesque Foundation.
Funding Information:
The Cardiff Copy Number Variant cohort was supported by the Wellcome Trust Strategic Award DEFINE and the National Centre for Mental Health with funds from Health and Care Research Wales (code 100202/Z/12/Z).
Funding Information:
M.J.O., J.H., and M.B.M.v.d.B. have a research grant from Takeda Pharmaceuticals outside the scope of this work. J.H. is a founding director of the company Meomics (unrelated to this work). P.B. is a consultant in fMRI processing for NeuroRX Inc., outside of the scope of this work. All other authors report no biomedical financial interests or potential conflicts of interest.
Funding Information:
Finally, data from another study were obtained through the OpenFMRI project ( http://openfmri.org ) from the Consortium for Neuropsychiatric Phenomics (CNP), which was supported by NIH Roadmap for Medical Research grants (Grant Nos. UL1-DE019580, RL1MH083268, RL1MH083269, RL1DA024853, RL1MH083270, RL1LM009833, PL1MH083271, and PL1NS062410). Finally, this work was supported by the Simons Foundation (Grant Nos. SFARI219193 and SFARI274424).
Funding Information:
C.A.M. is supported by AIMS-2-TRIALS, which received support from the Innovative Medicines Initiative 2 Joint undertaking under grant agreement (Grant No. 777394).
Publisher Copyright:
© 2024 The Authors. Published under a Creative Commons Attribution 4.0 International (CC BY 4.0) license.
PY - 2024/7/26
Y1 - 2024/7/26
N2 - There is a growing interest in using machine learning (ML) models to perform automatic diagnosis of psychiatric conditions; however, generalising the prediction of ML models to completely independent data can lead to sharp decrease in performance. Patients with different psychiatric diagnoses have traditionally been studied independently, yet there is a growing recognition of neuroimaging signatures shared across them as well as rare genetic copy number variants (CNVs). In this work, we assess the potential of multi-task learning (MTL) to improve accuracy by characterising multiple related conditions with a single model, making use of information shared across diagnostic categories and exposing the model to a larger and more diverse dataset. As a proof of concept, we first established the efficacy of MTL in a context where there is clearly information shared across tasks: the same target (age or sex) is predicted at different sites of data collection in a large functional magnetic resonance imaging (fMRI) dataset compiled from multiple studies. MTL generally led to substantial gains relative to independent prediction at each site. Performing scaling experiments on the UK Biobank, we observed that performance was highly dependent on sample size: for large sample sizes (N > 6000) sex prediction was better using MTL across three sites (N = K per site) than prediction at a single site (N = 3K), but for small samples (N < 500) MTL was actually detrimental for age prediction. We then used established machine-learning methods to benchmark the diagnostic accuracy of each of the 7 CNVs (N = 19–103) and 4 psychiatric conditions (N = 44–472) independently, replicating the accuracy previously reported in the literature on psychiatric conditions. We observed that MTL hurt performance when applied across the full set of diagnoses, and complementary analyses failed to identify pairs of conditions which would benefit from MTL. Taken together, our results show that if a successful multi-task diagnostic model of psychiatric conditions were to be developed with resting-state fMRI, it would likely require datasets with thousands of patients across different diagnoses.
AB - There is a growing interest in using machine learning (ML) models to perform automatic diagnosis of psychiatric conditions; however, generalising the prediction of ML models to completely independent data can lead to sharp decrease in performance. Patients with different psychiatric diagnoses have traditionally been studied independently, yet there is a growing recognition of neuroimaging signatures shared across them as well as rare genetic copy number variants (CNVs). In this work, we assess the potential of multi-task learning (MTL) to improve accuracy by characterising multiple related conditions with a single model, making use of information shared across diagnostic categories and exposing the model to a larger and more diverse dataset. As a proof of concept, we first established the efficacy of MTL in a context where there is clearly information shared across tasks: the same target (age or sex) is predicted at different sites of data collection in a large functional magnetic resonance imaging (fMRI) dataset compiled from multiple studies. MTL generally led to substantial gains relative to independent prediction at each site. Performing scaling experiments on the UK Biobank, we observed that performance was highly dependent on sample size: for large sample sizes (N > 6000) sex prediction was better using MTL across three sites (N = K per site) than prediction at a single site (N = 3K), but for small samples (N < 500) MTL was actually detrimental for age prediction. We then used established machine-learning methods to benchmark the diagnostic accuracy of each of the 7 CNVs (N = 19–103) and 4 psychiatric conditions (N = 44–472) independently, replicating the accuracy previously reported in the literature on psychiatric conditions. We observed that MTL hurt performance when applied across the full set of diagnoses, and complementary analyses failed to identify pairs of conditions which would benefit from MTL. Taken together, our results show that if a successful multi-task diagnostic model of psychiatric conditions were to be developed with resting-state fMRI, it would likely require datasets with thousands of patients across different diagnoses.
KW - CNVs
KW - fMRI
KW - machine learning
KW - multi-site data
KW - multi-task learning
KW - psychiatric conditions
U2 - 10.1162/imag_a_00222
DO - 10.1162/imag_a_00222
M3 - Article
SN - 2837-6056
VL - 2
SP - 1
EP - 20
JO - Imaging Neuroscience
JF - Imaging Neuroscience
ER -