Longitudinal reproducibility of automatically segmented hippocampal subfields: A multisite European 3T study on healthy elderly

Moira Marizzoni*, Luigi Antelmi, Beatriz Bosch, David Bartres-Faz, Bernhard W. Mueller, Jens Wiltfang, Ute Fiedler, Luca Roccatagliata, Agnese Picco, Flavio Nobili, Olivier Blin, Stephanie Bombois, Renaud Lopes, Julien Sein, Jean-Philippe Ranjeva, Mira Didic, Helene Gros-Dagnac, Pierre Payoux, Giada Zoccatelli, Franco AlessandriniAlberto Beltramello, Nuria Bargallo, Antonio Ferretti, Massimo Caulo, Marco Aiello, Carlo Cavaliere, Andrea Soricelli, Nicola Salvadori, Lucilla Parnetti, Roberto Tarducci, Piero Floridi, Magda Tsolaki, Manos Constantinidis, Antonios Drevelegas, Paolo Maria Rossini, Camillo Marra, Karl-Titus Hoffmann, Tilman Hensch, Peter Schoenknecht, Joost P. Kuijer, Pieter Jelle Visser, Frederik Barkhof, Regis Bordet, Giovanni B. Frisoni, Jorge Jovicich

*Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review

Abstract

Recently, there has been an increased interest in the use of automatically segmented subfields of the human hippocampal formation derived from magnetic resonance imaging (MRI). However, little is known about the test‐retest reproducibility of such measures, particularly in the context of multisite studies. Here, we report the reproducibility of automated Freesurfer hippocampal subfields segmentations in 65 healthy elderly enrolled in a consortium of 13 3T MRI sites (five subjects per site). Participants were scanned in two sessions (test and retest) at least one week apart. Each session included two anatomical 3D T1 MRI acquisitions harmonized in the consortium. We evaluated the test‐retest reproducibility of subfields segmentation (i) to assess the effects of averaging two within‐session T1 images and (ii) to compare subfields with whole hippocampus volume and spatial reliability. We found that within‐session averaging of two T1 images significantly improved the reproducibility of all hippocampal subfields but not that of the whole hippocampus. Volumetric and spatial reproducibility across MRI sites were very good for the whole hippocampus, CA2‐3, CA4‐dentate gyrus (DG), subiculum (reproducibility error∼2% and DICE > 0.90), good for CA1 and presubiculum (reproducibility error ∼ 5% and DICE ∼ 0.90), and poorer for fimbria and hippocampal fissure (reproducibility error ∼ 15% and DICE < 0.80). Spearman's correlations confirmed that test‐retest reproducibility improved with volume size. Despite considerable differences of MRI scanner configurations, we found consistent hippocampal subfields volumes estimation. CA2‐3, CA4‐DG, and sub‐CA1 (subiculum, presubiculum, and CA1 pooled together) gave test‐retest reproducibility similar to the whole hippocampus. Our findings suggest that the larger hippocampal subfields volume may be reliable longitudinal markers in multisite studies. Hum Brain Mapp 36:3516–3527, 2015.
Original languageEnglish
Pages (from-to)3516-3527
JournalHuman Brain Mapping
Volume36
Issue number9
DOIs
Publication statusPublished - Sept 2015

Keywords

  • hippocampus
  • Freesurfer
  • within session T1 averaging
  • test-retest reproducibility

Cite this