Inter- and Intra-Observer Variability and the Effect of Experience in Cine-MRI for Adhesion Detection

B. de Wilde*, F. Joosten, W. Venderink, M.E.J. Davidse, J. Geurts, H. Kruijt, A. Vermeulen, B. Martens, M.V.P. Schyns, J.C.B.M. Huige, Myrte C. de Boer, B.A.R. Tonino, H.J.A. Zandvoort, K. Lammert, H. Parviainen, A.M. Vuorinen, S. Syvaeranta, R.R.M. Vogels, W. Prins, A. CoppolaN. Bossa, R.P.G. ten Broek, H. Huisman

*Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review

Abstract

Cine-MRI for adhesion detection is a promising novel modality that can help the large group of patients developing pain after abdominal surgery. Few studies into its diagnostic accuracy are available, and none address observer variability. This retrospective study explores the inter- and intra-observer variability, diagnostic accuracy, and the effect of experience. A total of 15 observers with a variety of experience reviewed 61 sagittal cine-MRI slices, placing box annotations with a confidence score at locations suspect for adhesions. Five observers reviewed the slices again one year later. Inter- and intra-observer variability are quantified using Fleiss' (inter) and Cohen's (intra) kappa and percentage agreement. Diagnostic accuracy is quantified with receiver operating characteristic (ROC) analysis based on a consensus standard. Inter-observer Fleiss' kappa values range from 0.04 to 0.34, showing poor to fair agreement. High general and cine-MRI experience led to significantly (p < 0.001) better agreement among observers. The intra-observer results show Cohen's kappa values between 0.37 and 0.53 for all observers, except one with a low kappa of -0.11. Group AUC scores lie between 0.66 and 0.72, with individual observers reaching 0.78. This study confirms that cine-MRI can diagnose adhesions, with respect to a radiologist consensus panel and shows that experience improves reading cine-MRI. Observers without specific experience adapt to this modality quickly after a short online tutorial. Observer agreement is fair at best and area under the receiver operating characteristic curve (AUC) scores leave room for improvement. Consistently interpreting this novel modality needs further research, for instance, by developing reporting guidelines or artificial intelligence-based methods.
Original languageEnglish
Article number55
Number of pages11
JournalJournal of Imaging
Volume9
Issue number3
DOIs
Publication statusPublished - 1 Mar 2023

Keywords

  • (MeSH)
  • tissue adhesions
  • magnetic resonance imaging
  • observer variation
  • CHRONIC ABDOMINAL-PAIN
  • INTRAABDOMINAL ADHESIONS

Cite this