Machine learning for grading and prognosis of esophageal dysplasia using mass spectrometry and histological imaging

M. Beuque*, M. Martin-Lorenzo*, B. Balluff, H.C. Woodruff, M. Lucas, D.M. de Bruin, J.E. van Timmeren, O.J. de Boer, R.M.A. Heeren, S.L. Meijer, P. Lambin

*Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review

10 Downloads (Pure)

Abstract

Background: Barrett's esophagus (BE) is a precursor lesion of esophageal adenocarcinoma and may progress from non-dysplastic through low-grade dysplasia (LGD) to high-grade dysplasia (HGD) and cancer. Grading BE is of crucial prognostic value and is currently based on the subjective evaluation of biopsies. This study aims to investigate the potential of machine learning (ML) using spatially resolved molecular data from mass spectrometry imaging (MSI) and histological data from microscopic hematoxylin and eosin (H&E)-stained imaging for computer-aided diagnosis and prognosis of BE. Methods: Biopsies from 57 patients were considered, divided into non-dysplastic (n = 15), LGD non-progressive (n = 14), LGD progressive (n = 14), and HGD (n = 14). MSI experiments were conducted at 50 x 50 mu m spatial resolution per pixel corresponding to a tile size of 96x96 pixels in the co-registered H&E images, making a total of 144,823 tiles for the whole dataset. Results: ML models were trained to distinguish epithelial tissue from stroma with area-under-the-curve (AUC) values of 0.89 (MSI) and 0.95 (H&E)) and dysplastic grade (AUC of 0.97 (MSI) and 0.85 (H&E)) on a tile level, and low-grade progressors from non-progressors on a patient level (accuracies of 0.72 (MSI) and 0.48 (H&E)). Conclusions: In summary, while the H&E-based classifier was best at distinguishing tissue types, the MSI-based model was more accurate at distinguishing dysplastic grades and patients at progression risk, which demonstrates the complementarity of both approaches. Data are available via ProteomeXchange with identifier PXD028949.
Original languageEnglish
Article number104918
Number of pages8
JournalComputers in Biology and Medicine
Volume138
DOIs
Publication statusPublished - 1 Nov 2021

Keywords

  • Deep learning
  • Machine learning
  • Mass spectrometry imaging
  • H&E stainfing
  • Barrett's esophagus
  • DIAGNOSIS
  • ADENOCARCINOMA

Cite this