Lung cancer diagnosis using deep attention-based multiple instance learning and radiomics

Junhua Chen*, Haiyan Zeng, Chong Zhang, Zhenwei Shi, Andre Dekker, Leonard Wee, Inigo Bermejo

*Corresponding author for this work

Research output: Contribution to journal › Article › Academic › peer-review

Abstract

Background: Early diagnosis of lung cancer is a key intervention for its treatment, and computer-aided diagnosis (CAD) can play a crucial role in it. Most published CAD methods perform lung cancer diagnosis by classifying each lung nodule in isolation. However, this does not reflect clinical practice, where clinicians diagnose a patient based on a set of images of nodules instead of looking at one nodule at a time. In addition, the low interpretability of the output provided by these methods is an important barrier to their adoption.

Method: In this article, we treat lung cancer diagnosis as a multiple instance learning (MIL) problem, which better reflects the diagnosis process in the clinical setting and provides higher interpretability of the output. We selected radiomics as the source of input features and deep attention-based MIL as the classification algorithm. The attention mechanism provides higher interpretability by estimating the importance of each instance in the set for the final diagnosis. To improve the model's performance on a small, imbalanced dataset, we propose a new bag simulation method for MIL.

Results and conclusion: The results show that our method can achieve a mean accuracy of 0.807 with a standard error of the mean (SEM) of 0.069, a recall of 0.870 (SEM 0.061), a positive predictive value of 0.928 (SEM 0.078), a negative predictive value of (Formula presented.) (SEM 0.155), and an area under the curve (AUC) of 0.842 (SEM 0.074), outperforming other MIL methods. Additional experiments show that the proposed oversampling strategy significantly improves the model's performance. In addition, experiments show that our method provides a good indication of the importance of each nodule in determining the diagnosis, which, combined with the well-defined radiomic features, makes the results more interpretable and acceptable for doctors and patients.
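The abstract does not specify the network architecture, so the following is only a minimal sketch of how attention-based MIL pooling is commonly formulated: each instance (here, a nodule's radiomic feature vector) receives a score, the scores are normalised with a softmax, and the bag (patient) representation is the attention-weighted sum of the instances. The function name `attention_mil_pool`, the parameter matrices `V` and `w`, the dimensions, and the random inputs are all assumptions made for illustration; they are not taken from the paper, and in a trained model `V` and `w` would be learned.

```python
import numpy as np


def attention_mil_pool(H, V, w):
    """Attention pooling over one bag (illustrative, untrained parameters).

    H: (K, d) array, one row of features per instance (e.g. per nodule).
    V: (L, d) array and w: (L,) array are hypothetical attention parameters.
    Returns the bag embedding z of shape (d,) and the attention weights a of shape (K,).
    """
    # Unnormalised attention score per instance: w^T tanh(V h_k)
    scores = np.tanh(H @ V.T) @ w
    # Softmax over the instances in the bag (shifted for numerical stability)
    scores = scores - scores.max()
    a = np.exp(scores) / np.exp(scores).sum()
    # Bag representation: attention-weighted sum of instance features
    z = a @ H
    return z, a


# Toy bag: 4 nodules, 8 features each (random stand-ins for radiomic features)
rng = np.random.default_rng(0)
K, d, L = 4, 8, 5
H = rng.normal(size=(K, d))
V = rng.normal(size=(L, d))
w = rng.normal(size=L)

z, a = attention_mil_pool(H, V, w)
print("attention weights per nodule:", np.round(a, 3))
```

Because the weights are non-negative and sum to 1 within a bag, they can be read as the relative contribution of each nodule to the patient-level representation, which is the sense in which the attention mechanism supports interpretability. A classifier applied to `z` would then produce the patient-level prediction.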

Original language: English
Pages (from-to): 3134-3143
Number of pages: 10
Journal: Medical Physics
Volume: 49
Issue number: 5
Early online date: 3 Mar 2022
Publication status: Published - May 2022

Keywords

  • ALGORITHMS
  • CLASSIFICATION
  • COMBINATION
  • COMPUTER-AIDED DIAGNOSIS
  • CT
  • FEATURES
  • FRAMEWORK
  • INFORMATION
  • PULMONARY NODULES
  • TEXTURE
  • attention mechanism
  • lung cancer diagnosis
  • multiple instance learning
  • radiomics
