A 3D transfer learning approach for identifying multiple simultaneous errors during radiotherapy

Kars van den Berg, Cecile J A Wolfs*, Frank Verhaegen

*Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review

Abstract

Objective. Deep learning models, such as convolutional neural networks (CNNs), can take full dose comparison images as input and have shown promising results for error identification during treatment. Clinically, complex scenarios should be considered, with the risk of multiple anatomical and/or mechanical errors occurring simultaneously during treatment. The purpose of this study was to evaluate the capability of CNN-based error identification in this more complex scenario. Approach. For 40 lung cancer patients, clinically realistic ranges of combinations of various treatment errors within treatment plans and/or computed tomography (CT) images were simulated. Modified CT images and treatment plans were used to predict 2580 3D dose distributions, which were compared to dose distributions without errors using various gamma analysis criteria and relative dose difference as dose comparison methods. A 3D CNN capable of multilabel classification was trained to identify treatment errors at two classification levels, using dose comparison volumes as input: Level 1 (main error type, e.g. anatomical change, mechanical error) and Level 2 (error subtype, e.g. tumor regression, patient rotation). For training the CNNs, a transfer learning approach was employed. An ensemble model was also evaluated, which consisted of three separate CNNs each taking a region of interest of the dose comparison volume as input. Model performance was evaluated by calculating sample F1-scores for training and validation sets. Main results. The model had high F1-scores for Level 1 classification, but performance for Level 2 was lower, and overfitting became more apparent. Using relative dose difference instead of gamma volumes as input improved performance for Level 2 classification, whereas using an ensemble model additionally reduced overfitting. The models obtained F1-scores of 0.86 and 0.62 on an independent test set for Level 1 and Level 2, respectively. Significance. This study shows that it is possible to identify multiple errors occurring simultaneously in 3D dose verification data.

Original languageEnglish
Article number035002
Number of pages13
JournalPhysics in Medicine and Biology
Volume69
Issue number3
Early online date13 Dec 2023
DOIs
Publication statusPublished - 7 Feb 2024

Keywords

  • artificial intelligence
  • deep learning
  • error identification
  • multilabel
  • radiotherapy
  • treatment verification

Cite this