Predicting Lung Cancer Survival Using Probabilistic Reclassification of TNM Editions With a Bayesian Network

Melle S. Sieswerda*, Inigo Bermejo, Gijs Geleijnse, Mieke J. Aarts, Valery E. P. P. Lemmens, Dirk De Ruysscher, Andre L. A. J. Dekker, Xander A. A. M. Verbeek

*Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review

Abstract

PURPOSE The TNM classification system is used for prognosis, treatment, and research. Regular updates potentially break backward compatibility. Reclassification is not always possible, is labor intensive, or requires additional data. We developed a Bayesian network (BN) for reclassifying the 5th, 6th, and 7th editions of the TNM and predicting survival for non-small-cell lung cancer (NSCLC) without training data with known classifications in multiple editions.

METHODS Data were obtained from the Netherlands Cancer Registry (n = 146,084). A BN was designed with nodes for TNM edition and survival, and a group of nodes was designed for all TNM editions, with a group for edition 7 only. Before learning conditional probabilities, priors for relations between the groups were manually specified after analysis of changes between editions. For performance evaluation only, part of the 7th edition test data were manually reclassified. Performance was evaluated using sensitivity, specificity, and accuracy. Twoyear survival was evaluated with the receiver operating characteristic area under the curve (AUC), and model calibration was visualized.

RESULTS Manual reclassification of 7th to 6th edition stage group as ground truth for testing was impossible in 5.6% of the patients. Predicting 6th edition stage grouping using 7th edition data and vice versa resulted in average accuracies, sensitivities, and specificities between 0.85 and 0.99. The AUC for 2-year survival was 0.81.

CONCLUSION We have successfully created a BN for reclassifying TNM stage grouping across TNM editions and predicting survival in NSCLC without knowing the true TNM classification in various editions in the training set. We suggest binary prediction of survival is less relevant than predicted probability and model calibration. For research, probabilities can be used for weighted reclassification. (C) 2020 by American Society of Clinical Oncology.

Original languageEnglish
Pages (from-to)436-443
Number of pages8
JournalJCO Clinical Cancer Informatics
Volume4
DOIs
Publication statusPublished - 11 May 2020

Keywords

  • SYSTEM

Cite this