Collaboration between explainable artificial intelligence and pulmonologists improves the accuracy of pulmonary function test interpretation

Nilakash Das, Sofie Happaerts, Iwein Gyselinck, Michael Staes, Eric Derom, Guy Brusselle, Felip Burgos, Marco Contoli, Anh Tuan Dinh-Xuan, Frits M. E. Franssen, Sherif Gonem, Neil Greening, Christel Haenebalcke, William D-C. Man, Jorge Moises, Rudi Peche, Vitalii Poberezhets, Jennifer K. Quint, Michael C. Steiner, Eef VanderhelstMustafa Abdo, Marko Topalovic, Wim Janssens*

*Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review

Abstract

Background Few studies have investigated the collaborative potential between artificial intelligence (AI) and pulmonologists for diagnosing pulmonary disease. We hypothesised that the collaboration between pulmonologist and AI with explanations (explainable AI (XAI)) is superior in diagnostic interpretation of pulmonary function tests (PFTs) than the pulmonologist without support. Methods The study was conducted in two phases, a monocentre study (phase 1) and a multicentre intervention study (phase 2). Each phase utilised two different sets of 24 PFT reports of patients with clinically validated gold standard diagnosis. Each PFT was interpreted without (control) and with XAI's suggestions (intervention). Pulmonologists provided a differential diagnosis consisting of a preferential diagnosis and optionally up to three additional diagnoses. The primary end-point compared accuracy of preferential and additional diagnoses between control and intervention. Secondary end-points were the number of diagnoses in differential diagnosis, diagnostic confidence and inter-rater agreement. We also analysed how XAI influenced pulmonologists' decisions. Results In phase 1 (n=16 pulmonologists), mean preferential and differential diagnostic accuracy significantly increased by 10.4% and 9.4%, respectively, between control and intervention (p<0.001). Improvements were somewhat lower but highly significant (p<0.0001) in phase 2 (5.4% and 8.7%, respectively; n=62 pulmonologists). In both phases, the number of diagnoses in the differential diagnosis did not reduce, but diagnostic confidence and inter-rater agreement significantly increased during intervention. Pulmonologists updated their decisions with XAI's feedback and consistently improved their baseline performance if AI provided correct predictions. Conclusion A collaboration between a pulmonologist and XAI is better at interpreting PFTs than individual pulmonologists reading without XAI support or XAI alone.
Original languageEnglish
Article number2201720
Number of pages10
JournalEuropean Respiratory Journal
Volume61
Issue number5
DOIs
Publication statusPublished - 1 May 2023

Keywords

  • BLACK-BOX
  • DIAGNOSIS

Cite this