Collaboration between explainable artificial intelligence and pulmonologists improves the accuracy of pulmonary function test interpretation

Nilakash Das; Sofie Happaerts; Iwein Gyselinck; Michael Staes; Eric Derom; Guy Brusselle; Felip Burgos; Marco Contoli; Anh Tuan Dinh-Xuan; Frits M. E. Franssen; Sherif Gonem; Neil Greening; Christel Haenebalcke; William D-C. Man; Jorge Moises; Rudi Peche; Vitalii Poberezhets; Jennifer K. Quint; Michael C. Steiner; Eef Vanderhelst; Mustafa Abdo; Marko Topalovic; Wim Janssens

doi:10.1183/13993003.01720-2022

Nilakash Das, Sofie Happaerts, Iwein Gyselinck, Michael Staes, Eric Derom, Guy Brusselle, Felip Burgos, Marco Contoli, Anh Tuan Dinh-Xuan, Frits M. E. Franssen, Sherif Gonem, Neil Greening, Christel Haenebalcke, William D-C. Man, Jorge Moises, Rudi Peche, Vitalii Poberezhets, Jennifer K. Quint, Michael C. Steiner, Eef Vanderhelst, Mustafa Abdo, Marko Topalovic, Wim Janssens^*

^*Corresponding author for this work

Research output: Contribution to journal › Article › Academic › peer-review

Abstract

Background: Few studies have investigated the collaborative potential between artificial intelligence (AI) and pulmonologists for diagnosing pulmonary disease. We hypothesised that the collaboration between a pulmonologist and AI with explanations (explainable AI (XAI)) is superior in diagnostic interpretation of pulmonary function tests (PFTs) to the pulmonologist without support.

Methods: The study was conducted in two phases, a monocentre study (phase 1) and a multicentre intervention study (phase 2). Each phase utilised two different sets of 24 PFT reports of patients with clinically validated gold standard diagnosis. Each PFT was interpreted without (control) and with XAI's suggestions (intervention). Pulmonologists provided a differential diagnosis consisting of a preferential diagnosis and optionally up to three additional diagnoses. The primary end-point compared accuracy of preferential and additional diagnoses between control and intervention. Secondary end-points were the number of diagnoses in differential diagnosis, diagnostic confidence and inter-rater agreement. We also analysed how XAI influenced pulmonologists' decisions.

Results: In phase 1 (n=16 pulmonologists), mean preferential and differential diagnostic accuracy significantly increased by 10.4% and 9.4%, respectively, between control and intervention (p<0.001). Improvements were somewhat lower but highly significant (p<0.0001) in phase 2 (5.4% and 8.7%, respectively; n=62 pulmonologists). In both phases, the number of diagnoses in the differential diagnosis did not decrease, but diagnostic confidence and inter-rater agreement significantly increased during intervention. Pulmonologists updated their decisions with XAI's feedback and consistently improved their baseline performance if AI provided correct predictions.

Conclusion: A collaboration between a pulmonologist and XAI is better at interpreting PFTs than individual pulmonologists reading without XAI support or XAI alone.

Original language	English
Article number	2201720
Number of pages	10
Journal	European Respiratory Journal
Volume	61
Issue number	5
DOIs	https://doi.org/10.1183/13993003.01720-2022
Publication status	Published - 1 May 2023

Keywords

BLACK-BOX
DIAGNOSIS

Access to Document

10.1183/13993003.01720-2022
Licence: CC BY-NC

Cite this

Das, N., Happaerts, S., Gyselinck, I., Staes, M., Derom, E., Brusselle, G., Burgos, F., Contoli, M., Dinh-Xuan, A. T., Franssen, F. M. E., Gonem, S., Greening, N., Haenebalcke, C., Man, W. D.-C., Moises, J., Peche, R., Poberezhets, V., Quint, J. K., Steiner, M. C., ... Janssens, W. (2023). Collaboration between explainable artificial intelligence and pulmonologists improves the accuracy of pulmonary function test interpretation. European Respiratory Journal, 61(5), Article 2201720. https://doi.org/10.1183/13993003.01720-2022

@article{4b2205c9c8e84b19b88c993a31953aa2,

title = "Collaboration between explainable artificial intelligence and pulmonologists improves the accuracy of pulmonary function test interpretation",

abstract = "Background Few studies have investigated the collaborative potential between artificial intelligence (AI) and pulmonologists for diagnosing pulmonary disease. We hypothesised that the collaboration between pulmonologist and AI with explanations (explainable AI (XAI)) is superior in diagnostic interpretation of pulmonary function tests (PFTs) than the pulmonologist without support. Methods The study was conducted in two phases, a monocentre study (phase 1) and a multicentre intervention study (phase 2). Each phase utilised two different sets of 24 PFT reports of patients with clinically validated gold standard diagnosis. Each PFT was interpreted without (control) and with XAI's suggestions (intervention). Pulmonologists provided a differential diagnosis consisting of a preferential diagnosis and optionally up to three additional diagnoses. The primary end-point compared accuracy of preferential and additional diagnoses between control and intervention. Secondary end-points were the number of diagnoses in differential diagnosis, diagnostic confidence and inter-rater agreement. We also analysed how XAI influenced pulmonologists' decisions. Results In phase 1 (n=16 pulmonologists), mean preferential and differential diagnostic accuracy significantly increased by 10.4% and 9.4%, respectively, between control and intervention (p<0.001). Improvements were somewhat lower but highly significant (p<0.0001) in phase 2 (5.4% and 8.7%, respectively; n=62 pulmonologists). In both phases, the number of diagnoses in the differential diagnosis did not reduce, but diagnostic confidence and inter-rater agreement significantly increased during intervention. Pulmonologists updated their decisions with XAI's feedback and consistently improved their baseline performance if AI provided correct predictions. Conclusion A collaboration between a pulmonologist and XAI is better at interpreting PFTs than individual pulmonologists reading without XAI support or XAI alone.",

keywords = "BLACK-BOX, DIAGNOSIS",

author = "Nilakash Das and Sofie Happaerts and Iwein Gyselinck and Michael Staes and Eric Derom and Guy Brusselle and Felip Burgos and Marco Contoli and Dinh-Xuan, {Anh Tuan} and Franssen, {Frits M. E.} and Sherif Gonem and Neil Greening and Christel Haenebalcke and Man, {William D-C.} and Jorge Moises and Rudi Peche and Vitalii Poberezhets and Quint, {Jennifer K.} and Steiner, {Michael C.} and Eef Vanderhelst and Mustafa Abdo and Marko Topalovic and Wim Janssens",

year = "2023",

month = may,

day = "1",

doi = "10.1183/13993003.01720-2022",

language = "English",

volume = "61",

journal = "European Respiratory Journal",

issn = "0903-1936",

publisher = "European Respiratory Society",

number = "5",

}

Das, N, Happaerts, S, Gyselinck, I, Staes, M, Derom, E, Brusselle, G, Burgos, F, Contoli, M, Dinh-Xuan, AT, Franssen, FME, Gonem, S, Greening, N, Haenebalcke, C, Man, WD-C, Moises, J, Peche, R, Poberezhets, V, Quint, JK, Steiner, MC, Vanderhelst, E, Abdo, M, Topalovic, M & Janssens, W 2023, 'Collaboration between explainable artificial intelligence and pulmonologists improves the accuracy of pulmonary function test interpretation', European Respiratory Journal, vol. 61, no. 5, 2201720. https://doi.org/10.1183/13993003.01720-2022

TY - JOUR

T1 - Collaboration between explainable artificial intelligence and pulmonologists improves the accuracy of pulmonary function test interpretation

AU - Das, Nilakash

AU - Happaerts, Sofie

AU - Gyselinck, Iwein

AU - Staes, Michael

AU - Derom, Eric

AU - Brusselle, Guy

AU - Burgos, Felip

AU - Contoli, Marco

AU - Dinh-Xuan, Anh Tuan

AU - Franssen, Frits M. E.

AU - Gonem, Sherif

AU - Greening, Neil

AU - Haenebalcke, Christel

AU - Man, William D-C.

AU - Moises, Jorge

AU - Peche, Rudi

AU - Poberezhets, Vitalii

AU - Quint, Jennifer K.

AU - Steiner, Michael C.

AU - Vanderhelst, Eef

AU - Abdo, Mustafa

AU - Topalovic, Marko

AU - Janssens, Wim

PY - 2023/5/1

Y1 - 2023/5/1

N2 - Background Few studies have investigated the collaborative potential between artificial intelligence (AI) and pulmonologists for diagnosing pulmonary disease. We hypothesised that the collaboration between pulmonologist and AI with explanations (explainable AI (XAI)) is superior in diagnostic interpretation of pulmonary function tests (PFTs) than the pulmonologist without support. Methods The study was conducted in two phases, a monocentre study (phase 1) and a multicentre intervention study (phase 2). Each phase utilised two different sets of 24 PFT reports of patients with clinically validated gold standard diagnosis. Each PFT was interpreted without (control) and with XAI's suggestions (intervention). Pulmonologists provided a differential diagnosis consisting of a preferential diagnosis and optionally up to three additional diagnoses. The primary end-point compared accuracy of preferential and additional diagnoses between control and intervention. Secondary end-points were the number of diagnoses in differential diagnosis, diagnostic confidence and inter-rater agreement. We also analysed how XAI influenced pulmonologists' decisions. Results In phase 1 (n=16 pulmonologists), mean preferential and differential diagnostic accuracy significantly increased by 10.4% and 9.4%, respectively, between control and intervention (p<0.001). Improvements were somewhat lower but highly significant (p<0.0001) in phase 2 (5.4% and 8.7%, respectively; n=62 pulmonologists). In both phases, the number of diagnoses in the differential diagnosis did not reduce, but diagnostic confidence and inter-rater agreement significantly increased during intervention. Pulmonologists updated their decisions with XAI's feedback and consistently improved their baseline performance if AI provided correct predictions. Conclusion A collaboration between a pulmonologist and XAI is better at interpreting PFTs than individual pulmonologists reading without XAI support or XAI alone.

AB - Background Few studies have investigated the collaborative potential between artificial intelligence (AI) and pulmonologists for diagnosing pulmonary disease. We hypothesised that the collaboration between pulmonologist and AI with explanations (explainable AI (XAI)) is superior in diagnostic interpretation of pulmonary function tests (PFTs) than the pulmonologist without support. Methods The study was conducted in two phases, a monocentre study (phase 1) and a multicentre intervention study (phase 2). Each phase utilised two different sets of 24 PFT reports of patients with clinically validated gold standard diagnosis. Each PFT was interpreted without (control) and with XAI's suggestions (intervention). Pulmonologists provided a differential diagnosis consisting of a preferential diagnosis and optionally up to three additional diagnoses. The primary end-point compared accuracy of preferential and additional diagnoses between control and intervention. Secondary end-points were the number of diagnoses in differential diagnosis, diagnostic confidence and inter-rater agreement. We also analysed how XAI influenced pulmonologists' decisions. Results In phase 1 (n=16 pulmonologists), mean preferential and differential diagnostic accuracy significantly increased by 10.4% and 9.4%, respectively, between control and intervention (p<0.001). Improvements were somewhat lower but highly significant (p<0.0001) in phase 2 (5.4% and 8.7%, respectively; n=62 pulmonologists). In both phases, the number of diagnoses in the differential diagnosis did not reduce, but diagnostic confidence and inter-rater agreement significantly increased during intervention. Pulmonologists updated their decisions with XAI's feedback and consistently improved their baseline performance if AI provided correct predictions. Conclusion A collaboration between a pulmonologist and XAI is better at interpreting PFTs than individual pulmonologists reading without XAI support or XAI alone.

KW - BLACK-BOX

KW - DIAGNOSIS

U2 - 10.1183/13993003.01720-2022

DO - 10.1183/13993003.01720-2022

M3 - Article

C2 - 37080566

SN - 0903-1936

VL - 61

JO - European Respiratory Journal

JF - European Respiratory Journal

IS - 5

M1 - 2201720

ER -