TY - JOUR
T1 - COVID-BLUeS - A Prospective Study on the Value of AI in Lung Ultrasound Analysis
AU - Wiedemann, Nina
AU - Korte-De Boer, Dianne de
AU - Richter, Matthias
AU - Weijer, Sjors van de
AU - Buhre, Charlotte
AU - Eggert, Franz A.M.
AU - Aarnoudse, Sophie
AU - Grevendonk, Lotte
AU - Röber, Steffen
AU - Remie, Carlijn M.E.
AU - Buhre, Wolfgang
AU - Henry, Ronald
AU - Born, Jannis
N1 - Publisher Copyright:
© 2013 IEEE.
PY - 2025/1/1
Y1 - 2025/1/1
N2 - As a lightweight and non-invasive imaging technique, lung ultrasound (LUS) has gained importance for assessing lung pathologies. The use of Artificial intelligence (AI) in medical decision support systems is promising due to the time- and expertise-intensive interpretation, however, due to the poor quality of existing data used for training AI models, their usability for real-world applications remains unclear. Methods: In a prospective study, we analyze data from 63 COVID-19 suspects (33 positive) collected at Maastricht University Medical Centre. Ultrasound recordings at six body locations were acquired following the BLUE protocol and manually labeled for severity of lung involvement. Anamnesis and complete blood count (CBC) analyses were conducted. Several AI models were applied and trained for detection and severity of pulmonary infection. Results: The severity of the lung infection, as assigned by human annotators based on the LUS videos, is not significantly different between COVID-19 positive and negative patients (p = 0.89). Nevertheless, the predictions of image-based AI models identify a COVID-19 infection with 65% accuracy when applied zero-shot (i.e., trained on other datasets), and up to 79% with targeted training, whereas the accuracy based on human annotations is at most 65%. Multi-modal models combining images and CBC improve significantly over image-only models. Conclusion: Although our analysis generally supports the value of AI in LUS assessment, the evaluated models fall short of the performance expected from previous work. We find this is due to 1) the heterogeneity of LUS datasets, limiting the generalization ability to new data, 2) the frame-based processing of AI models ignoring video-level information, and 3) lack of work on multimodal models that can extract the most relevant information from video-, image- and variable-based inputs.
AB - As a lightweight and non-invasive imaging technique, lung ultrasound (LUS) has gained importance for assessing lung pathologies. The use of Artificial intelligence (AI) in medical decision support systems is promising due to the time- and expertise-intensive interpretation, however, due to the poor quality of existing data used for training AI models, their usability for real-world applications remains unclear. Methods: In a prospective study, we analyze data from 63 COVID-19 suspects (33 positive) collected at Maastricht University Medical Centre. Ultrasound recordings at six body locations were acquired following the BLUE protocol and manually labeled for severity of lung involvement. Anamnesis and complete blood count (CBC) analyses were conducted. Several AI models were applied and trained for detection and severity of pulmonary infection. Results: The severity of the lung infection, as assigned by human annotators based on the LUS videos, is not significantly different between COVID-19 positive and negative patients (p = 0.89). Nevertheless, the predictions of image-based AI models identify a COVID-19 infection with 65% accuracy when applied zero-shot (i.e., trained on other datasets), and up to 79% with targeted training, whereas the accuracy based on human annotations is at most 65%. Multi-modal models combining images and CBC improve significantly over image-only models. Conclusion: Although our analysis generally supports the value of AI in LUS assessment, the evaluated models fall short of the performance expected from previous work. We find this is due to 1) the heterogeneity of LUS datasets, limiting the generalization ability to new data, 2) the frame-based processing of AI models ignoring video-level information, and 3) lack of work on multimodal models that can extract the most relevant information from video-, image- and variable-based inputs.
KW - computer vision
KW - COVID-19
KW - Lung ultrasound
U2 - 10.1109/JBHI.2025.3543686
DO - 10.1109/JBHI.2025.3543686
M3 - Article
SN - 2168-2194
JO - IEEE Journal of Biomedical and Health Informatics
JF - IEEE Journal of Biomedical and Health Informatics
ER -