Chatbots Vs. Human Experts: Evaluating Diagnostic Performance of Chatbots in Uveitis and the Perspectives on AI Adoption in Ophthalmology

William Rojas-Carabali, Alok Sen, Aniruddha Agarwal, Gavin Tan, Carol Y Cheung, Andres Rousselot, Rajdeep Agrawal, Renee Liu, Carlos Cifuentes-González, Tobias Elze, John H Kempen, Lucia Sobrin, Quan Dong Nguyen, Alejandra de-la-Torre, Bernett Lee, Vishali Gupta, Rupesh Agrawal*

*Corresponding author for this work

Research output: Contribution to journal › Article › Academic › peer-review

Abstract

PURPOSE: To assess the diagnostic performance of two chatbots, ChatGPT and Glass, in uveitis diagnosis compared with renowned uveitis specialists, and to evaluate clinicians' perceptions of utilizing artificial intelligence (AI) in ophthalmology practice. METHODS: Six cases were presented to uveitis experts, ChatGPT (versions 3.5 and 4.0), and Glass 1.0, and diagnostic accuracy was analyzed. Additionally, a survey was conducted on clinicians' emotions, confidence in utilizing AI-based tools, and likelihood of incorporating such tools into clinical practice. RESULTS: Uveitis experts accurately diagnosed all cases (100%), while ChatGPT achieved a diagnostic success rate of 66% and Glass 1.0 achieved 33%. Most attendees felt excited or optimistic about utilizing AI in ophthalmology practice. Older age and a higher level of education were positively correlated with increased inclination to adopt AI-based tools. CONCLUSIONS: ChatGPT demonstrated promising diagnostic capabilities in uveitis cases, and ophthalmologists showed enthusiasm for the integration of AI into clinical practice.
Original language: English
Pages (from-to): 1-8
Number of pages: 8
Journal: Ocular Immunology and Inflammation
DOIs
Publication status: E-pub ahead of print - Oct 2023

Keywords

  • Artificial intelligence
  • ChatGPT
  • diagnosis
  • large language model
  • ophthalmology
