Predicting Glaucoma Before Onset Using a Large Language Model Chatbot.
Xiaoqin Huang, Hina Raja, Yeganeh Madadi, Mohammad Delsoz, Asma Poursoroush, Malik Y Kahook, Siamak Yousefi
Summary
The performance of ChatGPT4.0 in forecasting development of glaucoma 1 year before onset was reasonable.
Abstract
PURPOSE
To investigate the capability of ChatGPT for forecasting the conversion from ocular hypertension (OHT) to glaucoma based on the Ocular Hypertension Treatment Study (OHTS).
DESIGN
Retrospective case-control study.
PARTICIPANTS
A total of 3008 eyes of 1504 subjects from the OHTS were included in the study.
METHODS
We selected demographic, clinical, ocular, optic nerve head, and visual field (VF) parameters 1 year before glaucoma development from the OHTS participants. Subsequently, we developed queries by converting tabular parameters into textual format based on both eyes of all participants. We used the ChatGPT application program interface (API) to automatically perform ChatGPT prompting for all subjects. We then investigated whether ChatGPT can accurately forecast conversion from OHT to glaucoma based on various objective metrics.
MAIN OUTCOME MEASURE
Accuracy, area under the receiver operating characteristic curve (AUC), sensitivity, specificity, and weighted F1 score.
RESULTS
ChatGPT4.0 demonstrated an accuracy of 75%, AUC of 0.67, sensitivity of 56%, specificity of 78%, and weighted F1 score of 0.77 in predicting conversion to glaucoma 1 year before onset. ChatGPT3.5 provided an accuracy of 61%, AUC of 0.62, sensitivity of 64%, specificity of 59%, and weighted F1 score of 0.63 in predicting conversion to glaucoma 1 year before onset.
CONCLUSIONS
The performance of ChatGPT4.0 in forecasting development of glaucoma 1 year before onset was reasonable. The overall performance of ChatGPT4.0 was consistently higher than ChatGPT3.5. Large language models (LLMs) hold great promise for augmenting glaucoma research capabilities and enhancing clinical care. Future efforts in creating ophthalmology-specific LLMs that leverage multimodal data in combination with active learning may lead to more useful integration with clinical practice and deserve further investigations.
More by Xiaoqin Huang
View full profile →Estimating the Severity of Visual Field Damage From Retinal Nerve Fiber Layer Thickness Measurements With Artificial Intelligence.
An Objective and Easy-to-Use Glaucoma Functional Severity Staging System Based on Artificial Intelligence.
Identifying Factors Associated With Fast Visual Field Progression in Patients With Ocular Hypertension Based on Unsupervised Machine Learning.
Top Research in Diagnosis & Screening
Browse all →Efficacy of a Deep Learning System for Detecting Glaucomatous Optic Neuropathy Based on Color Fundus Photographs.
Dry eye disease and oxidative stress.
Central Corneal Thickness in the Ocular Hypertension Treatment Study (OHTS).
Discussion
Comments and discussion will appear here in a future update.