Am J OphthalmolOctober 2024Journal Article

Predicting Glaucoma Before Onset Using a Large Language Model Chatbot.

Authors

Xiaoqin Huang, Hina Raja, Yeganeh Madadi, Mohammad Delsoz, Asma Poursoroush, Malik Y Kahook, Siamak Yousefi

Diagnosis & ScreeningGlaucoma Surgery

28 citations

Summary

The performance of ChatGPT4.0 in forecasting development of glaucoma 1 year before onset was reasonable.

Abstract

PURPOSE

To investigate the capability of ChatGPT for forecasting the conversion from ocular hypertension (OHT) to glaucoma based on the Ocular Hypertension Treatment Study (OHTS).

DESIGN

Retrospective case-control study.

PARTICIPANTS

A total of 3008 eyes of 1504 subjects from the OHTS were included in the study.

METHODS

We selected demographic, clinical, ocular, optic nerve head, and visual field (VF) parameters 1 year before glaucoma development from the OHTS participants. Subsequently, we developed queries by converting tabular parameters into textual format based on both eyes of all participants. We used the ChatGPT application program interface (API) to automatically perform ChatGPT prompting for all subjects. We then investigated whether ChatGPT can accurately forecast conversion from OHT to glaucoma based on various objective metrics.

MAIN OUTCOME MEASURE

Accuracy, area under the receiver operating characteristic curve (AUC), sensitivity, specificity, and weighted F1 score.

RESULTS

ChatGPT4.0 demonstrated an accuracy of 75%, AUC of 0.67, sensitivity of 56%, specificity of 78%, and weighted F1 score of 0.77 in predicting conversion to glaucoma 1 year before onset. ChatGPT3.5 provided an accuracy of 61%, AUC of 0.62, sensitivity of 64%, specificity of 59%, and weighted F1 score of 0.63 in predicting conversion to glaucoma 1 year before onset.

CONCLUSIONS

The performance of ChatGPT4.0 in forecasting development of glaucoma 1 year before onset was reasonable. The overall performance of ChatGPT4.0 was consistently higher than ChatGPT3.5. Large language models (LLMs) hold great promise for augmenting glaucoma research capabilities and enhancing clinical care. Future efforts in creating ophthalmology-specific LLMs that leverage multimodal data in combination with active learning may lead to more useful integration with clinical practice and deserve further investigations.

More by Xiaoqin Huang

View full profile →

Estimating the Severity of Visual Field Damage From Retinal Nerve Fiber Layer Thickness Measurements With Artificial Intelligence.

2021Transl Vis Sci Technol17 citations

An Objective and Easy-to-Use Glaucoma Functional Severity Staging System Based on Artificial Intelligence.

2022J Glaucoma11 citations

Identifying Factors Associated With Fast Visual Field Progression in Patients With Ocular Hypertension Based on Unsupervised Machine Learning.

2024J Glaucoma4 citations

Top Research in Diagnosis & Screening

Browse all →

Efficacy of a Deep Learning System for Detecting Glaucomatous Optic Neuropathy Based on Color Fundus Photographs.

2018Ophthalmology701 citations

Dry eye disease and oxidative stress.

2018Acta Ophthalmol299 citations

Central Corneal Thickness in the Ocular Hypertension Treatment Study (OHTS).

2020Ophthalmology293 citations

Discussion

Comments and discussion will appear here in a future update.