Br J Ophthalmol, September 2024, Journal Article

Using artificial intelligence to improve human performance: efficient retinal disease detection training with synthetic images.

Authors

Hitoshi Tabuchi, Justin Engelmann, Fumiatsu Maeda, Ryo Nishikawa, Toshihiko Nagasawa, Tomofusa Yamauchi, Mao Tanabe, Masahiro Akada, Keita Kihara, Yasuyuki Nakae, Yoshiaki Kiuchi, Miguel O Bernabeu

Diagnosis & Screening, Artificial Intelligence

Summary

Synthetic images can be used effectively in medical education. We also found that humans are more robust to novel situations than AI models, thus showcasing human judgement's essential role in medical diagnosis.

Abstract

BACKGROUND

Artificial intelligence (AI) in medical imaging diagnostics has huge potential, but human judgement is still indispensable. We propose an AI-aided teaching method that leverages generative AI to train students on many images while preserving patient privacy.

METHODS

A web-based course was designed using 600 synthetic ultra-widefield (UWF) retinal images to teach students to detect disease in these images. The images were generated by Stable Diffusion, a large generative foundation model, which we fine-tuned with 6285 real UWF images from six categories: five retinal diseases (age-related macular degeneration, glaucoma, diabetic retinopathy, retinal detachment and retinal vein occlusion) and normal. A total of 161 trainee orthoptists took the course. They were evaluated with two tests: one consisting of UWF images and another of standard field (SF) images, a modality the students had not encountered in the course. Each test contained 120 real patient images, 20 per category. The students took both tests once before and once after training, with a cool-off period in between.
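The test design described above can be checked with a few lines of arithmetic. The following is a minimal sketch, not the authors' code: the function name `build_composition` is hypothetical, and the chance-level figure assumes each question is a single-answer, six-way choice, which the abstract does not state explicitly.

```python
# Six categories named in the abstract: five retinal diseases plus normal.
CATEGORIES = [
    "age-related macular degeneration",
    "glaucoma",
    "diabetic retinopathy",
    "retinal detachment",
    "retinal vein occlusion",
    "normal",
]
IMAGES_PER_CATEGORY = 20  # per test, as stated in the abstract


def build_composition(categories, per_category):
    """Map each category to its image count in one balanced test."""
    return {category: per_category for category in categories}


composition = build_composition(CATEGORIES, IMAGES_PER_CATEGORY)
total_images = sum(composition.values())

# Assumption: one answer per image among six equally represented classes,
# so uniform random guessing is correct one time in six.
chance_accuracy = 1 / len(CATEGORIES)

print(total_images)
print(f"{chance_accuracy:.1%}")
```

Under that assumption, the balanced design gives a useful reference point for reading the accuracy figures: any score can be compared against what uniform guessing would achieve on the same test.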

RESULTS

On average, students completed the course in 53 min, significantly improving their diagnostic accuracy. For UWF images, student accuracy increased from 43.6% to 74.1% (p<0.0001 by paired t-test), nearly matching the previously published state-of-the-art AI model's accuracy of 73.3%. For SF images, student accuracy rose from 42.7% to 68.7% (p<0.0001), surpassing the state-of-the-art AI model's 40%.
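To make the comparisons above concrete, the sketch below computes the percentage-point gains and the post-training gap to the AI model from the reported figures, and shows how a paired t statistic is formed from per-student scores. It is illustrative only: the names `summarize` and `paired_t_statistic` are hypothetical, and the per-student numbers are invented solely to exercise the function, since the abstract reports only group means.

```python
import math
import statistics

# Accuracies (%) as reported in the abstract.
REPORTED = {
    "UWF": {"pre": 43.6, "post": 74.1, "ai": 73.3},
    "SF": {"pre": 42.7, "post": 68.7, "ai": 40.0},
}


def summarize(reported):
    """Percentage-point gain and post-training gap to the AI model, per test."""
    return {
        name: {
            "gain": round(r["post"] - r["pre"], 1),
            "vs_ai": round(r["post"] - r["ai"], 1),
        }
        for name, r in reported.items()
    }


def paired_t_statistic(pre, post):
    """Mean of the per-student differences divided by its standard error."""
    diffs = [after - before for before, after in zip(pre, post)]
    standard_error = statistics.stdev(diffs) / math.sqrt(len(diffs))
    return statistics.mean(diffs) / standard_error


summary = summarize(REPORTED)
print(summary)

# Hypothetical per-student accuracies (NOT study data), for illustration only.
toy_pre = [40.0, 45.0, 42.0, 47.0]
toy_post = [70.0, 76.0, 71.0, 79.0]
toy_t = paired_t_statistic(toy_pre, toy_post)
print(round(toy_t, 2))
```

The paired form is appropriate here because each student was tested both before and after training, so the analysis works on within-student differences rather than on two independent groups.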

CONCLUSION

Keywords

Diagnostic tests/Investigation, Imaging, Public health, Retina, Telemedicine
