Using artificial intelligence to improve human performance: efficient retinal disease detection training with synthetic images.
Hitoshi Tabuchi, Justin Engelmann, Fumiatsu Maeda, Ryo Nishikawa, Toshihiko Nagasawa, Tomofusa Yamauchi, Mao Tanabe, Masahiro Akada, Keita Kihara, Yasuyuki Nakae, Yoshiaki Kiuchi, Miguel O Bernabeu
Summary
Synthetic images can be used effectively in medical education. We also found that humans are more robust to novel situations than AI models, thus showcasing human judgement's essential role in medical diagnosis.
Abstract
BACKGROUND
Artificial intelligence (AI) in medical imaging diagnostics has huge potential, but human judgement is still indispensable. We propose an AI-aided teaching method that leverages generative AI to train students on many images while preserving patient privacy.
METHODS
A web-based course was designed using 600 synthetic ultra-widefield (UWF) retinal images to teach students to detect disease in these images. The images were generated by stable diffusion, a large generative foundation model, which we fine-tuned with 6285 real UWF images from six categories: five retinal diseases (age-related macular degeneration, glaucoma, diabetic retinopathy, retinal detachment and retinal vein occlusion) and normal. 161 trainee orthoptists took the course. They were evaluated with two tests: one consisting of UWF images and another of standard field (SF) images, which the students had not encountered in the course. Both tests contained 120 real patient images, 20 per category. The students took both tests once before and after training, with a cool-off period in between.
RESULTS
On average, students completed the course in 53 min, significantly improving their diagnostic accuracy. For UWF images, student accuracy increased from 43.6% to 74.1% (p<0.0001 by paired t-test), nearly matching the previously published state-of-the-art AI model's accuracy of 73.3%. For SF images, student accuracy rose from 42.7% to 68.7% (p<0.0001), surpassing the state-of-the-art AI model's 40%.
CONCLUSION
Synthetic images can be used effectively in medical education. We also found that humans are more robust to novel situations than AI models, thus showcasing human judgement's essential role in medical diagnosis.
Keywords
More by Hitoshi Tabuchi
View full profile →Deep-learning Classifier With an Ultrawide-field Scanning Laser Ophthalmoscope Detects Glaucoma Visual Field Severity.
Comparison of the Intraocular Pressure Measured Using the New Rebound Tonometer Icare ic100 and Icare TA01i or Goldmann Applanation Tonometer.
Evaluation of Automatic Monitoring of Instillation Adherence Using Eye Dropper Bottle Sensor and Deep Learning in Patients With Glaucoma.
Top Research in Diagnosis & Screening
Browse all →Efficacy of a Deep Learning System for Detecting Glaucomatous Optic Neuropathy Based on Color Fundus Photographs.
Dry eye disease and oxidative stress.
Central Corneal Thickness in the Ocular Hypertension Treatment Study (OHTS).
Discussion
Comments and discussion will appear here in a future update.