Transl Vis Sci Technol | April 2020 | Research Support, N.I.H., Extramural

Effects of Study Population, Labeling and Training on Glaucoma Detection Using Deep Learning Algorithms.

Artificial Intelligence | Diagnosis & Screening

Summary

Deep learning glaucoma detection can achieve high accuracy across diverse datasets with appropriate training strategies.

Abstract

PURPOSE

To compare performance of independently developed deep learning algorithms for detecting glaucoma from fundus photographs and to evaluate strategies for incorporating new data into models.

METHODS

Two fundus photograph datasets from the Diagnostic Innovations in Glaucoma Study/African Descent and Glaucoma Evaluation Study and Matsue Red Cross Hospital were used to independently develop deep learning algorithms for detection of glaucoma at the University of California, San Diego, and the University of Tokyo. We compared three versions of the University of California, San Diego, and University of Tokyo models: original (no retraining), sequential (retraining only on new data), and combined (training on combined data). Independent datasets were used to test the algorithms.
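
The three model versions differ only in which data the model sees and whether training starts from the existing weights. The sketch below illustrates that distinction with a tiny logistic regression in plain Python. It is not the authors' implementation: the function names (`sgd_fit`, `build_models`), the learning rate, and the epoch count are hypothetical, and the assumption that the combined model is trained from scratch is ours, not stated in the abstract.

```python
import math


def sgd_fit(weights, samples, epochs=50, lr=0.1):
    """Continue logistic-regression training from `weights`.

    `samples` is a list of (features, label) pairs with label 0 or 1.
    Returns a new weight list; the input weights are left unchanged.
    """
    w = list(weights)
    for _ in range(epochs):
        for features, label in samples:
            z = sum(wi * xi for wi, xi in zip(w, features))
            z = max(-30.0, min(30.0, z))  # clamp to avoid overflow in exp
            p = 1.0 / (1.0 + math.exp(-z))
            for i, xi in enumerate(features):
                w[i] -= lr * (p - label) * xi
    return w


def build_models(source_data, new_data, n_features):
    """Return weights for the three strategies named in the abstract."""
    zeros = [0.0] * n_features
    # original: trained on the source dataset only, no retraining
    original = sgd_fit(zeros, source_data)
    # sequential: start from the original weights, retrain only on new data
    sequential = sgd_fit(original, new_data)
    # combined: train on source and new data together
    # (assumed here to start from scratch)
    combined = sgd_fit(zeros, source_data + new_data)
    return {"original": original, "sequential": sequential, "combined": combined}
```

Because the sequential model never revisits the source data, it can drift away from what it learned there, which is one reason to compare it against the combined model on every test set.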

RESULTS

The original University of California, San Diego and University of Tokyo models performed similarly (area under the receiver operating characteristic curve = 0.96 and 0.97, respectively) for detection of glaucoma in the Matsue Red Cross Hospital dataset, but not in the Diagnostic Innovations in Glaucoma Study/African Descent and Glaucoma Evaluation Study data (0.79 and 0.92, respectively; P < .001). Model performance was higher when classifying moderate-to-severe disease than when classifying mild disease (area under the receiver operating characteristic curve = 0.98 and 0.91, respectively; P < .001). Models trained with the combined strategy generally performed better across all datasets than models trained with the original strategy.
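
The area under the receiver operating characteristic curve equals the probability that a randomly chosen glaucoma eye receives a higher model score than a randomly chosen healthy eye. The sketch below computes it that way and repeats the calculation per severity group against the same healthy eyes. The function names and the toy scores are hypothetical and are not data from the study.

```python
def auc(positive_scores, negative_scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    Ties count as half a correct pair. Both lists must be non-empty.
    """
    wins = 0.0
    for p in positive_scores:
        for n in negative_scores:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positive_scores) * len(negative_scores))


def auc_by_severity(healthy_scores, glaucoma_scores_by_severity):
    """Compute one AUC per severity group, each against all healthy eyes."""
    return {
        severity: auc(scores, healthy_scores)
        for severity, scores in glaucoma_scores_by_severity.items()
    }


# Toy scores for illustration only
healthy = [0.1, 0.4]
by_severity = {"mild": [0.3, 0.6], "moderate_to_severe": [0.8, 0.9]}
print(auc_by_severity(healthy, by_severity))
```

With these toy scores the mild group is ranked correctly in three of four pairs and the moderate-to-severe group in all four, which mirrors the direction of the reported effect without reproducing its values.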

CONCLUSIONS

Deep learning glaucoma detection can achieve high accuracy across diverse datasets with appropriate training strategies. Because model performance was influenced by the severity of disease, labeling, training strategies, and population characteristics, reporting accuracy stratified by relevant covariates is important for cross-study comparisons.

TRANSLATIONAL RELEVANCE

High sensitivity and specificity of deep learning algorithms for moderate-to-severe glaucoma across diverse populations suggest a role for artificial intelligence in the detection of glaucoma in primary care.
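
Sensitivity and specificity depend on the score threshold used to call an image glaucomatous. As a minimal sketch, assuming the model outputs a score per image and that scores at or above a chosen threshold are treated as positive, the two quantities can be computed as follows. The function name, the threshold, and the toy values are hypothetical.

```python
def sensitivity_specificity(scores, labels, threshold):
    """Return (sensitivity, specificity) at a given score threshold.

    `labels` uses 1 for glaucoma and 0 for healthy. The input must
    contain at least one eye of each class.
    """
    tp = fn = tn = fp = 0
    for score, label in zip(scores, labels):
        predicted = 1 if score >= threshold else 0
        if label == 1 and predicted == 1:
            tp += 1
        elif label == 1:
            fn += 1
        elif predicted == 0:
            tn += 1
        else:
            fp += 1
    return tp / (tp + fn), tn / (tn + fp)


# Toy values for illustration only
scores = [0.9, 0.6, 0.4, 0.7, 0.2, 0.1]
labels = [1, 1, 1, 0, 0, 0]
sens, spec = sensitivity_specificity(scores, labels, threshold=0.5)
```

Raising the threshold trades sensitivity for specificity, so a screening setting would typically fix one of the two and report the other.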

Keywords

artificial intelligence, glaucoma, imaging, machine learning, optic disc

Authors: Mark Christopher et al.

