Am J OphthalmolMarch 2020Comparative Study

Human Versus Machine: Comparing a Deep Learning Algorithm to Human Gradings for Detecting Glaucoma on Fundus Photographs.

Authors

Alessandro A Jammal, Atalie C Thompson, Eduardo B Mariottoni, Samuel I Berchuck, Carla N Urata, Tais Estrela, Susan M Wakil, Vital P Costa, Felipe A Medeiros

Visual FieldOptic Nerve & Disc

95 citations 4 influential Open Access

Summary

An M2M DL algorithm performed as well as, if not better than, human graders at detecting eyes with repeatable glaucomatous visual field loss.

Abstract

PURPOSE

To compare the diagnostic performance of human gradings vs predictions provided by a machine-to-machine (M2M) deep learning (DL) algorithm trained to quantify retinal nerve fiber layer (RNFL) damage on fundus photographs.

DESIGN

Evaluation of a machine learning algorithm.

METHODS

An M2M DL algorithm trained with RNFL thickness parameters from spectral-domain optical coherence tomography was applied to a subset of 490 fundus photos of 490 eyes of 370 subjects graded by 2 glaucoma specialists for the probability of glaucomatous optical neuropathy (GON), and estimates of cup-to-disc (C/D) ratios. Spearman correlations with standard automated perimetry (SAP) global indices were compared between the human gradings vs the M2M DL-predicted RNFL thickness values. The area under the receiver operating characteristic curves (AUC) and partial AUC for the region of clinically meaningful specificity (85%-100%) were used to compare the ability of each output to discriminate eyes with repeatable glaucomatous SAP defects vs eyes with normal fields.

RESULTS

The M2M DL-predicted RNFL thickness had a significantly stronger absolute correlation with SAP mean deviation (rho=0.54) than the probability of GON given by human graders (rho=0.48; P < .001). The partial AUC for the M2M DL algorithm was significantly higher than that for the probability of GON by human graders (partial AUC = 0.529 vs 0.411, respectively; P = .016).

CONCLUSION

An M2M DL algorithm performed as well as, if not better than, human graders at detecting eyes with repeatable glaucomatous visual field loss. This DL algorithm could potentially replace human graders in population screening efforts for glaucoma.

More by Alessandro A Jammal

View full profile →

From Machine to Machine: An OCT-Trained Deep Learning Algorithm for Objective Quantification of Glaucomatous Damage in Fundus Photographs.

2019Ophthalmology211 citations

A Review of Deep Learning for Screening, Diagnosis, and Detection of Glaucoma Progression.

2020Transl Vis Sci Technol154 citations

Assessment of a Segmentation-Free Deep Learning Algorithm for Diagnosing Glaucoma From Optical Coherence Tomography Scans.

2020JAMA Ophthalmol112 citations

Top Research in Visual Field

Browse all →

Optical coherence tomography angiography: A comprehensive review of current methods and clinical applications.

2017Prog Retin Eye Res828 citations

Relationship between Optical Coherence Tomography Angiography Vessel Density and Severity of Visual Field Loss in Glaucoma.

2016Ophthalmology398 citations

Improving our understanding, and detection, of glaucomatous damage: An approach based upon optical coherence tomography (OCT).

2017Prog Retin Eye Res239 citations

Discussion

Comments and discussion will appear here in a future update.