Ophthalmol Sci2025Journal Article

The Impact of Race, Ethnicity, and Sex on Fairness in Artificial Intelligence for Glaucoma Prediction Models.

Authors

Rohith Ravindranath, Joshua D Stein, Tina Hernandez-Boussard, A Caroline Fisher, Sophia Y Wang

Artificial IntelligenceDisease Progression

Summary

AI models predicting glaucoma surgery progression show bias across sex, race, and ethnicity. Fairness and performance vary with sensitive attribute inclusion, highlighting the need for rigorous, population-specific fairness evaluations before deployment.

Abstract

OBJECTIVE

Despite advances in artificial intelligence (AI) in glaucoma prediction, most works lack multicenter focus and do not consider fairness concerning sex, race, or ethnicity. This study aims to examine the impact of these sensitive attributes on developing fair AI models that predict glaucoma progression to necessitating incisional glaucoma surgery.

DESIGN

Database study.

PARTICIPANTS

Thirty-nine thousand ninety patients with glaucoma, as identified by International Classification of Disease codes from 7 academic eye centers participating in the Sight OUtcomes Research Collaborative.

METHODS

We developed XGBoost models using 3 approaches: (1) excluding sensitive attributes as input features, (2) including them explicitly as input features, and (3) training separate models for each group. Model input features included demographic details, diagnosis codes, medications, and clinical information (intraocular pressure, visual acuity, etc.), from electronic health records. The models were trained on patients from 5 sites (N = 27 999) and evaluated on a held-out internal test set (N = 3499) and 2 external test sets consisting of N = 1550 and N = 2542 patients.

MAIN OUTCOMES AND MEASURES

Area under the receiver operating characteristic curve (AUROC) and equalized odds on the test set and external sites.

RESULTS

Six thousand six hundred eighty-two (17.1%) of 39 090 patients underwent glaucoma surgery with a mean age of 70.1 (standard deviation 14.6) years, 54.5% female, 62.3% White, 22.1% Black, and 4.7% Latinx/Hispanic. We found that not including the sensitive attributes led to better classification performance (AUROC: 0.77-0.82) but worsened fairness when evaluated on the internal test set. However, on external test sites, the opposite was true: including sensitive attributes resulted in better classification performance (AUROC: external #1 - [0.73-0.81], external #2 - [0.67-0.70]), but varying degrees of fairness for sex and race as measured by equalized odds.

CONCLUSIONS

Artificial intelligence models predicting whether patients with glaucoma progress to surgery demonstrated bias with respect to sex, race, and ethnicity. The effect of sensitive attribute inclusion and exclusion on fairness and performance varied based on internal versus external test sets. Prior to deployment, AI models should be evaluated for fairness on the target population.

FINANCIAL DISCLOSURES

Proprietary or commercial disclosure may be found in the Footnotes and Disclosures at the end of this article.

Keywords

BiasFairnessGlaucomaHealth disparitiesMachine learning

More by Rohith Ravindranath

View full profile →

Multimodal Artificial Intelligence Models Predicting Glaucoma Progression Using Electronic Health Records and Retinal Nerve Fiber Layer Scans.

2025Transl Vis Sci Technol7 citations

Improving Fairness and Mitigating Bias in Multicenter Electronic Health Records Models to Predict Glaucoma Outcomes.

2026Ophthalmol Sci

Independent Evaluation of RETFound Foundation Model's Performance on Optic Nerve Analysis Using Fundus Photography.

2025Ophthalmol Sci

Top Research in Artificial Intelligence

Browse all →

Digital technology, tele-medicine and artificial intelligence in ophthalmology: A global perspective.

2021Prog Retin Eye Res492 citations

Deep learning in ophthalmology: The technical and clinical considerations.

2019Prog Retin Eye Res447 citations

Efficacy of a Deep Learning System for Detecting Glaucomatous Optic Neuropathy Based on Color Fundus Photographs.

2018Ophthalmology262 citations

This article has not yet been placed in the Knowledge Library.

Discussion

Comments and discussion will appear here in a future update.