DenseNet201
‐
SAT
: A Hybrid Attention‐Integrated Model for Glaucoma Detection
Shajila Beegam M. K., Mala Kalra ABSTRACT
This study presents and evaluates DenseNet201‐SAT, an attention enhanced deep learning framework for automated glaucoma diagnosis using ophthalmic fundus images. The developed composite neural network is called DenseNet201‐SAT and is composed of the DenseNet201 base with SE modules, self‐attention features, transformer blocks, and dense output classification layers. The model uses transfer learning to remediate the issue of limited training data. Training and testing of the model are done using five benchmark datasets, namely RIMONE‐DL, HRF, Drishti‐GS1, ACRIMA, and sjchoi86‐HRF. Performance evaluation is conducted using patient‐level splits among various fundus datasets (Drishti, sjchoi86‐HRF, ACRIMA, RIMONE, HRF) and a cross‐dataset. The size of test sets is between 6 and 141 images. The model achieves accuracy ranging from 0.8419 (95% CI: 0.500–1.0) on HRF to 0.9895 (95% CI: 0.969–1.0) on RIMONE, sensitivity 0.9701–1.0, specificity 0.7441–1.0, F1‐score 0.8169–0.9876, and AUC 0.8721–0.9851. In cross‐dataset evaluation, it achieves 0.9603 accuracy (95% CI: 0.921–0.990), 0.9715 F1‐score (95% CI: 0.940–0.994), and 0.9526 AUC (95% CI: 0.901–0.993), demonstrating strong generalization across datasets. Furthermore, the model achieved MCC values of 0.874 0.180, 0.966 0.060, 0.971 0.040, 0.979 0.040, 0.725 0.340, and 0.908 0.090 on DRISHTI‐GS1, sjchoi86‐HRF, ACRIMA, RIMONE‐DL, HRF, and the cross‐dataset setting, respectively, confirming balanced classification performance. The suggested DenseNet201‐SAT approach provides a robust and accurate solution for glaucoma diagnosis, utilizing the benefits of deep feature learning and strategies. Its consistent results across multiple datasets advocate significant potential for clinical application in automated glaucoma screening.