A Novel Deep Learning Framework for Enhanced Glaucoma Detection Using Attention-Gated U-Net, Deep Wavelet Scattering, and Vision Transformers
Abstract
Globally, Glaucoma is a major cause of permanent blindness, and maintaining eyesight depends on early detection. Here, a brand-new deep-learning system for glaucoma prediction. In this work, we offer a novel deep-learning approach for enhanced glaucoma prediction that uses a denoising generative adversarial network for preprocessing the input image is provided, later the segmentation is carried out by Attention-Gated U-Net with Dilated Convolutions to segment the optic cup and optic disc. Feature Extraction Using a Deep Wavelet Scattering Network and finally the glaucoma classification is carried out by the Vision Transformers. An attention-gated U-Net with dilated convolutions for segmentation, which improves the accuracy of optic disc and cup boundaries by 7% compared to conventional U-Net methods is introduced. A Deep Wavelet Scattering Network (DWSN) for feature extraction that achieves a 5% improvement in feature discrimination over conventional CNNs by capturing multiscale texture and structural information is suggested. Lastly, ViT, which is based on transfer learning, is used for classification; it has a 94.6% accuracy rate, a 93.8% sensitivity rate, and a 95.2% specificity rate. The suggested approach outperformed CNN-based models by improving by about 4% on all criteria. The system achieved an F1 score of 0.95 and an AUC (Area Under Curve) of 0.96 when tested on publicly accessible glaucoma datasets. Multi-stage deep-learning processing for glaucoma prediction by integrating a denoising generative adversarial network for image preprocessing, Attention-Gated U-Net with Dilated Convolutions for exact optic cup and disc segmentation, deep wavelet scattering for feature extraction, and Vision Transformers for glaucoma classification.
Downloads
References
P. B. Singh, P. Singh, H. Dev, A. Tiwari, D. Batra, and B. K. Chaurasia, “Glaucoma Classification using Light Vision Transformer,” EAI Endorsed Trans Pervasive Health Technol, vol. 9, Sep. 2023, doi: 10.4108/eetpht.9.3931.
J. Nayak, R. Acharya U., P. S. Bhat, N. Shetty, and T. C. Lim, “Automated diagnosis of glaucoma using digital fundus images,” J Med Syst, vol. 33, no. 5, 2009, doi: 10.1007/s10916-008-9195-z.
M. Juneja et al., “Automated detection of Glaucoma using deep learning convolution network (G-net),” Multimed Tools Appl, vol. 79, no. 21–22, pp. 15531–15553, Jun. 2020, doi: 10.1007/s11042-019-7460-4.
J. Raja, P. Shanmugam, and R. Pitchai, “An Automated Early Detection of Glaucoma using Support Vector Machine Based Visual Geometry Group 19 (VGG-19) Convolutional Neural Network,” Wirel Pers Commun, vol. 118, no. 1, pp. 523–534, May 2021, doi: 10.1007/s11277-020-08029-z.
J. Martins, J. S. Cardoso, and F. Soares, “Offline computer-aided diagnosis for Glaucoma detection using fundus images targeted at mobile devices,” Comput Methods Programs Biomed, vol. 192, Aug. 2020, doi: 10.1016/j.cmpb.2020.105341.
O. J. Afolabi, G. P. Mabuza-Hocquet, F. V. Nelwamondo, and B. S. Paul, “The Use of U-Net Lite and Extreme Gradient Boost (XGB) for Glaucoma Detection,” IEEE Access, vol. 9, pp. 47411–47424, 2021, doi: 10.1109/ACCESS.2021.3068204.
P. Mehta et al., “Automated Detection of Glaucoma With Interpretable Machine Learning Using Clinical Data and Multimodal Retinal Images,” Am J Ophthalmol, vol. 231, 2021, doi: 10.1016/j.ajo.2021.04.021.
R. Song, H. Wang, and Y. Xing, “Deep Learning-based Glaucoma Detection Using CNN and Digital Fundus Images: A Promising Approach for Precise Diagnosis,” Curr Med Imaging Rev, vol. 20, 2024, doi: 10.2174/0115734056257657231115051020.
N. Mojab, V. Noroozi, P. S. Yu, and J. A. Hallak, “Deep multi-task learning for interpretable glaucoma detection,” in Proceedings - 2019 IEEE 20th International Conference on Information Reuse and Integration for Data Science, IRI 2019, 2019. doi: 10.1109/IRI.2019.00037.
W. T. Song, I. C. Lai, and Y. Z. Su, “A Statistical Robust Glaucoma Detection Framework Combining Retinex, CNN, and DOE Using Fundus Images,” IEEE Access, vol. 9, pp. 103772–103783, 2021, doi: 10.1109/ACCESS.2021.3098032.
D. S. David, “Enhanced glaucoma detection using ensemble based CNN and spatially based ellipse fitting curve model,” J Ambient Intell Humaniz Comput, vol. 14, no. 4, 2023, doi: 10.1007/s12652-021-03467-4.
I. J. C. MacCormick et al., “Accurate, fast, data efficient and interpretable glaucoma diagnosis with automated spatial analysis of the whole cup to disc profile,” PLoS One, vol. 14, no. 1, 2019, doi: 10.1371/journal.pone.0209409.
S. Thainimit, P. Chaipayom, N. Sa-arnwong, D. Gansawat, S. Petchyim, and S. Pongrujikorn, “Robotic process automation support in telemedicine: Glaucoma screening usage case,” Inform Med Unlocked, vol. 31, Jan. 2022, doi: 10.1016/j.imu.2022.101001.
M. Murugesan, T. S. Jeyali Laseetha, S. Sundaram, and H. Kandasamy, “Glaucoma disease detection using stacked attention U-Net and deep convolutional neural network,” Journal of Intelligent & Fuzzy Systems, vol. 45, no. 1, pp. 1603–1616, Jul. 2023, doi: 10.3233/JIFS-230659.
S. Pathan, P. Kumar, R. M. Pai, and S. V. Bhandary, “Automated segmentation and classifcation of retinal features for glaucoma diagnosis,” Biomed Signal Process Control, vol. 63, Jan. 2021, doi: 10.1016/j.bspc.2020.102244.
P. K. Chaudhary and R. B. Pachori, “Automatic diagnosis of glaucoma using two-dimensional Fourier-Bessel series expansion based empirical wavelet transform,” Biomed Signal Process Control, vol. 64, Feb. 2021, doi: 10.1016/j.bspc.2020.102237.
I.-E. Haouli, W. Hariri, and H. Seridi-Bouchelaghem, “Exploring Vision Transformers for Automated Glaucoma Disease Diagnosis in Fundus Images,” in 2023 International Conference on Decision Aid Sciences and Applications (DASA), IEEE, Sep. 2023, pp. 520–524. doi: 10.1109/DASA59624.2023.10286714.
X. R. Gao, F. Wu, P. T. Yuhas, R. K. Rasel, and M. Chiariglione, “Automated vertical cup-to-disc ratio determination from fundus images for glaucoma detection,” Sci Rep, vol. 14, no. 1, 2024, doi: 10.1038/s41598-024-55056-y.
N. E. A. Khalid, N. M. Noor, and N. M. Ariff, “Fuzzy c-Means (FCM) for optic cup and disc segmentation with morphological operation,” in Procedia Computer Science, 2014. doi: 10.1016/j.procs.2014.11.060.
D. Padalia, A. Mazumdar, and B. Singh, “A CNN-LSTM Combination Network for Cataract Detection using Eye Fundus Images,” Oct. 2022, [Online]. Available: http://arxiv.org/abs/2210.16093
N. A. Alkhaldi and R. E. Alabdulathim, “Optimizing Glaucoma Diagnosis with Deep Learning-Based Segmentation and Classification of Retinal Images,” Applied Sciences, vol. 14, no. 17, p. 7795, Sep. 2024, doi: 10.3390/app14177795.
F. Alaoui Abdalaoui Slimani and M. Bentourkia, “Improving deep learning U-Net++ by discrete wavelet and attention gate mechanisms for effective pathological lung segmentation in chest X-ray imaging,” Phys Eng Sci Med, Nov. 2024, doi: 10.1007/s13246-024-01489-8.
V. Krishnamoorthy and S. Logeswari, “An interpretable glaucoma detection using dual scale cross-attention vision transformer-based long short term memory with optical cup and disk segmentation,” j mech med biol, oct. 2024, doi: 10.1142/s0219519424500325.
T. Shyamalee and D. Meedeniya, “Glaucoma Detection with Retinal Fundus Images Using Segmentation and Classification,” Machine Intelligence Research, vol. 19, no. 6, pp. 563–580, Dec. 2022, doi: 10.1007/s11633-022-1354-z.
Y. Xue et al., “A multi-feature deep learning system to enhance glaucoma severity diagnosis with high accuracy and fast speed,” J Biomed Inform, vol. 136, Dec. 2022, doi: 10.1016/j.jbi.2022.104233.
S. Gheisari et al., “A combined convolutional and recurrent neural network for enhanced glaucoma detection,” Sci Rep, vol. 11, no. 1, 2021, doi: 10.1038/s41598-021-81554-4.
R. Mahum, S. U. Rehman, O. D. Okon, A. Alabrah, T. Meraj, and H. T. Rauf, “A novel hybrid approach based on deep cnn to detect glaucoma using fundus imaging,” Electronics (Switzerland), vol. 11, no. 1, 2022, doi: 10.3390/electronics11010026.
V. V. N. S. Kumar, G. Harinath Reddy, and M. N. GiriPrasad, “A novel glaucoma detection model using Unet++-based segmentation and ResNet with GRU-based optimized deep learning,” Biomed Signal Process Control, vol. 86, 2023, doi: 10.1016/j.bspc.2023.105069.
Y. George, B. J. Antony, H. Ishikawa, G. Wollstein, J. S. Schuman, and R. Garnavi, “Attention-Guided 3D-CNN Framework for Glaucoma Detection and Structural-Functional Association Using Volumetric Images,” IEEE J Biomed Health Inform, vol. 24, no. 12, pp. 3421–3430, Dec. 2020, doi: 10.1109/JBHI.2020.3001019.
R. Kashyap, R. Nair, S. M. P. Gangadharan, M. Botto-Tobar, S. Farooq, and A. Rizwan, “Glaucoma Detection and Classification Using Improved U-Net Deep Learning Model,” Healthcare (Switzerland), vol. 10, no. 12, Dec. 2022, doi: 10.3390/healthcare10122497.
D. Parashar and D. K. Agrawal, “Automatic Classification of Glaucoma Stages Using Two-Dimensional Tensor Empirical Wavelet Transform,” IEEE Signal Process Lett, vol. 28, 2021, doi: 10.1109/LSP.2020.3045638.
Md. A. R. Ratul, M. H. Mozaffari, W.-S. Lee, and E. Parimbelli, “Skin Lesions Classification Using Deep Learning Based on Dilated Convolution,” Nov. 30, 2019. doi: 10.1101/860700.
C.-F. Chen, Q. Fan, and R. Panda, “CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification.” [Online]. Available: https://github.com/IBM/CrossViT.
L. Li et al., “A Large-scale Database and a CNN Model for Attention-based Glaucoma Detection,” 2019. [Online]. Available: https://github.com/smilell/AG-CNN.
M. B. Sudhan et al., “Segmentation and Classification of Glaucoma Using U-Net with Deep Learning Model,” J Healthc Eng, vol. 2022, 2022, doi: 10.1155/2022/1601354.
Copyright (c) 2025 Krishnamoorthy V, Sivanantham S, Akshaya V, Nivedha S, Sivakumar Depuru, Manikandan M

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Authors who publish with this journal agree to the following terms:
- Authors retain copyright and grant the journal right of first publication with the work simultaneously licensed under a Creative Commons Attribution-ShareAlikel 4.0 International (CC BY-SA 4.0) that allows others to share the work with an acknowledgement of the work's authorship and initial publication in this journal.
- Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the journal's published version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgement of its initial publication in this journal.
- Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See The Effect of Open Access).