Combination Of Gamma Correction and Vision Transformer In Lung Infection Classification On CT-Scan Images

Keywords: Lungs, CT-Scan, Classification, Gamma Correction, Vision Transformer.

Abstract

Lung infection is an inflammatory condition of the lungs with a high mortality rate. Lung infections can be identified using CT-Scan images, where the affected areas are analyzed to determine the infection type. However, manual interpretation of CT-Scan results by medical specialists is often time-consuming, subjective, and requires a high level of accuracy. To address these challenges, this study proposes an automated classification method for lung infections using deep learning techniques. Convolutional Neural Networks (CNNs) are widely used for image classification tasks. However, CNN operates locally with limited receptive fields, making capturing global patterns in complex lung CT images challenging. CNN also struggles to model long-range pixel dependencies, which is crucial for analyzing visually similar regions in lung CT-Scans. This study uses a Vision Transformer (ViT) to overcome CNN limitations. ViT employs self-attention mechanisms to capture global dependencies across the entire image. The main contribution of this study is the implementation of ViT to enhance classification performance in lung CT-Scan images by capturing complex and global image patterns that CNN fails to model. However, ViT requires a large dataset to perform optimally. To overcome these challenges, augmentation techniques such as flipping, rotation, and gamma correction are applied to increase the amount of data without altering the important features. The dataset comprises lung CT-scan images sourced from Kaggle and is divided into Covid and Non-Covid classes.  The proposed method demonstrated excellent classification performance, achieving accuracy, sensitivity, specificity, precision, and F1-Score above 90%. Additionally, the Cohen’s kappa coefficient reached 89%. These results show that the proposed method effectively classifies lung infections using CT-Scan images and has strong potential as a clinical decision-support tool, particularly in reducing diagnostic time and improving consistency in medical evaluations.

Downloads

Download data is not yet available.

References

J. Oliva and O. Terrier, “Viral and bacterial co-infections in the lungs: Dangerous liaisons,” Viruses, vol. 13, no. 9, 2021, doi: 10.3390/v13091725.

Y. Gao et al., “Size and charge adaptive clustered nanoparticles targeting the biofilm microenvironment for chronic lung infection management,” ACS Nano, vol. 14, no. 5, pp. 5686–5699, May 2020, doi: 10.1021/acsnano.0c00269.

K. Dimilier, B. Ugur, and Y. K. Ever, “Tumor Detection on CT Lung Images Using Image Enhancement,” J. Sci. Technol., vol. 7, no. 1, pp. 133–136, 2017.

G. Muhammad and M. Shamim Hossain, “COVID-19 and Non-COVID-19 Classification using Multi-layers Fusion From Lung Ultrasound Images,” Inf. Fusion, vol. 72, pp. 80–88, Aug. 2021, doi: 10.1016/j.inffus.2021.02.013.

M. Mazonakis and J. Damilakis, “Computed tomography: What and how does it measure?,” Eur. J. Radiol., vol. 85, no. 8, pp. 1499–1504, Aug. 2016, doi: 10.1016/j.ejrad.2016.03.002.

C. C. Ukwuoma et al., “Automated Lung-Related Pneumonia and COVID-19 Detection Based on Novel Feature Extraction Framework and Vision Transformer Approaches Using Chest X-ray Images,” Bioengineering, vol. 9, no. 11, p. 709, Nov. 2022, doi: 10.3390/bioengineering9110709.

Y. Guo and Y. Peng, “BSCN: Bidirectional Symmetric Cascade Network for retinal vessel segmentation,” BMC Med. Imaging, vol. 20, no. 1, p. 20, Dec. 2020, doi: 10.1186/s12880-020-0412-7.

X. Hua et al., “WSC-Trans: A 3D network model for automatic multi-structural segmentation of temporal bone CT,” arXiv Prepr. arXiv2211.07143, 2022.

G. I. Okolo, S. Katsigiannis, and N. Ramzan, “IEViT: An enhanced vision transformer architecture for chest X-ray image classification,” Comput. Methods Programs Biomed., vol. 226, p. 107141, 2022, doi: 10.1016/j.cmpb.2022.107141.

X. Liu, Z. Yu, and L. Tan, “Deep Learning for Lung Disease Classification Using Transfer Learning and a Customized CNN Architecture with Attention,” in 2024 IEEE 2nd International Conference on Sensors, Electronics and Computer Engineering (ICSECE), Aug. 2024, pp. 341–346. doi: 10.1109/ICSECE61636.2024.10729291.

M. N. Toroghi, U. U. Sheikh, and S. S. Irani, “Classification of COVID-19 and Lung Opacity using Vision Transformer on Chest X-ray Images,” J. Phys. Conf. Ser., vol. 2622, no. 1, pp. 1–8, 2023, doi: 10.1088/1742-6596/2622/1/012016.

M. Ragab, S. Alshehri, N. A. Alhakamy, R. F. Mansour, and D. Koundal, “Multiclass Classification of Chest X-Ray Images for the Prediction of COVID-19 Using Capsule Network,” Comput. Intell. Neurosci., vol. 2022, no. 6185013., pp. 1–8, May 2022, doi: 10.1155/2022/6185013.

Y. Bazi, L. Bashmal, M. M. Al Rahhal, R. Al Dayil, and N. Al Ajlan, “Vision transformers for remote sensing image classification,” Remote Sens., vol. 13, no. 3, pp. 1–20, 2021, doi: 10.3390/rs13030516.

C. F. Chen, Q. Fan, and R. Panda, “CrossViT: Cross-Attention Multi-Scale Vision Transformer for Image Classification,” Proc. IEEE Int. Conf. Comput. Vis., pp. 347–356, 2021, doi: 10.1109/ICCV48922.2021.00041.

A. Mezina and R. Burget, “Detection of Post-COVID-19-Related Pulmonary Diseases in X-ray Images using Vision Transformer-based Neural Network,” Biomed. Signal Process. Control, vol. 87, p. 105380, Jan. 2024, doi: 10.1016/j.bspc.2023.105380.

A. Souid, N. Sakli, and H. Sakli, “Classification and Predictions of Lung Diseases from Chest X‐Rays Using MobileNet V2,” Appl. Sci., vol. 11, no. 6, pp. 1–16, 2021, doi: 10.3390/app11062751.

X. Zhai et al., “An Image Is Worth 16 X 16 Words :,” Int. Conf. Learn. Represent., 2021.

A. Desiani, M. Erwin, B. Suprihatin, S. Yahdin, A. I. Putri, and F. R. Husein, “Bi-Path Architecture of CNN Segmentation and Classification Method for Cervical Cancer Disorders Based on Pap-smear Images,” Int. J. Comput. Sci., vol. 48, no. 3, pp. 1–9, 2021.

Erwin, A. Safmi, A. Desiani, B. Suprihatin, and Fathoni, “The Augmentation Data of Retina Image for Blood Vessel Segmentation Using U-Net Convolutional Neural Network Method,” Int. J. Comput. Intell. Appl., vol. 21, no. 01, p. 2250004, 2022, doi: 10.1142/S1469026822500043.

C. Shorten and T. M. Khoshgoftaar, “A Survey on Image Data Augmentation for Deep Learning,” J. Big Data, vol. 6, no. 1, 2019, doi: 10.1186/s40537-019-0197-0.

S. I. Maiyanti et al., “Rotation-Gamma Correction Augmentation on CNN-Dense Block for Soil Image Classification,” Appl. Comput. Sci., vol. 19, no. 3, pp. 96–115, 2023, doi: 10.35784/acs-2023-27.

X. Liu, G. Karagoz, and N. Meratnia, “Analyzing the Impact of Data Augmentation on the Explainability of Deep Learning-Based Medical Image Classification,” Mach. Learn. Knowl. Extr., vol. 7, no. 1, pp. 1–28, 2025, doi: 10.3390/make7010001.

A. Teramoto et al., “Automated classification of benign and malignant cells from lung cytological images using deep convolutional neural network,” Informatics Med. Unlocked, vol. 16, p. 100205, 2019, doi: 10.1016/j.imu.2019.100205.

B. A. R. and V. K. R. S., “Deep Learning-based Lung Cancer Classification of CT Images using Augmented Convolutional Neural Networks,” ELCVIA Electron. Lett. Comput. Vis. Image Anal., vol. 21, no. 1, Sep. 2022, doi: 10.5565/rev/elcvia.1490.

P. Yadlapalli, D. Bhavana, and S. Gunnam, “Intelligent classification of lung malignancies using deep learning techniques,” Int. J. Intell. Comput. Cybern., vol. 15, no. 3, pp. 345–362, Jul. 2022, doi: 10.1108/IJICC-07-2021-0147.

K. Alomar, H. I. Aysel, and X. Cai, “Data Augmentation in Classification and Segmentation: A Survey and New Strategies,” J. Imaging, vol. 9, no. 2, p. 46, Feb. 2023, doi: 10.3390/jimaging9020046.

P. Thanapol, K. Lavangnananda, P. Bouvry, F. Pinel, and F. Leprévost, “Reducing Overfitting and Improving Generalization in Training Convolutional Neural Network (CNN) under Limited Sample Sizes in Image Recognition,” in 2020 - 5th International Conference on Information Technology (InCIT), 2020, pp. 300–305. doi: 10.1109/InCIT50588.2020.9310787.

M. A. Khan et al., “Lungs cancer classification from CT images: An integrated design of contrast based classical features fusion and selection,” Pattern Recognit. Lett., vol. 129, pp. 77–85, 2020, doi: 10.1016/j.patrec.2019.11.014.

A. Desiani, Erwin, B. Suprihatin, M. Adrezo, and A. M. Alfan, “A Hybrid System for Enhancement Retinal Image Reduction,” in Proceedings - 3rd International Conference on Informatics, Multimedia, Cyber, and Information System, ICIMCIS 2021, 2021, pp. 80–85. doi: 10.1109/ICIMCIS53775.2021.9699259.

W. Setiawan, M. M. Suhadi, Husni, and Y. D. Pramudita, “Histopathology of Lung Cancer Classification Using Convolutional Neural Network With Gamma Correction,” Commun. Math. Biol. Neurosci., vol. 2022, pp. 1–17, 2022, doi: 10.28919/cmbn/7611.

S.-E. Weng, S.-G. Miaou, and R. Christanto, “A Lightweight Low-Light Image Enhancement Network via Channel Prior and Gamma Correction,” pp. 1–23, 2024.

T. Rahman et al., “Exploring the effect of image enhancement techniques on COVID-19 detection using chest X-ray images,” Comput. Biol. Med., vol. 132, p. 104319, 2021, doi: https://doi.org/10.1016/j.compbiomed.2021.104319.

X. Sun et al., “Robust Retinal Vessel Segmentation from a Data Augmentation Perspective,” in Ophthalmic Medical Image Analysis, 2021, pp. 189–198.

Erwin, H. K. Putra, B. Suprihatin, and F. Ramadhini, “A Hybrid CLAHE-GAMMA Adjustment and Densely Connected U-NET for Retinal Blood Vessel Segmentation using Augmentation Data,” Eng. Lett., vol. 30, no. 2, pp. 485–493, 2022.

X. Zhu, Y. Jia, S. Jian, L. Gu, and Z. Pu, “ViTT: Vision Transformer Tracker,” Sensors, vol. 21, no. 16. 2021. doi: 10.3390/s21165608.

A. Dosovitskiy et al., “An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale,” 2020.

A. Vaswani et al., “Attention is All you Need,” in Advances in Neural Information Processing Systems, 2017, vol. 30.

J. C. Ye, “Artificial Neural Networks and Backpropagation,” Math. Ind., vol. 37, no. August, pp. 91–112, 2022, doi: 10.1007/978-981-16-6046-7_6.

Y. Chen, X. Gu, Z. Liu, and J. Liang, “A Fast Inference Vision Transformer for Automatic Pavement Image Classification and Its Visual Interpretation Method,” Remote Sens., vol. 14, no. 8, pp. 1–20, 2022, doi: 10.3390/rs14081877.

S. M. Shafi and S. K. Chinnappan, Segmenting and classifying lung diseases with M-Segnet and Hybrid Squeezenet-CNN architecture on CT images, vol. 19, no. 5 May. 2024. doi: 10.1371/journal.pone.0302507.

K. Sriporn, C.-F. Tsai, C.-E. Tsai, and P. Wang, “Healthcare Analyzing Lung Disease Using Highly Effective Deep Teqhniques,” MDPI Healthc., vol. 8, no. 107, pp. 1–21, 2020.

H. Alshazly, C. Linse, M. Abdalla, E. Barth, and T. Martinetz, “COVID-Nets: Deep CNN Architectures for Detecting COVID-19 using Chest CT Scans,” PeerJ Comput. Sci., vol. 7, pp. 1–40, 2021, doi: 10.7717/peerj-cs.655.

S. Wang et al., “A Deep Learning Algorithm Using CT Images to Screen for Corona Virus Disease (COVID-19),” IMAGING INFORMATICS Artif. Intell. A, vol. 31, pp. 6096–6104, 2021.

X. He et al., “Sample-Efficient Deep Learning for COVID-19 Diagnosis Based on CT Scans,” IEEE Trans. Med. Imaging, pp. 1–10, 2020.

Published
2025-07-09
How to Cite
[1]
L. I. Kesuma, P. Octavia, P. Sari, G. M. C. Batubara, and K. Karina, “Combination Of Gamma Correction and Vision Transformer In Lung Infection Classification On CT-Scan Images”, j.electron.electromedical.eng.med.inform, vol. 7, no. 3, pp. 881-893, Jul. 2025.
Section
Medical Engineering