From Resolution to Explanation: Real-ESRGANand LIME Analysis of Vision Transformers andCNNs for Brain Tumor MRI Classification
DOI:
https://doi.org/10.53799/jaay3z78Keywords:
Vision Transformers, Convolutional Neural Networks, ResNet50, RealESRGAN, LIMEAbstract
This study compares the performance of Vision Transformers (ViTs) and Convolutional Neural Networks (CNNs) for brain tumor classification on MRI scans from the Kaggle Brain Tumor MRI dataset, establishing a comprehensive benchmark for evaluation. To enhance visual fidelity without altering spatial resolution, Real-Enhanced Super Resolution Generative Adversarial Networks (Real-ESRGAN) were employed for preprocessing. After applying Real-ESRGAN, a significant improvement in classification accuracy and feature clarity was observed across all models, indicating the importance of high-quality input in medical imaging tasks. Five transformer-based models—Swin-Tiny, ViT, DeiT, Mobile-ViT, and PiT—were benchmarked against five C NN architectures, including ResNet50, EfficientNet-B0, VGG16, AlexNet, and DenseNet-121. Building on these results, a modified late-fusion ensemble combining ResNet50 and Vision Transformer was developed to integrate both global and local feature extraction capabilities. The proposed hybrid architecture achieved superior classification performance, outperforming all individual ViT and CNN models. Furthermore, Explainable AI techniques were applied using Local Interpretable Model-agnostic Explanations (LIME) to visualize decision patterns, revealing that ViTs and the late-fusion ensemble exploit broader contextual regions for tumor localization, while CNNs concentrate on more confined spatial areas. The integrated framework of Real-ESRGAN enhancement, late-fusion ensembleing, and LIME-based interpretation collectively advances both accuracy and explainability, offering a promising direction for reliable and interpretable brain tumor diagnosis in clinical applications.
Downloads
Published
Issue
Section
License
Copyright (c) 2026 AIUB Journal of Science and Engineering

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
AJSE contents are under the terms of the Creative Commons Attribution License. This permits anyone to copy, distribute, transmit and adapt the work non-commercially provided the original work and source is appropriately cited.