Attention-Enhanced Hybrid CNN–ViT Framework for Genus-Level Classification of Selected Macrofungi from Basidiospore Micrographs
APPLIED SCIENCES, cilt.16, sa.12, ss.1-33, 2026 (SCI-Expanded, Scopus)
- Yayın Türü: Makale / Tam Makale
- Cilt numarası: 16 Sayı: 12
- Basım Tarihi: 2026
- Doi Numarası: 10.3390/app16126167
- Dergi Adı: APPLIED SCIENCES
- Derginin Tarandığı İndeksler: Applied Science & Technology Source, Scopus, Science Citation Index Expanded (SCI-EXPANDED), Compendex, INSPEC, Directory of Open Access Journals
- Sayfa Sayıları: ss.1-33
- Van Yüzüncü Yıl Üniversitesi Adresli: Evet
Özet
The development of rapid and reproducible image analysis approaches that support genus-level pre-classification of macrofungi is important for taxonomic pre-evaluation and controlled microscopic data analysis. In this study, an advanced deep learning-based approach, namely the Attention-Enhanced Hybrid CNN–ViT Framework, was rigorously evaluated for genus-level classification, using basidiospore micrographs of five carefully selected macrofungal genera. The proposed approach integrates the ability of convolutional neural networks to identify local texture and contour patterns with the global context-modelling capability of Vision Transformer structures. The objective is to enhance the extraction of distinctive representations from microscopic spore images through feature fusion and attention mechanisms. A series of experiments was conducted on a curated dataset consisting of light microscopy images of the genera Agaricus, Hebeloma, Inocybe, Amanita, and Russula. The models were compared using a range of evaluation metrics, including accuracy, F1-score, MCC, ROC-AUC, and PR-AUC. The results showed that the InceptionV3 + ViT-B16 + Fusion configuration was the most successful hybrid model, achieving an accuracy of 0.9213 ± 0.0182, an F1-score of 0.9212 ± 0.0179, a Matthews correlation coefficient (MCC) of 0.9040 ± 0.0222, a receiver operating characteristic (ROC)-area under the curve (AUC) of 0.9896 ± 0.0069, and a precision-recall (PR)-AUC of 0.9684 ± 0.0192, respectively. The present findings demonstrate that basidiospore images can carry distinctive visual information for genus-level automated classification under controlled conditions. However, it is important to note that these results should not be interpreted as claims of species-level identification or field generalisability. This is due to the use of a single microscope-camera system, a single preparation protocol, and the absence of an independent external test set. The present study demonstrates that deep learning-based microscopic image analysis can be evaluated as a preliminary classification tool in macrofungal taxonomy. It also shows that such tools can provide a foundation for future work supported by specimen-level validation, external test sets, and different imaging protocols.