Automated Lumbar Disc Intensity Classification From MRI Scans Using Region-Based CNNs and Transformer Models


ULUTAŞ H., ERKOÇ M. F., Ozbay E., ŞAHİN M. E., Karakus M. O., YÜCE E.

International Journal of Imaging Systems and Technology, cilt.35, sa.6, 2025 (SCI-Expanded, Scopus) identifier

  • Yayın Türü: Makale / Tam Makale
  • Cilt numarası: 35 Sayı: 6
  • Basım Tarihi: 2025
  • Doi Numarası: 10.1002/ima.70229
  • Dergi Adı: International Journal of Imaging Systems and Technology
  • Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus, Academic Search Premier, PASCAL, Applied Science & Technology Source, Biotechnology Research Abstracts, Compendex, INSPEC
  • Anahtar Kelimeler: classification, faster R-CNN, lumbar, mask R-CNN, transformer
  • Yozgat Bozok Üniversitesi Adresli: Evet

Özet

This study explores the effectiveness of deep learning methodologies in the detection and classification of lumbar disc intensity using MRI scans. Initially, region-based deep learning frameworks, including Faster R-CNN and Mask R-CNN with different backbones such as ResNet50 and ResNet101 are evaluated. Results demonstrated that backbone selection significantly impacts model performance, with Mask R-CNN combined with ResNet101 achieving a remarkable mAP@0.50 (AP50) of 99.83%. In addition to object detection models, Transformer-based classification architectures, including MaxViT, Vision Transformer (ViT), a Hybrid CNN-ViT model, and Fine-Tuned Enhanced Pyramid Network (FT-EPN), are implemented. Among these, the Hybrid model achieved the highest classification accuracy (83.1%), while MaxViT yielded the highest precision (0.804). Comparative analyses highlighted that while Mask R-CNN models excelled in segmentation and detection tasks, Transformer-based models provided effective solutions for direct severity classification of lumbar discs. These findings emphasize the critical role of both backbone architecture and model type in optimizing diagnostic performance. The study demonstrates the potential of integrating region-based and Transformer-based models in advancing automated lumbar spine assessment, paving the way for more accurate and reliable medical diagnostic systems.