AI RESEARCH
Deep neural networks with Fisher vector encoding for medical image classification
arXiv CS.CV
•
ArXi:2605.01667v1 Announce Type: new Orderless encoding methods have shown to improve Convolutional Neural Networks (CNNs) for image classification in the context of limited availability of data. Additionally, hybrid CNN + Vision Transformers (ViT) models have been recently proposed to address CNN locality bias issues. These models outperformed CNN-only approaches. Despite that, the integration of such hybrid models with elaborated feature representation can be highly beneficial and remains large unexplored in the literature. In this context, we propose the