AI RESEARCH

Co-distilled attention guided masked image modeling with noisy teacher for self-supervised learning on medical images

arXiv CS.CV

ArXi:2604.14506v1 Announce Type: new Masked image modeling (MIM) is a highly effective self-supervised learning (SSL) approach to extract useful feature representations from unannotated data. Predominantly used random masking methods make SSL less effective for medical images due to the contextual similarity of neighboring patches, leading to information leakage and SSL simplification. Hierarchical shifted window (Swin) transformer, a highly effective approach for medical images cannot use advanced masking methods as it lacks a global [CLS] token. Hence, we.