AI RESEARCH

Regularizing Attention Scores with Bootstrapping

arXiv CS.LG

arXiv:2604.01339v1 Announce Type: cross Vision transformers (ViTs) rely on an attention mechanism to weigh input features, and, therefore, attention scores have naturally been considered as explanations for their decision-making process. However, attention scores are almost always non-zero, resulting in noisy and diffused attention maps and limiting interpretability.
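
To illustrate why attention scores are almost always non-zero, here is a minimal sketch that assumes the scores come from a standard softmax over query-key logits, as is usual in transformers. The logit values and the six-patch setup are hypothetical and are not taken from the paper. The sketch only shows the problem described above, not the proposed bootstrapping method.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical similarity logits for one query over six image patches.
# Two patches are clearly relevant; the other four are not.
logits = [4.0, 3.5, 0.2, -1.0, -2.5, -3.0]
scores = softmax(logits)

# exp(.) is strictly positive, so every patch receives some attention mass,
# even the clearly irrelevant ones.
nonzero = sum(1 for s in scores if s > 0.0)

for i, s in enumerate(scores):
    print(f"patch {i}: {s:.6f}")
print(f"non-zero scores: {nonzero} of {len(scores)}")
```

Because every patch keeps a small positive weight, an attention map drawn from these scores never goes fully dark on irrelevant regions, which is one way to read the "noisy and diffused" description above.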