AI RESEARCH

Unified Multimodal Brain Decoding via Cross-Subject Soft-ROI Fusion

arXiv cs.LG

arXiv:2512.20249v3. Multimodal brain decoding aims to reconstruct semantic information consistent with visual stimuli from brain activity signals such as fMRI, and then to generate readable natural-language descriptions. However, it still faces key challenges in cross-subject generalization and interpretability. We propose BrainROI, a model that achieves leading results in brain-captioning evaluation on the NSD dataset.