AI RESEARCH

L2V-CoT: Cross-Modal Transfer of Chain-of-Thought Reasoning via Latent Intervention

arXiv cs.CL

arXiv:2511.17910v2 Announce Type: replace

Recently, Chain-of-Thought (CoT) reasoning has significantly enhanced the capabilities of large language models (LLMs), but Vision-Language Models (VLMs) still struggle with multi-step reasoning tasks due to limited multimodal reasoning data. To bridge this gap, researchers have explored methods to transfer CoT reasoning from LLMs to VLMs. However, existing approaches either need high