L2V-CoT: Cross-Modal Transfer of Chain-of-Thought Reasoning via Latent Intervention

arXiv:2511.17910v2

Chain-of-Thought (CoT) reasoning has significantly enhanced the capabilities of large language models (LLMs), but Vision-Language Models (VLMs) still struggle with multi-step reasoning tasks because multimodal reasoning data is limited. To bridge this gap, researchers have explored methods for transferring CoT reasoning from LLMs to VLMs. However, existing approaches either need high