AI RESEARCH
ThinkOmni: Lifting Textual Reasoning to Omni-modal Scenarios via Guidance Decoding
arXiv cs.CV
arXiv:2602.23306v2 Announce Type: replace Omni-modal reasoning is essential for intelligent systems to understand and draw inferences from diverse data sources. While existing omni-modal large language models (OLLMs) excel at perceiving diverse modalities, they lack the complex reasoning abilities of recent large reasoning models (LRMs). However, enhancing the reasoning ability of OLLMs through additional